• David GerardOPMA
    link
    English
    43 months ago

    there’s a research result that the precise tokeniser makes bugger all difference, it’s almost entirely the data you put in

    because LLMs are lossy compression for text

    • @froztbyte
      link
      English
      33 months ago

      latent space go brrrr