• David GerardOPMA
    link
    English
    420 days ago

    there’s a research result that the precise tokeniser makes bugger all difference, it’s almost entirely the data you put in

    because LLMs are lossy compression for text

    • @froztbyte
      link
      English
      320 days ago

      latent space go brrrr