we appear to be the first to write up the outrage coherently too. much thanks to the illustrious @self

  • @Lumisal@lemmy.world
    link
    fedilink
    English
    -22 months ago

    Mistral isn’t trained on copy righted data. It’s based off selective databases that were open use. This article in general is full of false information. But I suppose most people only read the headlines.

      • Steve
        link
        English
        13
        edit-2
        2 months ago

        We had a social contract!

        Mustafa Suleyman

      • @froztbyte
        link
        English
        92 months ago

        but it’s apache2 sega! tooooootes freebies!

        • Steve
          link
          English
          112 months ago

          if you’re not gonna read the fucken thing then fuck off.

          • @Lumisal@lemmy.world
            link
            fedilink
            English
            -52 months ago

            I did read the thing, then provided an article explaining why detecting copyrighted material / determining if something is written by AI is very inaccurate.

            Perhaps take your own advice to “read the fucken thing” next time instead of making yourself look like an idiot. Though I doubt you’ve ever heard of “better to stay silent and let them think you the fool than to speak and remove all doubt”.

            Btw, I even recall that Ars specifically covered the company you linked to in a separate article as well. I’d be glad to provide it once you’ve come to your senses and want to discuss things like an adult.

            • @froztbyte
              link
              English
              142 months ago

              Mistral’s Mixtral-8x7B-Instruct-v0.1 produced copyrighted content on 22% of the prompts.

              did you know that a lesser-known side effect of the infinite monkeys approach is that they will produce whole sections of copyright content abso-dupo-lutely by accident? wild, I know! totes coinkeedink!

              I’d be glad to provide it once you’ve come to your senses and want to discuss things like an adult

              jesus fucking christ you must be a fucking terrible person to work with

              I’ve seen toddlers throw more mature tantrums

              • Steve
                link
                English
                92 months ago

                she wrote harry potter with an llm, didn’t she?

              • @Lumisal@lemmy.world
                link
                fedilink
                English
                -62 months ago

                I’m too old to discuss against bad faith arguments.

                Especially with people who won’t read the information I provide them showing their initial information was wrong.

                One is a company that has something to sell, the other an article with citations showing why it’s not easy to determine what percentage of a data set is infringing on copyright, or whether exact reproduction via “fishing expedition” prompting is a useful metric to determine if unauthorized copyright was used in training.

                The dumbest take though is attacking Mistral of all LLMs, even though it’s on an Apache 2.0 license.

                • @froztbyte
                  link
                  English
                  11
                  edit-2
                  2 months ago

                  I’ve read the article you’ve posted: it does not refute the fucking datapoint provided, it literally DOES NOT EVEN MENTION MISTRAL AT ALL.

                  so all I can tell you is to take your pearlclutching tantrum bullshit and please fuck off already

                  • @selfA
                    link
                    English
                    112 months ago

                    god these weird little fuckers’ ability to fill a thread with garbage is fucking notable isn’t it? something about loving LLMs makes you act like an LLM. how depressing for them.

                • Steve
                  link
                  English
                  82 months ago

                  chatgpt gets it

            • Steve
              link
              English
              112 months ago

              you’re conflating “detecting ai text” with “detecting an ai trained on copyrighted material”

              send the relevant article or shut up

              • @Lumisal@lemmy.world
                link
                fedilink
                English
                -82 months ago

                Ignoring the logical inconsistency you just spouted for a moment (can’t tell if it’s written by AI but knows it used copyrighted material? Do you not hear yourself?), you do realize Mistral is released under the Apache 2.0 license, a highly permissive scheme that has no restrictions on use or reproduction beyond attribution, right?

                I think it’s clear you’re arguing in bad faith however with no intention of changing your misinformed opinion at this point. Perhaps you’d enjoy an echo chamber like the “fuckai” Lemmy instance.

                • @selfA
                  link
                  English
                  122 months ago

                  wait a minute… there’s another “fuck ai” instance and they’ve already told you to go fuck yourself?

                  I wonder if they want to be friends

                  • @froztbyte
                    link
                    English
                    10
                    edit-2
                    2 months ago

                    have seen one on lemmy.world. it’s kinda the dancing baby version of the stubsack and techtakes tho from what I can tell

                • @selfA
                  link
                  English
                  92 months ago

                  holy shit you really are quite dumb. the fuck is wrong with you?

                  actually don’t answer that

                • Steve
                  link
                  English
                  82 months ago

                  You are quite dumb.

                  • @froztbyte
                    link
                    English
                    92 months ago

                    the reading comprehension of a llm and the contextual capacity of a gnat