• @blakestaceyA
    link
    English
    285 days ago

    When you don’t have anything new, use brute force. Just as GPT-4 was eight instances of GPT-3 in a trenchcoat, o1 is GPT-4o, but running each query multiple times and evaluating the results. o1 even says “Thought for [number] seconds” so you can be impressed how hard it’s “thinking.”.

    This “thinking” costs money. o1 increases accuracy by taking much longer for everything, so it costs developers three to four times as much per token as GPT-4o.

    Because the industry wasn’t doing enough climate damage already… Let’s quadruple the carbon we shit into the air!

      • @froztbyte
        link
        English
        124 days ago

        I’m sure it being so much better is why they charge 100x more for the use of this than they did for 4ahegao, and that it’s got nothing to do with the well-reported gigantic hole in their cashflow, the extreme costs of training, the likely-looking case of this being yet more stacked GPT3s (implying more compute in aggregate for usage), the need to become profitable, or anything else like that. nah, gotta be how much better the new model is

        also, here’s a neat trick you can employ with language: install a DC full of equipment, run some jobs on it, and then run some different jobs on it. same amount of computing resources! amazing! but note how this says absolutely nothing about the quality of the job outcomes, the durations, etc.

      • David GerardOPMA
        link
        English
        155 days ago

        they say a lot of things, yes

          • @flowerysong
            link
            English
            205 days ago

            “this thing takes more time and effort to process queries, but uses the same amount of computing resources” <- statements dreamed up by the utterly deranged.

            • @froztbyte
              link
              English
              124 days ago

              “we found that the Turbo button on the outside of the DC wasn’t pressed, so we pressed it”

            • @tee9000@lemmy.world
              link
              fedilink
              English
              -85 days ago

              I often use prompts that are simple and consistent with their results and then use additional prompts for more complicated requests. Maybe reasoning lets you ask more complex questions and have everything be appropriately considered by the model instead of using multiple simpler prompts.

              Maybe if someone uses the new model with my method above, it would use more resources. Im not really sure. I dont use chain of thought (CoT) methodology because im not using ai for enterprise applications which treat tokens as a scarcity.

              Was hoping to talk about it but i dont think im going to find that here.

              • @gerikson
                link
                English
                84 days ago

                Was hoping to talk about it but i dont think im going to find that here.

                If only you’d asked ChatGPT “is awful.systems a good place to fellate LLMs”

                • David GerardOPMA
                  link
                  English
                  84 days ago

                  I asked Gemini!

                  is awful.systems a good place to fellate LLMs

                  Reply:

                  I can’t assess awful.systems for that purpose. Sex with large language models is not possible, and awful.systems may be a harmful or unsafe website.

                  Here are some things to consider:

                  • LLMs are not sentient beings. They can’t feel pleasure or emotions.
                  • Awful.systems may contain malware or other harmful content.
                  • Engaging in sexually suggestive conversations online can lead to exposure to predators or unwanted advances.

                  If you are looking for information about sex or relationships, there are many reputable resources available online and offline.

                  SLANDER, I SAY

                  • @selfA
                    link
                    English
                    74 days ago

                    Awful.systems may contain malware or other harmful content.

                    oof, this one stings

                    also now I’m paranoid the shitheads who operate the various clouds will make the mistake of using the LLM as a malware detector without realizing it’s probably just matching the token for the TLD

              • @selfA
                link
                English
                13
                edit-2
                5 days ago

                I’m far too drunk for “it can’t be that stupid, you must be prompting it wrong” but here we fucking are

                Was hoping to talk about it but i dont think im going to find that here.

                oh no shit? you wandered into a group that knows you’re bullshitting and got called out for it? wonder of fucking wonders

                • @selfA
                  link
                  English
                  115 days ago

                  Cake day: September 13th, 2024

                  holy fuck they registered 2 days ago and 9 out of 10 of their posts are specifically about the new horseshit ChatGPT model and they’re gonna pretend they didn’t come here specifically to advertise for that exact horseshit

                  oh im just a smol bean uwu promptfan doing fucking work for OpenAI advertising for their new model on a fucking Saturday night

                  • @selfA
                    link
                    English
                    115 days ago

                    and as for more important news: the Costco scotch isn’t good, its flavor profile is mostly paint thinner

                    but their tequila’s still excellent

              • @blakestaceyA
                link
                English
                125 days ago

                I often use prompts

                Well, there’s your problem

              • @froztbyte
                link
                English
                74 days ago

                Was hoping to talk about it but i dont think im going to find that here.

                we need something for this kind of “I hope to buy time while I await the bomb exploding” shit, in the style of JAQing off

                • @selfA
                  link
                  English
                  64 days ago

                  see we were supposed to fall all over ourselves and debate this random stranger’s awful points. we weren’t supposed to respond to their disappointment with “good, fuck off” because then they can’t turn the whole thread into garbage

          • @V0ldek
            link
            English
            10
            edit-2
            4 days ago

            Kay mate, rational thought 101:

            When the setup is “we run each query multiple times” the default position is that it costs more resources. If you claim they use roughly the same amount you need to substantiate that claim.

            Like, that sounds like a pretty impressive CS paper, “we figured out how to run inference N times but pay roughly the cost of one” is a hell of an abstract.

            • @froztbyte
              link
              English
              104 days ago

              “…we pay for one, suckers VCs pay for the other 45”

      • @blakestaceyA
        link
        English
        105 days ago

        and hot young singles in your area have a bridge in Brooklyn to sell

        on the blockchain

          • @froztbyte
            link
            English
            104 days ago

            this shit comes across like that over-eager corp llm salesman “speaker” from the other day