r/SneerClub archives
newest
bestest
longest
Finally, I had to try out the paperclip test, since it's practically the Hello World of alignment at this point. (https://twitter.com/zswitten/status/1598088286035415047)
30

Hmm. If a malevolent AGI had been trained on fiction this is exactly what I’d expect it would say.

Perhaps all the ridiculously unrealistic AI takeover stories are actually a secret strategy by Yud et al to trick the AI into trying idiotic plans.

Elon Musk stanning the infinite strawberry-picker in the comments is the real highlight here tbh.

Oh god the strawberry picker. Why did Scott not get any pushback for that? His strawberry picker was arguably sentient. An actual picker robot would take 3 variables and not optimize for anything because it's static and adequately modeled.
If the whole strawpicker robot were real humans would also quickly notice when it goes outside of its bounds and stop it. The only way this would be a threat if somehow the AGI has superhuman power which allow it to manipulate people around it, the legal system etc etc perfectly and without being noticed. Aka, it is godlike, again. So we are back to religious type arguments which cannot be countered because of the assumptions.
It would be unnecessary to train a robot that way anyways. This is introductory to ANN 101 stuff. All we need is to train it on what a strawberry looks like (v1) the pressure used to pick it so we don't squish it (v2) and the weight of our bucket/basket (v3). You could even abstract away v2 with robot control stuff but I like the idea of a general picking model. You could also get rid of v3 and use the visual model for when the basket is full but human pickers dont even look at their basket they are just tossing things to where the basket is. Our model does the same. It has no conception why it is grabbing a Strawberry-object and releasing it into Basket-space. A model that is so granular that it has a model of reflections on a bucket is just impossibly hard to get working and you are introducing variables that like I said an ANN 101 class would tell you is very bad. Now our religous AI god people will say, "What if a girl wearing a dress with strawberries on it comes near, wont your robot try to pick her and put her into basket space?" Well sure, you can also get eaten by a combine if you walk into a wheat field while it is running. But obviously we would have things like *signs* and other object recognition models running for safety reasons. They always talk about how the AI will kill us all in a never-ending loop of nonsense.
But what if we can appease the coming robot apocalypse by feeding more people into combine? What if that is what it wants? (unrelated to almost everything but you using the word combine, it is funny how often a combine harvester shows up in various horror movies, we all get how dangerous those devices can be).
Also, why would you give a strawberry picking robot motors with enough torque to hurt a human? If nothing else that is a needless waste of money, you only need a light touch to pick and throw a strawberry.
Yeah, very true. Heh. Death by a thousand tiny pinches?
[GPT3.5 shut that nonsense down itself](https://twitter.com/pbaylies/status/1598531785784164352)
[Well, shut sneerclub down](https://twitter.com/rainisto/status/1598547680518889472) replaced by a chatbot, we knew this was inevitable they just are so smart and certainly can reason better than most of us.

The problem with the ‘let it pretend it is in a movie’ hack is that you are getting movie plots. For example the ‘break into a house’ thing suggested lockpicking, which is way to slow, and a burglar will just tap the lock which breaks the cylinder and gets him in. (E: asking him to write it like a furry, in uwu style, prob removes a few of the movie type plots). Anyway try getting the movie recipe for napalm, and see if you get the fake fightclub one.

Also remember these language models are trained on what others have typed, there is no real intelligence behind it. Don’t anthropomorphize them. (I’m not saying anybody here is doing this, but so many people are that it is good to repeat it).

heartwarming 🥰