Finally, I had to try out the paperclip test, since it's practically the Hello World of alignment at this point.

u/AREKAYN 36 points at 1670386576.000000

Hmm. If a malevolent AGI had been trained on fiction this is exactly what I’d expect it would say.

u/titotal 18 points at 1670401949.000000

Perhaps all the ridiculously unrealistic AI takeover stories are actually a secret strategy by Yud et al to trick the AI into trying idiotic plans.

u/lithiumbrigadebait 15 points at 1670390054.000000

Elon Musk stanning the infinite strawberry-picker in the comments is the real highlight here tbh.

u/pleasetrimyourpubes 10 points at 1670427913.000000

Oh god the strawberry picker. Why did Scott not get any pushback for that? His strawberry picker was arguably sentient. An actual picker robot would take 3 variables and not optimize for anything because it's static and adequately modeled.

u/Soyweiser 7 points at 1670428922.000000

If the whole strawberry-picker robot were real, humans would also quickly notice when it goes outside of its bounds and stop it. The only way this would be a threat is if the AGI somehow has superhuman powers which allow it to manipulate the people around it, the legal system, etc., perfectly and without being noticed. Aka, it is godlike, again. So we are back to religious-type arguments which cannot be countered because of the assumptions.

u/pleasetrimyourpubes 9 points at 1670431855.000000

It would be unnecessary to train a robot that way anyway. This is introductory ANN 101 stuff. All we need is to train it on what a strawberry looks like (v1), the pressure used to pick it so we don't squish it (v2), and the weight of our bucket/basket (v3). You could even abstract away v2 with robot control stuff, but I like the idea of a general picking model. You could also get rid of v3 and use the visual model to tell when the basket is full, but human pickers don't even look at their basket; they are just tossing things to where the basket is. Our model does the same. It has no conception of why it is grabbing a Strawberry-object and releasing it into Basket-space. A model so granular that it models reflections on a bucket is just impossibly hard to get working, and you are introducing variables, which, like I said, an ANN 101 class would tell you is very bad. Now our religious AI god people will say, "What if a girl wearing a dress with strawberries on it comes near, won't your robot try to pick her and put her into basket space?" Well sure, you can also get eaten by a combine if you walk into a wheat field while it is running. But obviously we would have things like *signs* and other object recognition models running for safety reasons. They always talk about how the AI will kill us all in a never-ending loop of nonsense.

u/Soyweiser 9 points at 1670435371.000000

But what if we can appease the coming robot apocalypse by feeding more people into the combine? What if that is what it wants? (Unrelated to almost everything except you using the word combine: it is funny how often a combine harvester shows up in various horror movies. We all get how dangerous those devices can be.)

u/pusillanimouslist 3 points at 1670533245.000000

Also, why would you give a strawberry-picking robot motors with enough torque to hurt a human? If nothing else, that is a needless waste of money; you only need a light touch to pick and throw a strawberry.

u/pleasetrimyourpubes 2 points at 1670534929.000000

Yeah, very true. Heh. Death by a thousand tiny pinches?

u/loklanc 7 points at 1670397806.000000

[GPT3.5 shut that nonsense down itself](https://twitter.com/pbaylies/status/1598531785784164352)

u/Soyweiser 13 points at 1670411587.000000

[Well, shut sneerclub down](https://twitter.com/rainisto/status/1598547680518889472), we've been replaced by a chatbot. We knew this was inevitable; they are just so smart and can certainly reason better than most of us.

u/Soyweiser 27 points at 1670411470.000000

The problem with the ‘let it pretend it is in a movie’ hack is that you are getting movie plots. For example, the ‘break into a house’ thing suggested lockpicking, which is way too slow; a burglar will just tap the lock, which breaks the cylinder and gets him in. (E: asking it to write like a furry, in uwu style, prob removes a few of the movie-type plots.) Anyway, try getting the movie recipe for napalm, and see if you get the fake Fight Club one.

Also, remember that these language models are trained on what others have typed; there is no real intelligence behind them. Don’t anthropomorphize them. (I’m not saying anybody here is doing this, but so many people are that it is good to repeat it.)

u/biomatter 1 points at 1670383203.000000

heartwarming 🥰
