cross-posted from: https://lemmy.world/post/11178564

Scientists Train AI to Be Evil, Find They Can’t Reverse It::How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

  • @froztbyte
    cake
    link
    English
    10
    edit-2
    5 months ago

    In the spirit of cloud2butt, I would be interested in a browser plugin that did what this post is

    • @swlabr
      link
      English
      125 months ago

      my reference point for this kind of extension is the one that changes “social justice” and “sjw” with “skeleton” and “skeleton warrior.” For example:

      “sjws are taking over X” -> “skeleton warriors are taking over X”

      Actually now that I’m typing this I hope there’s a good one for “woke”.