• 1 Post
  • 254 Comments
Joined 2 years ago
Cake day: August 9th, 2023


  • I don’t doubt you could effectively automate script-kiddie attacks with Claude Code. That’s what their diagram seems to show.

    The whole bit about “oh no, the user said weird things and bypassed our imaginary guard rails” is another admission that “AI safety” is a complete joke.

    “We advise security teams to experiment with applying AI for defense in areas like Security Operations Center automation, threat detection, vulnerability assessment, and incident response.”

    there it is.

    Does this article imply that Anthropic is monitoring everyone’s Claude Code usage to see if they’re doing naughty things? Other agents and models exist, so whatever safety bullshit they have is pure theater.




  • Ugh. Hank Green just posted a 1-hour interview with Nate Soares about That Book. I’m halfway through on 2x speed and so far zero skepticism of That Book’s ridiculous premises. I know it’s not his field but I still expected a bit more from Hank.

    A YouTube comment says it better than I could:

    “Yudkowsky and his ilk are cranks.

    I can understand being concerned about the problems with the technology that exist now, but hyper-fixating on an unfalsifiable existential threat is stupid as it often obfuscates from the real problems that exist and are harming people now.”