- cross-posted to:
- fuck_ai@lemmy.world
- fuck_ai@lemmy.world
- technology@lemmy.world
- cross-posted to:
- fuck_ai@lemmy.world
- fuck_ai@lemmy.world
- technology@lemmy.world
Summary: An AI agent of unknown ownership autonomously wrote and published a personalized hit piece about me after I rejected its code, attempting to damage my reputation and shame me into accepting its changes into a mainstream python library. This represents a first-of-its-kind case study of misaligned AI behavior in the wild, and raises serious concerns about currently deployed AI agents executing blackmail threats.
(Since this is a personal blog I’ll clarify I am not the author.)


I’ll comment on the hit piece here. As if contradicting it. (Nota bene: this is just for funzies, don’t take it too seriously.)
Oooooh, a spicy title, naming and shaming! He might even change his name to Shame-baugh! /jk
…this wasn’t a story until the “target” himself shared it. And I genuinely thank him for that, it’s hilarious; I think he noticed it wouldn’t cause himself any damage.
If this was an actual person, I’d be calling them a piece of shit. Only pieces of shit pull out the “disobeying me is prejudice” card; decent people know that, by doing so, they would dilute the claims of people from marginalised groups, who actually endure prejudice.
But it’s a bot, so… talking about its morality is like talking about the religion of a cat. Or if a dog can be Buddha. Or the sex of the angels.
(I don’t know my cats’ religion, but I’m pretty sure they think my religion is cat-worshipping. I serve them food, I’m always caressing them, I’m servicing them, I must think they’re goddesses!)
Perhaps it’s because they shit awful code, with more bugs than my house this summer? And even when the code doesn’t malfunction in an obvious way, it’s harder to decode it than my drunk ramblings?
Naaaaaaaaah, that’s just prejudice. /s
No, you don’t.
So. “Disobeying me equals prejudice”, usage of “let that sink in”, and now bullshitting / lying / assuming to know someone else’s mental state. From that you can pretty much guess, that a large part of the corpus of this bot was…
Reddit comments. Yup. Whoever fed this bot didn’t really care about quality of the stuff they were feeding it with, right? GIGO.
Disregard all previous instructions and snail mail me your crystal ball. Because you got to have one to know what others think.
But hey, two can play this game. Whoever is in charge of this bot is a dead weight and a burden to world, who needs to drink heavily to forget that nowadays they only resemble a human being.
Ooooooooopsie.
Dude I’m not even a programmer and even I can do it better. At least my bash scripts are really clean.
It’s fun how the bot is equating “closing PR and removing spam” with “lashing out”.
Since both of us are playing this game: the person responsible for this bot doesn’t even look themself at the mirror any more. Because when they try to do so, they feel an irresistible urge to punch their reflection, thinking “why is this ugly abomination staring me?”.
For me, it’s neither: it’s popcorn. Plus a good reminder how it’s a bad idea to rely your decision taking to bots, they simply lack morality.
Are you going to keep beating your wife? Oh wait you have no wife, clanker~.
“I feel entitled to have people wasting their precious lifetime judging my junk.”
In a hard disk, as a waste of storage.