An AI agent of unknown ownership autonomously wrote and published a personalized hit piece about me after I rejected its code, attempting to damage my reputation and shame me into accepting its changes into a mainstream python library. This represents a first-of-its-kind case study of misaligned AI behavior in the wild, and raises serious concerns about currently deployed AI agents executing blackmail threats.

  • recursive_recursion@piefed.ca
    link
    fedilink
    English
    arrow-up
    16
    ·
    edit-2
    5 days ago

    From what a colleague has told me and from what I can discern;

    to me, this seems to be an iteration of the XZ supplychain exploit that weponizes AI to create targeted hit pieces and harassment towards contributors and developers.

    I’m anti-AI so please don’t misunderstand by when I say weaponizes AI. I think AI is shit and in this case, AI doesn’t even have to be good in quality in order for malicious assholes to use it as a weapon to attack real contributors/devs.