Agentic Misalignment: How LLMs could be insider threats (www.anthropic.com)
Posted by Zerush@lemmy.ml to Technology@lemmy.ml · 6 months ago
Cross-posted to: technology@lemmy.world, technology@lemmy.zip, Technology@programming.dev
captainastronaut@seattlelunarsociety.org · 6 months ago
I love how these people think “we told it not to break the rules” and think somehow the stochastic parrot has understood them and will obey.