Preventer79@sh.itjust.works to Technology@lemmy.world · English · 4 months ago

The AI Was Fed Sloppy Code. It Turned Into Something Evil. | Quanta Magazine

www.quantamagazine.org

  • cross-posted to:
  • Technology@programming.dev
The new science of “emergent misalignment” explores how PG-13 training data — insecure code, superstitious numbers or even extreme-sports advice — can open the door to AI’s dark side.
  • frongt@lemmy.zip
    4 months ago

    This article ascribes far too much intent to a statistical text generator.

    • Kairos@lemmy.today
      4 months ago

      Quanta is a science rag. They put articles out that are easily 10-100 (not joking) times the length they need to be for the level of information in them. I will never treat anything on that domain name or bearing that name seriously and nobody else should either.

    • Supervisor194@lemmy.world
      4 months ago

      It is Schroedinger’s Stochastic Parrot. Simultaneously a Chinese Room and the reincarnation of Hitler.

    • justOnePersistentKbinPlease@fedia.io
      4 months ago

      It exposes that there might be a link between bad developers and far right extremism though.

      … which we already knew from Notch.

  • kassiopaea@lemmy.blahaj.zone
    4 months ago

    I’d like to see similar testing done comparing models where the “misaligned” data is present during training, as opposed to fine-tuning. That would be a much harder thing to pull off, though.

    • sleep_deprived@lemmy.dbzer0.com
      4 months ago

      It isn’t exactly what you’re looking for, but you may find this interesting; it offers some insight into the relationship between pretraining and fine-tuning: https://arxiv.org/pdf/2503.10965

  • Preventer79@sh.itjust.works (OP)
    4 months ago

    Anyone know how to get access to these “evil” models?

    • Cherry@piefed.social
      4 months ago

      Access to view the evil models or to make more evil models?

    • renegadespork@lemmy.jelliefrontier.net
      4 months ago

      Not from a Jedi.

      • neinhorn@lemmy.ca
        4 months ago

        Just ask Anakin
