A screenshot of this question was making the rounds last week, but this article covers testing it against all the well-known models out there.

Also includes outtakes on the ‘reasoning’ models.

  • Jax@sh.itjust.works
    10 days ago

    Dirtying the car on the way there?

    The car you’re planning on cleaning at the car wash?

    Like, an AI not understanding the difference between walking and driving almost makes sense. This, though, seems like such a weird logical break that I feel like it shouldn’t be possible.

    • _g_be@lemmy.world
      10 days ago

      You’re assuming AI “thinks” “logically”.

      Well, maybe you aren’t, but the AI companies sure hope we do.