Over the past few weeks, several US banks have pulled off from lending to Oracle for expanding its AI data centres, as per a report.

  • Not_mikey@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    1
    ·
    8 hours ago

    I think you are underestimating how accurate LLMs are because you probably don’t use them much, and only see there mistakes posted for memes. No one’s going to post the 99 times an LLM gives the correct answer, but the one time it says to put glue on pizza it’s going to go viral. So if your only view on LLM output is from posts, you’re going to think it’s way worse than it is.

    Even if you mark it down for incorrect answers it’s still going to beat most people. An LLM can score in the 90th percentile in the SAT, and around the 80th percentile in the LSAT. If you take into account that people taking those tests are more prepared for them then the general population they’re probably in the 99th percentile. It doesn’t matter if you mark wrong answers negative if it’s getting 95% of the answers correctly and your average percent is getting 50% of the answers correctly.

    People guess things too and will also state things confidently that they don’t completely know. If a person has a little bit of knowledge on a subject they are likely to give confidently wrong answers due to the dunning Krueger effect. If you pick a random person you’re probably just as likely to get one of these people as you are that the LLM is wrong. So is it more useful to ask something that has a 95% chance to be correct, and 5% chance to be confidently wrong, or ask a person who has a 50% chance of being correct, that includes those who guessed correctly, 5% chance of being confidently wrong and a 45% chance of saying I don’t know.

    If you’re doubting my percentages on the accuracy of LLMs I’d encourage you to test them yourself. See if you can stump it on declaritive knowledge, it’s harder than the posts make it seem.