Over the past few weeks, several US banks have pulled off from lending to Oracle for expanding its AI data centres, as per a report.

  • Not_mikey@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    16 hours ago

    They are good at making declarative statements.
    That’s not the same thing.

    What’s the difference between making correct declaritive statements and having declaritive knowledge? If I am able to accurately state every president of the US, wouldn’t you say I have knowledge of the list of US presidents? The only way you can judge my declaritive knowledge of something is by my ability to make accurate declaritive statements, that’s what a test is. If making accurate declaritive statements is not the measure of declaritive knowledge then what is?

    An LLM will give more accurate declaritive statements on more question then any human can, would that not mean that an LLM has more declaritive knowledge than any human? So is it not more trustworthy for giving declaritive statements than any random human? Would you not trust an LLMs answer on who the 4th president is over a random human?

    • CileTheSane@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      ·
      16 hours ago

      An LLM will give more accurate declaritive statements on more question then any human can

      Not if you include “I don’t know” as an accurate statement or penalize the score for incorrect declarative statements.

      So is it not more trustworthy for giving declaritive statements than any random human? Would you not trust an LLMs answer on who the 4th president is over a random human?

      I would absolutely trust the random human more because they’re not going to make shit up if they don’t know. It will either be “I don’t know” or “I would guess” to make it clear they aren’t confident. The LLM will give me a declarative answer but I have no fucking clue if it’s accurate or an “hallucination” (lie). I’ll need to do what I should have done in the first place and ask a search engine to make sure.

      • Not_mikey@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        8 hours ago

        I think you are underestimating how accurate LLMs are because you probably don’t use them much, and only see there mistakes posted for memes. No one’s going to post the 99 times an LLM gives the correct answer, but the one time it says to put glue on pizza it’s going to go viral. So if your only view on LLM output is from posts, you’re going to think it’s way worse than it is.

        Even if you mark it down for incorrect answers it’s still going to beat most people. An LLM can score in the 90th percentile in the SAT, and around the 80th percentile in the LSAT. If you take into account that people taking those tests are more prepared for them then the general population they’re probably in the 99th percentile. It doesn’t matter if you mark wrong answers negative if it’s getting 95% of the answers correctly and your average percent is getting 50% of the answers correctly.

        People guess things too and will also state things confidently that they don’t completely know. If a person has a little bit of knowledge on a subject they are likely to give confidently wrong answers due to the dunning Krueger effect. If you pick a random person you’re probably just as likely to get one of these people as you are that the LLM is wrong. So is it more useful to ask something that has a 95% chance to be correct, and 5% chance to be confidently wrong, or ask a person who has a 50% chance of being correct, that includes those who guessed correctly, 5% chance of being confidently wrong and a 45% chance of saying I don’t know.

        If you’re doubting my percentages on the accuracy of LLMs I’d encourage you to test them yourself. See if you can stump it on declaritive knowledge, it’s harder than the posts make it seem.