Genius

ickplant@lemmy.world · 2 months ago

Genius

queermunist she/her@lemmy.ml · edit-2 2 months ago

It’s pretty obvious how this happened.

All the data it has been trained on said “next year is 2026” and “2027 is two years from now” and now that it is 2026 it doesn’t actually change the training data. It doesn’t know what year it is, it only knows how to regurgitate answers it was already trained on.

tauonite@lemmy.world · 2 months ago

It also happened last year if you asked if 2026 was next year, and that was at the end of last year, not beginning

buddascrayon@lemmy.world · 2 months ago

This is actually the reason why it will never actually become general AI. Because they’re not training it with logic they’re training it with gobbledy goop from the internet.

Frosty@pawb.social · 2 months ago

deleted by creator

kkj@lemmy.dbzer0.com · 2 months ago

It can’t understand logic anyway. It can only regurgitate its training material. No amount of training will make an LLM sapient.

lauha@lemmy.world · 2 months ago

Based on what?

kkj@lemmy.dbzer0.com · 2 months ago

Based on what LLMs are. They predict token (usually word) probability. They can’t think, they can’t understand, they can’t question things. If you ask one for a seahorse emoji, it has a seizure instead of just telling you that no such emoji exists.

edible_funk@sh.itjust.works · 2 months ago

Math, physics, the fundamental programming limitations of LLMs in general. If we’re ever gonna actually develop an AGI, it’ll come about along a completely different pathway than LLMs and algorithmic generative “AI”.

salacious_coaster@feddit.online · edit-2 1 month ago

deleted by creator

just_an_average_joe@lemmy.dbzer0.com · 2 months ago

This instance actually seems more like ‘context rot’, I suspect google is just shoving everything into the context window cuz their engineering team likes to brag about 10m tokens windows, but the reality is that its preeeeettty bad when you throw too much stuff.

I would expect even very small (4b params or less) models would get this question correct

Drew@sopuli.xyz · 2 months ago

nah, training data is not why it answered this (otherwise it would have training data from many different years, way more than of 2025)

0_o7@lemmy.dbzer0.com · 2 months ago

Maybe it uses the most recent date in the dataset for its reference to datetime?

queermunist she/her@lemmy.ml · edit-2 2 months ago

There’s data weights for recency, so after a certain point “next year is 2026” will stop being weighted over “next year is 2027”

It’s early in the year, so that threshold wasn’t crossed yet.