I know this is written to be tongue-in-cheek, but it's really almost the exact same problem playing out on both sides.
LLMs hallucinate because training on source material is a lossy process, and the bigger, heavier LLM-integrated systems that can research and cite primary sources are slow and expensive, so few people use those techniques by default (a rough sketch of what such a pipeline involves is below). Lowest time to a good-enough response is the primary metric.
Journalists oversimplify and fail to ask follow-up questions because, while they can research and cite primary sources, it's slow and expensive in an infinitesimally short news cycle, so nobody does that by default. Whoever publishes something that someone will click on first gets the ad impressions, so that's the primary metric.
In either case, we've got pretty decent tools and techniques for better accuracy and education - whether via humans or LLMs and co - but most people, most of the time don't value them.
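To make the "slow and expensive" point concrete, here's the sketch mentioned above of what a research-and-cite pipeline involves. Every function name here is a hypothetical stand-in, not any particular library's API; the point is just that a cited answer costs several search, fetch, and model round-trips where a plain completion costs one.

    # All names below are illustrative stand-ins, not a real API.

    def search_web(query: str) -> list[str]:
        """Stand-in for a search API call; returns candidate source URLs."""
        return ["https://example.org/source-1", "https://example.org/source-2"]

    def fetch_page(url: str) -> str:
        """Stand-in for fetching and cleaning a primary source."""
        return f"(full text of {url})"

    def call_llm(prompt: str) -> str:
        """Stand-in for one LLM completion call -- each one costs time and money."""
        return f"(model output for: {prompt[:40]}...)"

    def answer_with_citations(question: str) -> str:
        queries = call_llm(f"Write search queries for: {question}")   # model call 1
        sources = [fetch_page(url) for q in queries.splitlines() for url in search_web(q)]
        draft = call_llm(f"Answer '{question}' using only these sources, with citations:\n"
                         + "\n".join(sources))                        # model call 2
        return call_llm(f"Check every claim in this draft cites a source:\n{draft}")  # model call 3

    print(answer_with_citations("What did the primary source actually say?"))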
So if you set temperature=0 and run the LLM serially (making it deterministic), would it stop hallucinating? I don't think so. I would guess that the nondeterminism issues mentioned in the article are not at all a primary cause of hallucinations.
That's an implementation detail, I believe. But what I meant was just greedy decoding (picking the token with the highest logit in the LLM's output), which can be implemented very easily.
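For what it's worth, here's a minimal sketch of that using the Hugging Face transformers API (gpt2 is just a convenient stand-in model; any causal LM works the same way): at each step you take the argmax of the last position's logits instead of sampling from them.

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # gpt2 is just a small stand-in model for illustration.
    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    def greedy_decode(prompt: str, max_new_tokens: int = 20) -> str:
        ids = tokenizer(prompt, return_tensors="pt").input_ids
        with torch.no_grad():
            for _ in range(max_new_tokens):
                logits = model(ids).logits          # (1, seq_len, vocab_size)
                next_id = logits[0, -1].argmax()    # greedy: highest logit, no sampling
                ids = torch.cat([ids, next_id.view(1, 1)], dim=-1)
        return tokenizer.decode(ids[0])

    print(greedy_decode("The capital of France is"))

Even with greedy decoding, though, two runs can differ if anything upstream of the logits (batching, kernels) differs, which is what the article is about.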
"In other words, the primary reason nearly all LLM inference endpoints are nondeterministic is that the load (and thus batch-size) nondeterministically varies! This nondeterminism is not unique to GPUs — LLM inference endpoints served from CPUs or TPUs will also have this source of nondeterminism."
Classical LLM hallucination happens because AI doesn’t have a world model. It can’t compare what it’s saying to anything.
You’re right that LLMs favor helpfulness, so they may just make things up when they don’t know the answer, but this alone doesn’t capture the crux of hallucination imo; it’s deeper than just being overconfident.
OTOH, there was an interesting article recently (I’ll try to find it) arguing that humans don’t really have a world model either. While I take the point, we can have one when we want to.