More

AJRF · 2025-11-24T20:29:36 1764016176

that chart at the start is egregious

tildef · 2025-11-24T21:23:31 1764019411

Feels like a tongue-in-cheek jab at the GPT-5 announcement chart.

AJRF · 2025-11-24T16:47:18 1764002838

The attempts at controlling the narrative feel a lot more unsubtle since Musk took over.

I bet dollars to donuts that they are tipping the scale on stoking up tensions on UK users with things like migration and class division.

I only follow tech people on twitter, but if you looked at my FYP you'd think I was deeply interested in UK politics - which I am not!

lavezzi · 2025-11-24T23:22:17 1764026537

https://twitter.com/settings/your_twitter_data/twitter_inter...

think you'll be surprised in what has been signed for on your behalf.

Havoc · 2025-11-24T21:16:20 1764018980

Yup. Not just x though. On insta Even a slight misstep and you’re up to your eyeballs in anti migrant content from the algo

AJRF · 2025-11-18T08:30:25 1763454625

Very uncool of Eric! Thank you for the work you've put in over the years.

AJRF · 2025-11-13T20:17:10 1763065030

Do you think sampling is deterministic?

randomgermanguy · 2025-11-13T20:24:41 1763065481

Topk sampling with temp = 0 should be pretty much deterministic (ignoring floating-point errors)

AJRF · 2025-11-13T20:25:06 1763065506

> Ignoring floating point errors.

I think you mean non-associativity.

And you can’t ignore that.

sunrunner · 2025-11-13T20:28:23 1763065703

Ignoring floating point errors, assuming a perfectly spherical cow, and taking air resistance as zero.

AJRF · 2025-11-13T20:34:02 1763066042

Imagine you are predicting the next token, you have two tokens very close in probability in the distribution, kernel execution is not deterministic because of floating point non-associativity - the token that gets predicted impacts the tokens later in the prediction stream - so it's very consequential which one gets picked.

This isn't some hypothetical - it happens all the time with LLM's - it isn't some freak accident that isn't probable

randomgermanguy · 2025-11-13T20:48:26 1763066906

Okay yes, but would you really say that the main part of non-determinism in LLM-usage stems from this ? No its obviously the topk sampling.

I don't think my tech-lead was trying to suggest the floating-point error/non-associativity was the real source.

AJRF · 2025-11-13T21:04:57 1763067897

> Would you really say that the main part of non-determinism in LLM-usage stems from this

Yes I would because it causes exponential divergence (P(correct) = (1-e)^n) and doesn't have a widely adopted solution. The major labs have very expensive researchers focused on this specific problem.

There is a paper from Thinking Machines from September around Batch Invariant kernels you should read, it's a good primer on this issue of non-determinism in LLM's, you might learn something from it!

Unfortunately the method has quite a lot of overhead, but promising research all the same.

randomgermanguy · 2025-11-13T21:21:11 1763068871

Alright fair enough.

I dont think this is relevant to the main-point, but it's definitely something I wasn't aware of. I would've thought it might have an impact on like O(100)th token in some negligible way, but glad to learn.

AJRF · 2025-11-03T13:21:01 1762176061

This happened to me last night! I was going to bed and I clicked Update and Shut Down, then I went in to the other room.

After a few minutes I could see the blue glow of my Windows background shining on the wall.

Glad it is fixed!

AJRF · 2025-10-21T22:33:58 1761086038

Hey, by all means go ahead - my personal reasons for doing it this way;

- Self contained dependencies managed for me by the maintainer of the image.

- I already run docker and keep all my configuration in a git folder that is structured in a way my brain works

- I already run watchtower which updates container for me automatically

- Other containers can use this container to send mail

- sendmail.cf scares me.

- filesystem isolation if it gets pwned

AJRF · 2025-10-21T18:39:41 1761071981

Does this kind of thing happen to China + Russia?

I don't see news about that much - but to be fair, I am not looking for it.

enkonta · 2025-10-21T18:40:40 1761072040

They may also be less likely to admit it or allow any reporting on it

ThinkBeat · 2025-10-21T19:07:56 1761073676

yes. but it doesn't get covered by western media. much like how NATO airplanes violating Russian airspace is not reported about either.

mmooss · 2025-10-22T08:10:04 1761120604

> much like how NATO airplanes violating Russian airspace is not reported about either.

How do you know it's happening?

tryauuum · 2025-10-21T19:32:46 1761075166

Yes, recently some russian airline was hacked, they also used microsoft mail servers

AJRF · 2025-10-19T15:08:46 1760886526

Added a point about that, thanks.

AJRF · 2025-10-19T13:19:12 1760879952

I think iterating on hypothesis to try uncover the truth is a better use of time than saying everything is too complex and giving up.

api · 2025-10-19T13:24:00 1760880240

I agree, but that’s not what people do. People usually fixate on one preferred explanation and then give up. Usually it’s the explanation that confirms their prejudices and biases.

I don’t think doom scrolling is healthy. I just doubt that it’s a single explanation.

AJRF · 2025-10-19T13:18:13 1760879893

Thanks - I don't really take the comments as entirely negative, people want rigour and I agree the point could be made more convincingly.

I would love for a proper study of this hypothesis to be done.