Most people didn't think we were anywhere close to LLMs five years ago. The capabilities we have now were expected to be decades away, depending on who you talked to. [EDIT: sorry, I should have said 10 years ago... recent years get too compressed in my head and stuff from 2020 still feels like it was 2 years ago!]
So I think a lot of people now don't see what the path is to AGI, but also realize they hadn't seen the path to LLMs, and innovation is coming fast and furious. So the most honest answer seems to be: it's entirely plausible that AGI just depends on another couple of conceptual breakthroughs that are imminent... and it's also entirely plausible that AGI will require 20 different conceptual breakthroughs all working together that we'll only figure out decades from now.
True honesty requires acknowledging that we truly have no idea. Progress in AI is happening faster than ever before, but nobody has the slightest idea how much progress is needed to get to AGI.
What people thought about LLMs five years ago and how close we are to AGI right now are unrelated, and it's not logically sound to say "We were close to LLMs then, so we are close to AGI now."
It's also a misleading view of the history. It's true "most people" weren't thinking about LLMs five years ago, but a lot of the underpinnings had been studied since the 70s and 80s. The ideas had been worked out, but the hardware wasn't able to handle the processing.
> True honesty requires acknowledging that we truly have no idea. Progress in AI is happening faster than ever before, but nobody has the slightest idea how much progress is needed to get to AGI.
> Most people didn't think we were anywhere close to LLMs five years ago.
That's very ambiguous. "Most people" don't know most things. If we're talking about people who have been working in the industry, though, my understanding is that the concepts behind our modern-day LLMs aren't magical at all. In fact, the ideas have been around for quite a while; the breakthroughs in processing power and in networking (data) were the holdup. The result definitely feels magical to "most people," though, for sure. Right now we're "iterating," right?
I'm not sure anyone really sees a clear path to AGI if what we're actually talking about is the singularity. There are a lot of unknown unknowns, right?
AGI is a poorly defined concept because intelligence is a poorly defined concept. Everyone knows what intelligence is... until we attempt to agree on a common definition.
Not sure what history you're suggesting I check? I've been following NLP for decades. Sure, neural nets have been around for many decades, and deep learning for most of this century. But the explosive success of what LLMs can do now came as a huge surprise. Transformers date to just 2017, and the idea that they would be this successful just by throwing gargantuan amounts of data and processing at them -- this was not a common viewpoint. So I stand by the main point of my original comment, except I did just now edit it to say 10 years ago rather than 5... the point is, it really did seem to come out of nowhere.
GPT-3 existed 5 years ago, and the trajectory was set with the transformer paper. Everything from the transformer paper to GPT-3 was pretty much anticipated in that paper; it just took people spending the effort and compute to make it reality. The only real surprise was how fast OpenAI productized an LLM into a chat interface with ChatGPT; before then we had fine-tuned GPT-3 models doing specific tasks (translation, summarization, etc.).
At this point, AGI seems to be more of a marketing beacon than any sort of non-vague deterministic classification.
We all imagined a future where AI just woke up one day; realistically, what we got instead were philosophical debates over whether the ability to finally order a pizza constitutes true intelligence.
Notwithstanding the fact that AGI is a significantly higher bar than "LLM", this argument is illogical.
Nobody thought we were anywhere close to me jumping off the Empire State Building and flying across the globe 5 years ago, but I'm sure I will. Wish me luck as I take that literal leap of faith tomorrow.
what's super weird to me is how people seem to look at LLM output and see:
"oh look it can think! but then it fails sometimes! how strange, we need to fix the bug that makes the thinking no workie"
instead of:
"oh, this is really weird. Its like a crazy advanced pattern recognition and completion engine that works better than I ever imagined such a thing could. But, it also clearly isn't _thinking_, so it seems like we are perhaps exactly as far from thinking machines as we were before LLMs"
Well, the difference between those two statements is obvious: one looks and feels, the other processes and analyzes. Most people can process and analyze some things; they're not complete idiots most of the time. But most people cannot think through and analyze the most groundbreaking technological advancement they might've personally ever witnessed, one that requires college-level math and computer science to understand. It's how people have always been with new technology: electricity, the telephone, computers, even barcodes. People just don't understand new technologies. It would be much weirder if the populace suddenly knew exactly what was going on.
And to the "most groundbreaking blah blah blah", i could argue that the difference between no computer and computer requires you to actually understand the computer, which almost no one actually does. It just makes peoples work more confusing and frustrating most of the time. While the difference between computer that can't talk to you and "the voice of god answering directly all questions you can think of" is a sociological catastrophic change.
Why should LLM failures trump successes when determining whether they think/understand? Yes, they have a lot of inhuman failure modes. But so what? They aren't human. Their training regimes are very dissimilar to ours, so we should expect alien failure modes owing to this. This doesn't strike me as a good reason to think they don't understand anything in the face of examples that presumably demonstrate understanding.
Because there's no difference between a success and failure as far as an LLM is concerned. Nothing went wrong when the LLM produced a false statement. Nothing went right when the LLM produced a true statement.
It produced a statement. The lexical structure of the statement is highly congruent with its training data and the previous statements.
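To make that concrete, here's a minimal sketch of what the generation loop amounts to (plain Python; the toy context, vocabulary, and probabilities are made up, not taken from any real model). Nothing in the loop checks truth: a false continuation comes out of exactly the same mechanism as a true one.

```python
import random

# Toy next-token distribution keyed by the current context. In a real LLM
# these probabilities come from a transformer's softmax over ~100k tokens;
# here they are invented purely to illustrate the mechanism.
NEXT_TOKEN_PROBS = {
    "The capital of Australia is": {"Canberra": 0.6, "Sydney": 0.35, "Perth": 0.05},
}

def sample_next_token(context: str, temperature: float = 1.0) -> str:
    """Sample one continuation token. Note: no notion of truth anywhere."""
    probs = NEXT_TOKEN_PROBS[context]
    tokens = list(probs)
    weights = [p ** (1.0 / temperature) for p in probs.values()]
    return random.choices(tokens, weights=weights, k=1)[0]

# The same sampling step can yield a true statement ("Canberra") or a false
# one ("Sydney"); the loop itself cannot tell the difference.
context = "The capital of Australia is"
print(context, sample_next_token(context))
```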
This argument is vacuous. Truth is always external to the system. Nothing goes wrong inside the human when he makes an unintentionally false claim. He is simply reporting on what he believes to be true. There are failures leading up to the human making a false claim. But the same can be said for the LLM in terms of insufficient training data.
>The lexical structure of the statement is highly congruent with its training data and the previous statements.
This doesn't accurately capture how LLMs work. LLMs have an ability to generalize that undermines the claim of their responses being "highly congruent with training data".
By that logic, I can conclude humans don't think, because of all the numerous times our 'thinking' fails.
I don't know what else to tell you other than this infallible logic automaton you imagine must exist before it is 'real intelligence' does not exist and has never existed except in the realm of fiction.
> Once AGI is declared by OpenAI, that declaration will now be verified by an independent expert panel.
I always like the phrase "follow the money" in situations like this. Are OpenAI or Microsoft close to AGI? Who knows... Is there a monetary incentive to making you believe they are close to AGI? Absolutely. Note that this was the first bullet point in Microsoft's blog post.
If you use 'multimodal transformer' instead of LLM (which is what most SOTA models are), I don't think there's any reason why a transformer architecture couldn't be trained to drive a car; in fact I'm sure that's what Tesla and co. are using in their cars right now.
I'm sure self-driving will become good enough to be commercially viable in the next couple years (with some limitations), that doesn't mean it's AGI.
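On the narrower architectural point: a transformer that maps camera input to driving actions isn't exotic. Below is a toy PyTorch sketch (the names, sizes, and two-value action output are my own assumptions, nothing to do with Tesla's or anyone else's actual stack) showing the shape of the idea; whether something like this scales to a safe driving policy is, of course, the real question.

```python
import torch
import torch.nn as nn

class ToyDrivingTransformer(nn.Module):
    """Toy vision-to-action transformer: camera-patch embeddings in,
    a [steering, throttle] pair out. Illustrative only."""
    def __init__(self, patch_dim=768, d_model=256):
        super().__init__()
        self.embed = nn.Linear(patch_dim, d_model)
        layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=8,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=4)
        self.action_head = nn.Linear(d_model, 2)

    def forward(self, patches):                 # patches: (batch, n_patches, patch_dim)
        x = self.encoder(self.embed(patches))
        return self.action_head(x.mean(dim=1))  # pool over patches -> 2 actions

model = ToyDrivingTransformer()
frame = torch.randn(1, 64, 768)                 # stand-in for one camera frame's patches
print(model(frame))                             # tensor of shape (1, 2)
```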
There is a vast gulf between "GPT-5 can drive a car" and "a neural network using the transformer architecture can be trained to drive a car". And I see no proof whatsoever that we can, today, train a single model that can both write a play and drive a car. Even less so one that could do both at the same time, as a generally intelligent being should be able to.
If someone wants to claim that, say, GPT-5 is AGI, then it is on them to connect GPT-5 to a car control system and inputs and show that it can drive a car decently well. After all, it has consumed all of the literature on driving and physics ever produced, plus untold numbers of hours of video of people driving.
>There is a vast gulf between "GPT-5 can drive a car" and "a neural network using the transformer architecture can be trained to drive a car".
The only difference between the two is training data that the former lacks and the latter has, so it's not a 'vast gulf'.
>And I see no proof whatsoever that we can, today, train a single model that can both write a play and drive a car.
You are not making a lot of sense here. You can have a model that does both. It's not some Herculean task; it's literally just additional data in the training run. There are vision-language-action models tested on public roads.
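To make the "just additional data" claim concrete, here's a toy sketch of multi-task training (hypothetical model, fake random data, and made-up heads; not any real vision-language-action system): one shared transformer trunk with a text head and a driving head, updated on interleaved batches from both tasks. Whether such a naive trunk actually learns both tasks well is a separate question; the point is only that the training loop itself is unremarkable.

```python
import random
import torch
import torch.nn as nn
import torch.nn.functional as F

class SharedBackbone(nn.Module):
    """One trunk, two heads: next-token prediction for text, action regression
    for driving. A sketch of 'more tasks = more data + a head per task'."""
    def __init__(self, d_model=128, vocab=1000, patch_dim=768):
        super().__init__()
        self.token_embed = nn.Embedding(vocab, d_model)
        self.patch_embed = nn.Linear(patch_dim, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.trunk = nn.TransformerEncoder(layer, num_layers=2)
        self.text_head = nn.Linear(d_model, vocab)   # next-token logits
        self.drive_head = nn.Linear(d_model, 2)      # steering, throttle

    def text_loss(self, tokens):                     # tokens: (B, T) token ids
        x = self.token_embed(tokens[:, :-1])
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1))
        logits = self.text_head(self.trunk(x, mask=mask))
        return F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                               tokens[:, 1:].reshape(-1))

    def drive_loss(self, patches, actions):          # (B, N, patch_dim), (B, 2)
        h = self.trunk(self.patch_embed(patches)).mean(dim=1)
        return F.mse_loss(self.drive_head(h), actions)

model = SharedBackbone()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)

for step in range(10):                               # toy loop over fake batches
    if random.random() < 0.5:
        loss = model.text_loss(torch.randint(0, 1000, (4, 32)))
    else:
        loss = model.drive_loss(torch.randn(4, 16, 768), torch.randn(4, 2))
    opt.zero_grad(); loss.backward(); opt.step()
```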
> single model that can both write a play and drive a car.
It would be a really silly thing to do, and there are probably engineering subtleties as to why this would be a bad idea, but I don't see why you couldn't train a single model to do both.
It's not silly; it is in fact a clear necessity to have both of these for something to be even close to AGI. And you additionally need it trained on many other tasks. If you believe that each task requires additional parameters and additional training data, then it becomes very clear that we are nowhere near a general intelligence system, and it should also be pretty clear that this will not scale to 100 tasks with anything like the current hardware and training algorithms.
This is something I think about. State-of-the-art self-driving cars still make mistakes that humans wouldn't make, despite all the investment in this specific problem.
This bodes very poorly for AGI in the near term, IMO