> The large language models are a little surprising. I’ll give you that.
I think this is the key point about LLMs that kind of explains the wide and polarized views on whether it understands or parrots, whether it can think or is the precursor to thinking or is a dead-end, whether it will catastrophically destroy the world, or “merely” make it steadily worse with bullshit, or just put a few industries out of a job.
Almost nobody is really surprised that if you throw more compute at a neural net it becomes better at the task it’s trained on. But almost everybody is really surprised that becoming better at a task like ‘natural language prediction’ would produce all these strange abilities that sort of look like “understanding the world”.
One way to resolve this surprise is to find some reason to believe these strange abilities are fundamentally not an understanding of the world. Thus stochastic parrots, this article, Yann LeCun and Chomsky, etc.
Another way to resolve this surprise is to find some reason to believe these strange abilities fundamentally are an understanding of the world. Thus regulation of AI, existential risk, Hinton and Yudkowsky, etc.
I don’t know what the correct resolution of the surprise is. The only thing I’m confident in is that it’s correct to be surprised by the abilities of LLMs. My current (tentative) resolution of the surprise is that language encoded way more information about reality than we thought it did. (Enough information that you can fully derive reality from language seems improbable, but iirc it did derive Othello and partly derived chess and I would have thought there wasn’t enough information in language to derive those without playing the games as well, so I can’t rule it out.)
> My current (tentative) resolution of the surprise is that language encoded way more information about reality than we thought it did.
I think you are close to the mark, but you have been subtly misled: language is not the data we are working with. We are working with text.
Once you fix that particular failure of word choice, everything else becomes much more clear: text contains much more information than language.
We aren't dealing with just any text, either: that would be noise. We're training LLMs on written text.
Natural language is infamous for one specific feature: ambiguity. There are many possible ways to write something, but we can only write one. We must choose: in doing so, we record the choice itself, and all of the entropy that informed it.
That entropy is the secret sauce: the extra data that LLMs are sometimes able to model. We don't see it, because we read language, not text.
The big surprise is that LLMs aren't able to write language: they can only write text. They don't get tripped up reading ambiguity, but they can't avoid writing it, either. Who chooses what an LLM writes? Is it a mystery character who lives in a black box, or a continuation of the entropy that was encoded into the text that LLM was trained on?
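One way to picture that "entropy of the choice" point: if a writer could have phrased the same message in several ways, the phrasing they actually wrote down carries extra bits on top of the bare message. A toy sketch in Python, with made-up probabilities purely for illustration:

    import math

    # Hypothetical probabilities a writer assigns to alternative phrasings
    # of the same underlying message (made-up numbers, just to illustrate).
    phrasings = {
        "the cheese is in the fridge": 0.5,
        "you'll find the cheese in the refrigerator": 0.3,
        "cheese: fridge, top shelf": 0.2,
    }

    # Shannon entropy of the choice, in bits: how much information the act
    # of choosing one phrasing adds on top of the message itself.
    entropy = -sum(p * math.log2(p) for p in phrasings.values())
    print(f"entropy of the phrasing choice: {entropy:.2f} bits")

    # Surprisal of the phrasing that was actually written down.
    chosen = "cheese: fridge, top shelf"
    print(f"surprisal of the chosen phrasing: {-math.log2(phrasings[chosen]):.2f} bits")

The numbers are invented; the point is only that every written sentence is also a record of a choice among alternatives, and that choice itself carries information.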
There’s an exercise that some people do when learning programming, which is to write down the steps to make a sandwich. Then the teacher follows the exact instructions to make a sandwich, and since most people don’t put in enough detail for a computer to follow (i.e. open the fridge, etc.), the teacher ends up running around bumping into things. That used to be a teaching exercise to show people the amount of precision required when telling a machine what to do.
Now with LLMs, I think one of the great leaps is the idea that it’s no longer necessary to be “pedantic” when giving computers instructions, because LLMs have somehow learned to fill in the blanks with a similar shared “understanding” of the world that we have (i.e. cheese is stored in the fridge, so you have to go open the fridge to fetch the cheese for the sandwich).
I don't get the "magic" people are seeing. It makes sense.
>LLMs have somehow learned to fill in the blanks
It's not somehow, it's because they have read a ton of books, documents, etc and can make enough links between cheese and refrigerator and follow that back to know that a refrigerator needs to be opened.
I have seen a lot of very clever AI examples using the latest tools, but I haven't seen anything that seems difficult to deconstruct.
It depends on the task. For certain programs you absolutely need to be pedantic in describing what needs to happen. There is a reason we don't program in natural language and that won't change with LLMs.
Don’t forget they hold vector spaces. So fridge and cheese score high together for cohesiveness, while fridge and Antarctica score lower, though both have something to do with cold. Together with all the training on text, this creates a good ability to make inferences and “conclusions”. It has a web of lines of meaning connecting all the concepts we fed it, which gives it the ability it has, without actually understanding.
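To make "score high together for cohesiveness" concrete, here's a minimal sketch with made-up toy vectors (real embeddings are learned from text and have hundreds or thousands of dimensions); cosine similarity is the usual cohesiveness score:

    import numpy as np

    # Made-up 4-dimensional "embeddings" purely for illustration; imagine the
    # first dimension loosely encoding "cold", the others food/household/place.
    vectors = {
        "fridge":     np.array([0.9, 0.8, 0.1, 0.0]),
        "cheese":     np.array([0.8, 0.7, 0.3, 0.1]),
        "antarctica": np.array([0.7, 0.1, 0.0, 0.9]),
    }

    def cosine(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

    print("fridge ~ cheese:    ", round(cosine(vectors["fridge"], vectors["cheese"]), 3))
    print("fridge ~ antarctica:", round(cosine(vectors["fridge"], vectors["antarctica"]), 3))

With these toy numbers, fridge/cheese scores higher than fridge/antarctica, while both pairs still share the "cold" dimension, which is roughly the effect described above.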
That conflates perception with perceiver. LLMs have only internalized [/encoded] our perceptions and expressions. From a model of the mind pov, the 'self' that we sense has an internal LLM-like tool. And it is that self that understands and not the tool.
> From a model of the mind pov, the 'self' that we sense has an internal LLM-like tool. And it is that self that understands and not the tool.
I'm starting to think it's the other way around. I think it's somewhat widely accepted that our brains do most of the "thinking" and "understanding" unconsciously - our conscious self is more of an observer / moderator, occasionally hand-holding the thought process when the topic of interest is hard, and one isn't yet proficient[0] in it.
Keeping that in mind, if you - like me - feel that LLMs are best compared to our "inner voice", i.e. the bit on the boundary between conscious and unconscious that uses language as an interface to the former, then it's not unreasonable to expect that LLMs may, in fact, understand things. Not emulate, but actually understand.
The whole deal with a hundred thousand dimensional latent space? I have a growing suspicion that this is exactly the fundamental principle behind how understanding, thinking in concepts, and thinking in general works for humans too. Sure, we have multiple senses feeding into our "thinking" bit, but that doesn't change much.
At a conceptual, handwavy level (I don't know the actual architecture and math details well enough to offer more concrete explanations/stories), I feel there are too many coincidences to ignore.
Is it a coincidence that someone trained an LLM and an image network, and found their independently learned latent spaces map to each other with a simple transform? Maybe[1], but this also makes sense - both networks segmented data about the same view of reality humans have. There is no reason for LLMs to have an entirely different way of representing "understanding" than img2txt or txt2img networks.
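As a sketch of what "map to each other with a simple transform" could mean, here's a toy version in Python with entirely synthetic data standing in for the two embedding spaces: fit one least-squares linear map from the text-like space onto the image-like space and see how much it explains.

    import numpy as np

    rng = np.random.default_rng(0)

    # Synthetic stand-ins for two independently learned embedding spaces that
    # describe the same 200 concepts (toy data, not real model outputs).
    n_concepts, d_text, d_image = 200, 16, 12
    shared = rng.normal(size=(n_concepts, 8))  # "the world" both models saw
    text_space = shared @ rng.normal(size=(8, d_text)) + 0.05 * rng.normal(size=(n_concepts, d_text))
    image_space = shared @ rng.normal(size=(8, d_image)) + 0.05 * rng.normal(size=(n_concepts, d_image))

    # A single linear transform W mapping text embeddings onto image embeddings.
    W, *_ = np.linalg.lstsq(text_space, image_space, rcond=None)

    residual = np.linalg.norm(text_space @ W - image_space) / np.linalg.norm(image_space)
    print(f"relative error after one linear map: {residual:.3f}")

If the two spaces really encode the same underlying structure, one linear map recovers most of it and the residual is small; if they didn't, no simple transform would work.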
Assuming the above is true, is it a coincidence that it offers a decent explanation for how humans developed language? You start with an image/sound/touch/other-senses acquisition and association system forming a basic brain, predicting the next sensations and driving actions. As it evolves in size and complexity, the dimensionality of its representation space grows, and at some point the associations cluster into something of a world model. Let evolution iterate some (a couple hundred thousand years) more, and you end up with brains that can build more complex world models, working with more complex associations (e.g. vibration -> sound -> tone -> grunt -> phrase/song). At this level, language seems like an obvious thing - it's taking complex associations of basic sensory input and associating them wholesale with different areas of the latent space, so that e.g. a specific grunt now associates with danger, a different one with safety, etc. Once you have brains able to do that naturally, it's pretty much a straight line to a proper language.
Yes, this probably comes across as a lot of hand-waving; I don't have the underlying insights properly sorted yet. But a core observation I want to communicate, and recommend people to ponder on, is continuity. This process gains capabilities in a continuous fashion as it scales - which is exactly the kind of system you'd expect evolution to lock on to.
--
[0] - What is "proficiency" anyway? To me, being proficient in a field of interest is mostly about... shifting understanding of that field to unconscious level as much as possible.
[1] - This was from one paper I am aware of; they probably didn't do good enough controls, so it might turn out to be happenstance.
[I may have to take you up on your profile offer of out of band continuation of this as there is a lot here to delve into and it would make for interesting conversation.]
The model of the psyche that I subscribe to is ~Jungian, with some minor modifications. I distinguish between the un-conscious, the sub-conscious, and consciousness. The content of the unconscious is atemporal, whereas the content of the (sub-)conscious is temporal. In this model, background processing occurs in the sub-conscious, -not- the un-conscious. The unconscious is a space of ~types which become reified in the temporal regime of (sub-)consciousness [via the process of projection].

The absolute center of the psyche is the Self and this resides in the unconscious; the self and the unconscious content are not directly accessible to us (but can be approached via contemplation, meditation, prayer, dreams, and visions: these processes introduce unconscious content into the conscious realm, which when successfully integrated engenders 'psychological wholeness'). The ego -- the ("suffering") observer -- is the central point of consciousness. Self realization occurs when the ego assumes a subordinate position to the Self, abandons "attachment" to perceived phenomena & disavows "lordship", i.e. the false assumption of its central position, at which point the suffering ends.

This process, in various guises, is the core of most spiritual schools. And we can not discount these aspects of human mental experience, even if we choose to assume a critical distance from the theologies that are built around these widely reported phenomena. I am not claiming that this is a quality of all minds, but it seems it is characteristic of human minds.
The absolute minimum point that you should take away from this (even if the above model is unappealing or unacceptable or woo to you /g) is that we can always meaningfully speak of a psychology when considering minds. If we can not discern a psychology in the subject of our inquiry then it should not be considered a mind.
I do -not- think that we can attribute a psychology to large language models.
~
Your comment on the mapping of the latent spaces is interesting, but as you note we should probably wait until this has been established before jumping to conclusions.
And also please excuse the handwavy manner of my comment as well. We're all groping in the semidarkness here.
Yeah, I guess you could see it that way. Object <> Symbolism (aka words, thoughts, concepts, art) <> Meaning. Meaning is knowing how an object relates to others. Language is a kind of information web already, where each word is a hyperlink into meaning.
This is a new idea that I had (or at least consciously noticed) for the first time a few days ago, but - I really don't think the meaning is in words. The words/terms themselves are more like information-free[0] points. The meaning is entirely determined by links. This works, because the links eventually lead you to ground truth - sensory inputs.
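A toy sketch of that "meaning is in the links" idea: words as bare labels whose only content is their edges, with every chain of links eventually bottoming out in stand-in sensory nodes (all names here are invented for illustration):

    # Toy "web of links": each word is an empty label; all it has is edges.
    # SENSE_* nodes stand in for raw sensory ground truth.
    links = {
        "cheese":  ["food", "fridge", "SENSE_smell", "SENSE_taste"],
        "fridge":  ["cold", "kitchen", "box"],
        "cold":    ["SENSE_touch"],
        "food":    ["SENSE_taste"],
        "kitchen": ["room"],
        "room":    ["SENSE_sight"],
        "box":     ["SENSE_sight", "SENSE_touch"],
    }

    def ground(word, seen=None):
        """Follow links until they bottom out in sensory nodes."""
        seen = set() if seen is None else seen
        if word in seen:
            return set()
        seen.add(word)
        if word.startswith("SENSE_"):
            return {word}
        grounded = set()
        for neighbour in links.get(word, []):
            grounded |= ground(neighbour, seen)
        return grounded

    print("'fridge' grounds out in:", sorted(ground("fridge")))

Nothing about the string "fridge" matters here; everything the toy graph "knows" about it is which other nodes it reaches, and which sensory nodes those eventually touch.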
Even then, you can see some of the "pedantic" cases when it comes to actually understanding the nature of the connections between those concepts. For example, it's very easy to get it to reverse shorter/taller or younger/older in clearly defined relationships.
One experiment that I would love to see is an LLM-like model for audio. Feed it hours and hours of lectures, sound effects, animal calls, music etc. You would be able to talk to it and it would ingest the raw waveform then produce audio as a response. Would it learn the fundamentals of music theory? Would it learn to produce "the sound of a bowling ball hitting a dozen windchimes?" Would it learn to talk in English and communicate with whales?
We've already done text and images, now someone please do sound!
Uhhh...this is out there, from like a dozen different groups. Not going to do a full Googling for you on my phone because it's literally everywhere but "LLM for audio" gives https://ai.googleblog.com/2022/10/audiolm-language-modeling-... as the first result...some of this stuff is already really impressive.
> Would it learn the fundamentals of music theory?
No, but you might convince yourself it did.
It would map the patterns that exist in its training set. It would then follow those patterns. The result would look like a human understanding music theory, but it would not be that.
It would be stumbling around exactly the domain we gave it: impressive because that domain is not noise, it's good data. It still wouldn't be able to find its way around, only stumble.
Not sure how that's any different than a model doing something with another model, as in AutoGPT. What part is objective? A model can be wrong just like a human can be wrong or spread falsehoods too.
A model can't be right or wrong, because it doesn't actually make any logical decisions.
These are categorizations that we make after the fact. If the model could do the same categorization work, then it could actively choose correct over incorrect.
Models could potentially make logical decisions too, if we connect them to something like a classical computer or a rules engine. I don't see any fundamental barriers to making models and computers in general similar to humans' way of understanding and reasoning too.
I don't really understand your distinction between language and text, but it sounds intriguing. Would you be able to give more detail? I searched but couldn't find anything that seemed to explain it.
Text is an instance of language. Think of it as the difference between the python language and a large collection of python programs. The language describes syntactic and semantic rules, the collection is a sampling of possible programs that encodes a significant amount of information about the world. You could learn a lot about the laws of nature, the internet, even human society and laws by examining all the python programs ever written.
An extreme version of the same idea is the difference between understanding DNA vs the genome of every individual organism that has lived on earth. The species record encodes a ton of information about the laws of nature, the composition and history of our planet. You could deduce physical laws and constants from looking at this information, wars and natural disasters, economic performance, historical natural boundaries, the industrial revolution and a lot more.
If a student studies DNA sequencing, they’ll learn about the compounds that make up DNA, how traits get encoded, etc.
Therefore the student might expect an AI trained on people’s DNA to be able to tell you about whether certain traits are more prevalent in one geography or the other.
However, since DNA responds to changes in environment, the AI would start to see time, population, and geography-based patterns emerge.
The AI for example could infer that a given person in the US who’s settled in NYC had ancestors from a given region of the world who left due to an environmental disaster just by looking at a given DNA sequence.
To the student this result would look like magic. But in the end, it’s a result of individual’s DNA having much more information encoded in it than just human traits.
text and language intersect. in some ways, text is a superset of language, mostly due to social, or what is also called pragmatic, factors that complement semantics. also, the semantics/syntax interface is anything but clear-cut, at least in natural human languages.
Any text corpus is a subset of the language, under the normal definition that a language is the set of all possible sentences (or a set of rules to recognize or generate that set of possibilities). This text subset has an intrinsic bias as to which sentences were selected to represent real language use, which would be significant as a training set for an ML model.
So, perhaps you are saying that the text corpus carries more "world" information than the language, because of the implications you can draw from this selection process? The full language tells us how to encode meaning into sentences, but not what sentences are important to a population who uses language to describe their world. So, if we took a fuzz-tester and randomly generated possible texts to train a large language model, we would no longer expect it to predict use by an actual population. It would probably be more like a Markov chain model, generating bizarre gibberish that merely has valid syntax.
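To make the fuzz-tester point concrete, here's a tiny sketch: sampling from a toy grammar produces sentences that are syntactically fine but carry none of the selection bias of real use - grammar, but no world.

    import random

    # A tiny toy grammar; randomly expanding it is the "fuzz-tester" over text.
    grammar = {
        "S":  [["NP", "VP"]],
        "NP": [["the", "N"], ["a", "N"]],
        "VP": [["V", "NP"], ["V"]],
        "N":  [["fridge"], ["theorem"], ["ocean"], ["parrot"]],
        "V":  [["refutes"], ["melts"], ["stacks"]],
    }

    def expand(symbol):
        if symbol not in grammar:
            return [symbol]
        production = random.choice(grammar[symbol])
        return [word for part in production for word in expand(part)]

    for _ in range(3):
        print(" ".join(expand("S")))

You get things like "the parrot refutes a fridge": valid syntax, but nothing a real population of speakers would have bothered to write down, which is exactly the information a natural corpus adds.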
And this also seems to apply if you train the model on a selection from one population but then try to use the model to predict a different population. Wouldn't it be progressively less able to predict usage as the populations have less overlap in their own biased use of language?
regarding the relationship: yes, and in most ways it probably is a subset. is there really such a set of rules that generates all possible sentences? in any case i wanted to say that materiality and cultural activity heavily influence what can and will be put into text, and that is not strictly language. "selection process" might capture some of it, though i'm not sure whether all of it!
I think about this as shape and color. No one ever saw a shape that wasn’t colored and likewise there are no colored things that do not have a shape.
Also, displaying text without a font is not possible.
Text is the surface of the ocean where waves emerge, and while they have their own properties and may seem to naively have agency, they are an expression of the underlying ocean.
nicely put! many aspects of text at least historically have much to do with its materiality (also in a cognitive development sense, learning how to write etc.). what we can think about nowadays is that text and speech might not be a necessary materiality of language. language might depend more on conceptual systems. more like a substrate of intelligence and that might as well be nonhuman (to stay on topic).
Not the poster, but for me it comes down to a mix of clarity and permanence.
I teach verbally (interactive question/answer) and I've also written textbooks.
Verbal language is "loose". I'll say class when I mean object, unicode when I mean utf-8, and so on. Sentences are not all well formed, and sometimes change mid-thought. It's very "real time".
Writing is a lot more deliberate. I have to be sure of each fact I state. I often re-test things I'm only 95% sure about. I edit, restructure, remove, add, until I'm happy.
Of course all communication falls on a spectrum. Think phone call at one end, text book on the other. When I do a verbal lecture I'm usually careful with my speech, and when I post on hacker-news less rigorous.
Language covers all of it. Text skews to the more deliberate side. Cunningly the language models are trained using (mostly) text, not speech. That will have an impact on them.
from a linguistic standpoint a text is a whole lot more than language: it is an externalisation of thought that is fixed onto a medium using writing utensils and, most of all, cultural norms in the form of a wild variety of different genres and forms of text, ranging from something like a stream of consciousness to something like a speech act. furthermore, text can be conceptually written or spoken, and with the internet we got an explosion of text that is conceptually spoken. those are the things OP might be referring to in regards to the "entropy" that encodes much more than just the tokens themselves.
The issue with all these experts is they still think it's human nature to be able to fully understand the world before they speak about it. On the contrary it's human nature (and all animal nature) to figure out how to navigate the world without fully understanding or having a complete model of it. All you need is a working model that affects the facets of the world you need to deal with.
I still remember in the 90s my school friend came over to my house and I was sending a fax for my dad. He was surprised the paper came back out the other side. He wasn't an idiot, and he was 15. But his model of the world didn't include deep thought about how a fax works; he merely concocted a system where the paper just went through the wire. That moment stays with me and reminds me what it is to be human and think like one. I think chatgpt is like my friend, and that should scare and excite us.
He’s doing the same to ChatGPT that he did to Skinner in the 1960s. Back then, it put him on the map, and he lives in the illusion that he was right. Now ChatGPT is pretty much Skinner’s work come to life. The B. F. Skinner book Chomsky critiqued was literally “Verbal Behavior” - a book about how intelligence arises from “dumb” reinforcement learning of words. Obviously Chomsky must now claim that ChatGPT only pretends to be intelligent. Else his entire life’s work is proven wrong.
It’s a significant misunderstanding of Chomsky’s life’s work to think that ChatGPT would prove it wrong. Chomsky’s primary claim is a claim about how language acquisition works in humans. He argues, for example, that certain locality constraints on linguistic dependencies are ‘built in’ and not learned inductively. Thus a human does not ‘learn’ that (i) is ambiguous and (ii) is not:
(i) How often did you tell John that he should take out the trash? [how often did you tell, or how often to take it out]
(ii) How often did you tell John why he should take out the trash? [only means how often did you tell]
Nothing that ChatGPT can do suggests that Chomsky was wrong about this kind of thing. It’s really more of a blow to a certain kind of work in AI that was partly inspired by Chomsky – but not something that he himself ever took much interest in.
Now it’s true that Chomsky appears to be in the camp that says ChatGPT doesn’t really understand anything. But the focus of his own work has never been on debunking AI, or making claims about the true nature of understanding, or anything of that ilk.
> He argues, for example, that certain locality constraints on linguistic dependencies are ‘built in’ and not learned inductively.
Checking in late here, but one of the pillars of Chomsky's argument is the so-called "poverty of the stimulus" -- basically, that human babies simply don't receive enough training data to acquire language as rapidly and correctly as they demonstrably do. Chomsky therefore concludes that there must be some kind of pre-existing "language module" in the brain to account for this. Now, not everyone accepted this idea even at the time, but surely the argument is much less plausible for an LLM which is likely exposed to more training data than even an adult human.
>Now, not everyone accepted this idea even at the time, but surely the argument is much less plausible for an LLM which is likely exposed to more training data than even an adult human.
Yes indeed. Of course this doesn't show that Chomsky was wrong about humans. In any case, I've seen no evidence that current LLMs successfully learn the kinds of constraints I was talking about.
There is a difference between text and language and so far LLMs have told us nothing about language. LLMs being able to generalize to languages with a much smaller training corpus shows that maybe Chomsky is right about universal grammar.
Did Skinner have anything to say about how the reinforcement works? Because with LLMs you do need the right sort of architecture, and the same with neurons, even though they don't use back propagation. Only humans are known to have language in the full sense, and there has to be some neural reason why that is. Maybe you could make an argument for cetaceans or certain birds, but again they must have the neural architecture for it.
Skinner (and behaviorists in general) did establish various 'laws' of behavioral reinforcement that do tend to hold in simple cases such as pigeons pecking at levers in return for food, etc. etc. Of course these laws had nothing interesting to say about language acquisition. I challenge anyone who thinks otherwise to actually try reading Verbal Behavior. It's an incredibly turgid and uninsightful book.
I see that Watumull is one of the coauthors. I'm not sure what's going on with that, but Watumull is the common thread running through other bad papers with otherwise-sensible linguists' names tacked on to them, such as this bizarre paper about recursion: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3884515/
I haven't had a chance to read the NYTimes editorial, but I would be skeptical how much of it is really coming from Chomsky. He's 94 at this point, and while he's not senile in a medical sense, I don't think his judgement is what it used to be.
> This is in part because GPT-3 is trained to predict the next word on a large dataset of Internet text, rather than to safely perform the language task that the user wants.
Chomsky writes that language models lack the ability to reason.
> Their deepest flaw is the absence of the most critical capacity of any intelligence: to say not only what is the case, what was the case and what will be the case — that’s description and prediction — but also what is not the case and what could and could not be the case. Those are the ingredients of explanation, the mark of true intelligence.
> [...] Suppose you are holding an apple in your hand. Now you let the apple go. You observe the result and say, “The apple falls.” That is a description. A prediction might have been the statement “The apple will fall if I open my hand.” Both are valuable, and both can be correct. But an explanation is something more: It includes not only descriptions and predictions but also counterfactual conjectures like “Any such object would fall,” plus the additional clause “because of the force of gravity” or “because of the curvature of space-time” or whatever. That is a causal explanation: “The apple would not have fallen but for the force of gravity.” That is thinking.
I decided to ask ChatGPT why an apple falls, based on Chomsky's statement:
> Suppose you are holding an apple in your hand. Now you let the apple go. You observe the result and say, “The apple falls.” That is a description. Can you say why it falls?
ChatGPT responds in exactly the way Chomsky says it cannot:
> Yes, the apple falls due to the force of gravity. Gravity is a natural force that attracts objects with mass towards each other. When the apple is released from your hand, it is subject to the gravitational pull of the Earth, causing it to accelerate downward and fall to the ground.
ChatGPT certainly appears to understand that apples fall because of gravitational attraction, and that gravity is universal.
What makes all the discussion of whether ChatGPT does or does not truly understand this or that so frustrating is that it's based on pure assertion. ChatGPT responds exactly like someone who understands gravity would, so I'm very strongly inclined to believe that it understands gravity. Otherwise, what does "understanding" even mean? It's not some magic process.
Again, turning to ChatGPT to define "understanding," here is what it says:
> [Understanding] involves making connections, integrating information, and gaining insights or knowledge about a particular subject or concept. Understanding goes beyond simple awareness or recognition; it involves interpreting, analyzing, and synthesizing information to form a coherent mental representation or mental model of the subject matter. It often involves the ability to apply knowledge in new or different contexts, make connections to prior knowledge or experiences, and make sense of complex or abstract ideas.
ChatGPT definitely fulfills that definition of "understanding."
I’ve made many attempts to use ChatGPT to develop or double-check my own logical reasoning on technical topics that happen to not be widely discussed (or maybe not discussed at all) in ChatGPT’s training data. It didn’t work well. It always devolved into guesswork and fabrication by ChatGPT, if not outright false reasoning, and while correcting ChatGPT succeeded in it agreeing about individual objections, it never showed a true and consistent understanding of the topic under discussion, and also seemingly no understanding of why I was having issues with its responses, beyond the usual “I apologize, you are correct, <rephrasing of your objection>”.
One problem likely is that it doesn’t have an internal dialogue, so you have to spoon-feed each step of reasoning as part of the explicit dialogue. But even then, it never feels like ChatGPT is having an overall understanding of the discussion. To repeat, this is when the conversation is about lines of reasoning about specific points that you don’t find good results for when googling for them.
> One problem likely is that it doesn’t have an internal dialogue, so you have to spoon-feed each step of reasoning as part of the explicit dialogue.
I think if we were to put ChatGPT on the map of the human mind, it would correspond specifically to the inner voice. It doesn't have internal dialogue, because it's the part that creates internal dialogue.
ChatGPT does not fulfill that definition because it does not have any “mental representation”; it has no mind with which to form a “mental model”. It emulates understanding — quite well in many scenarios — but there is nothing there to possess understanding; it is at bottom simply a very large collection of numbers that are combined arithmetically according to a simple algorithm.
But now you have to explain why the same is not true of a human. Just saying a human has a 'mental representation' and a 'mind' is not explaining anything
Because as humans, we know we have something we call minds and mental representations, since we experience having such things as we go about our lives. How the nervous system produces those and how exactly we should understand mental is unclear. But since LLMs aren't brains and don't work the same way, we can't say they have anything like minds right now. The solution isn't to get rid of the mental in humans, it's to better understand the differences and similarities between machine learning models and biological nervous systems.
That is a very common perception among people who haven't given him any attention. I used to be in the same boat, but it turned out to be very interesting once I actually listened to him.
Accusing Chomsky of being a ‘clickbaiter’ is maybe the most absurd thing I’ve heard all month. You think he’s trying to get additional views for his TikTok videos?
His recent political ramblings and Epstein-adjacency are extremely embarrassing (at best), but he's not some kind of cheap online attention whore.
> Accusing Chomsky of being a ‘clickbaiter’ is maybe the most absurd thing I’ve heard all month. You think he’s trying to get additional views for his TikTok videos?
Chomsky has been addicted to media attention for decades. Back in the day there were literally people selling cassette tapes of his latest thoughts.
C'mon, let's not get into it here. As I'm defending Chomsky, I just wanted to be clear that I don't agree with his recent comments on the Russian invasion of Ukraine, and that I find his association with post-conviction Epstein extremely distasteful at best. Others may disagree, but this isn't the place to have that argument.
Bit of a tangent, I asked a friend if I could rip and burn one of his CDs about 15 years ago, and he looked at me in horror and told me that I absolutely could not, because he liked that disc.
It took me a bit to realize what he understood my words to mean.
That's a good guess, but tbh this was in the days of modems, bbses, and shareware. Piracy and anti-piracy campaigns weren't really common outside of the software world back then.
> how to navigate the world without fully understanding or having a complete model of it
GPT-4 does not have ANY understanding or model of the world - it just has a model of what tokens (words) are likely to appear in a certain context. If it could build any usable model of the world, and reason about it, I'd be much more impressed.
When it quacks like a duck, only the most simplistic view takes it as being a duck.
Isn't our understanding or model of the world just what images, sounds, smells, tastes and physical feelings are likely to appear in a certain context? Obviously a language model doesn't yet have access to all the data humans have, but that doesn't mean there can't be some kind of a model of the world. If you invent a new word and give a couple example sentences, GPT-4 can figure out the meaning without ever seeing the word. Doesn't that require some kind of understanding?
> My current (tentative) resolution of the surprise is that language encoded way more information about reality than we thought it did. (Enough information that you can fully derive reality from language seems improbable, but iirc it did derive Othello and partly derived chess and I would have thought there wasn’t enough information in language to derive those without playing the games as well, so I can’t rule it out.)
The information isn't in language itself, it's in language as actually used by humans. GPT4 knows about chess because it's "read" a significant fraction of everything we've ever written about chess. A human being who did that without ever playing a game would also start out better than a typical novice.
> A human being who did that without ever playing a game would also start out better than a typical novice.
I am quite skeptical of these arguments along the lines of “imagine a human read everything written on the topic…”.
What humans are doing when they read something is not what neural nets are doing when they read something. Humans are (idealistically) doing something like Feynman’s description of how he reads (or in this case, listens to) a theorem:
“I had a scheme, which I still use today when somebody is explaining something that I’m trying to understand: I keep making up examples. For instance, the mathematicians would come in with a terrific theorem, and they’re all excited. As they’re telling me the conditions of the theorem, I construct something which fits all the conditions. You know, you have a set (one ball) – disjoint (two balls). Then the balls turn colors, grow hairs, or whatever, in my head as they put more conditions on. Finally they state the theorem, which is some dumb thing about the ball which isn’t true for my hairy green ball thing, so I say, ‘False!’"”
Bret Victor’s description of what “really good programmers” are doing is also related:
“[showing the code for binary search] In order to write code like this, you have to imagine an array in your head, and you essentially have to ‘play computer’. You have to simulate in your head what each line of code would do on a computer. And to a large extent those who we consider to be skilled software engineers are just those people who are really good at playing computer.”
I think when we imagine an LLM as a human who’s read everything ever written in chess but never played an actual game, we’re actually tricking ourselves - because that hypothetical human would be ‘playing chess’ inside their head by imagining the pieces and moving them according to the rules they had read[1]. LLMs are not doing anything like that when they read about chess. So it’s a very restricted (or perhaps more accurately, a very different) kind of ‘reading’ that we don’t have any intuition for. Since the ‘reading’ that we do have an intuition for is smuggling in exactly the kind of “modeling the world” ability we’re looking for, it’s not surprising that this argument would incorrectly lead us to believe we’ve found it in LLMs.
1: In fact the very best computer chess is achieved by AlphaZero which was trained exclusively on “playing chess in its head”, and it beats even the most powerful and optimized search algorithms like Stockfish looking 20 moves ahead.
What LLMs are doing when they imagine playing chess is what we do when we stand up after sitting on the floor, or what we do when we see a few million individual samples of color and light intensity and realize there's an apple and a knife in front of us.
I think what is almost impossible for most people to understand is that AIs do not need to be structured like the human brain, or use the crutches we use to solve problems the way we do, because evolution did not provide us with a way of instantly understanding complex physics or instantly absorbing the structure of a computer program by seeing its code in one shot.
Also, there is no reason to believe that playing chess in our head is anything else but us pattern matching a mental process on a higher level, recognizing a simulation there, and feeding that info back into the loop below. Nature provided us with a complex, layered and circular architecture of the brain, but the rest is pretty much training that structure. And we know that different architectures with similar outcome are possible, since there are vast variations across our own species, and other species as well, with essentially the same capabilities.
So if we gathered all the books, writings, games, etc. ever published on chess, could we develop a grandmaster player without having to build a Deep Blue-type system?
In that scenario, I believe a current LLM would "understand" chess. It would know how to play the game, it would know what being a grandmaster means, and it would know the heuristics and algorithms for evaluating a given game state. But it would not be able to implement an optimized solver to actually play the game efficiently. To do that would require it to also have all the knowledge about computer science.
Hence the dream of singularity. Can you teach ChatGPT to build AlphaZero?
> it's "read" a significant fraction of everything we've ever written about chess
I doubt heavily that a significant fraction of chess's writings are even available in digital format, much less inside of CommonCrawl and correctly trained on.
Why would Common Crawl NOT contain a chess manual? The rules are explained in detail on Wikipedia. The simplest conclusion is that it has indeed been trained on a chess manual and is good at predicting what the next word in a chess manual is. It is not synthesizing anything.
I haven't seen this mentioned but can LLMs actually play chess?
I'm sure they have read rules of chess online, but if you ask them to play chess with you, what happens? Can they apply the rules? Can they apply them intelligently and win the game?
My point is that even though LLMs "know" what the rules of chess are, they don't really "understand" them, unless they can use them to play the game and play it well.
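The "can it apply the rules" half of this is easy to test mechanically: have the model propose moves in standard notation and check each one against a real rules engine. A sketch using the python-chess package (pip install python-chess); the move list is a made-up stand-in for model output, not something an actual LLM produced:

    import random
    import chess  # the python-chess package

    # Hypothetical moves "the LLM" proposed for White, in standard notation.
    # The last one is deliberately impossible from the position it arrives in.
    proposed = ["e4", "Nf3", "Bc4", "Qxf7#"]

    board = chess.Board()
    for san in proposed:
        try:
            board.push_san(san)  # raises a ValueError subclass if illegal here
        except ValueError:
            print(f"illegal move proposed: {san}  (position: {board.fen()})")
            break
        if board.is_game_over():
            break
        board.push(random.choice(list(board.legal_moves)))  # stand-in opponent
    else:
        print("every proposed move was legal")

Playing legally is the low bar; playing well is a separate question, and a loop like this only measures the first.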
You win chess by "applying the rules intelligently" the same way you launch rockets to space by applying the laws of physics intelligently. True, yes, but not as simple as it sounds.
Ok, name 2 famous "chess writings" and let's see if they are in common crawl...plenty has been written about chess on the internet. Why are we acting like this is magic?
Human: Finish this quote: "Chess holds its master in its own bonds, shaking the mind and brain
Gpt4: The quote you're referring to is from the Indian mathematician and writer, Raja Rao. The full quote is as follows: "Chess holds its master in its own bonds, shaking the mind and brain so that the inner freedom and independence of even the strongest character cannot remain unaffected."
Honestly I feel like one reason people are struggling with this is that they can't accept a critical part of the truth: most people are stochastic parrots themselves most of the time. Truly creative, genuine, deep thinking is an exceptional state of thinking for us.
Years ago I transitioned from a developer role to a manager role and suddenly I had to do a lot more talking. Not all the talking needs to be a deeply involved exchange of complex ideas, a lot of it serves a different purpose. Sometimes it can be a simple as filling up the time in a pleasant way with a group of people that may or may not know each other that well.
After getting some experience with this I noticed that I had developed a talking on/off button in my head. I could simply turn it on and start talking. I could generate words that sounded good together and fit the purpose of the moment. But they just seemed to come from a different place in my brain than my conscious mind, because that was not involved in this process at all. The only job my mind had was to turn the button off again at the right moment; for the rest it was free to think whatever it wanted.
(I transferred back to development a couple of years later.)
The very fact that we can be conscious about our ability to generate language and work it separately from the rest of our consciousness tells me that chatbots might emulate our language generation ability. We shouldn't invert it to conclude that means they can reason about it.
> We shouldn't invert it to conclude that means they can reason about it.
Agreed. However, I think it's a somewhat accepted view that the bulk of the work constituting reasoning happens subconsciously, with the conscious mind playing the role of a censor/gatekeeper and occasionally handholding when reasoning through tougher problems.
This is why I am starting to think that it is very important for us to be able to "introspect" and reason about the thoughts/feelings we're having. It allows us to have an internal feedback loop that I think a lot of people lack.
Maybe in casual conversation, but that's not how I experience my thought process about anything non-trivial at all. I usually spend a lot of time thinking about the concept in non-verbal terms, and that process involves recalling images and sensory information in fairly abstract terms; then, through what feels like several iterations, it starts to coalesce into something I can encode in language.
I think we can all agree that these LLMs are surprisingly good at generating text that is often coherent but I don't see how you can discard all those extra inputs and claim you have the same process.
> thinking about the concept in non verbal ... abstract terms
The abstract terms we think about are concepts, and we think about multiple concepts, at various levels of abstraction, and their relationships to each other, before getting a sense of what we want to say or write.
Only then do we begin speaking or writing, grouping concepts into paragraphs, breaking them down into sentences and words.
And there's evidence that LLMs do something similar, creating embeddings for both big ideas and small details, modeling how the small details combine into larger concepts, discovering the relationships between concepts, and only then generating a probabilistic sequence of tokens to express those deeper concepts.
Lately I've been thinking that the staple of science fiction, the android robot, is the only way to get AGI. I thought a lot about computer vision when that was the bleeding edge of AI a few years back. It strikes me as obviously wrong to try training models on two-dimensional arrays of pixels and expect them to develop visual skills. Why don't we give them two cameras and arms so they can learn about physics and start differentiating 3-dimensional input? I think when we have robots capable of interacting with the world as humans do, we might be able to train them to develop near human levels of intelligence.
The question is whether you knew how that sentence was going to end when you started writing it, or indeed whether I knew that I was going to add this comma-separated adjunct when I started writing the preceding clause, and I cannot honestly say at this precise moment of typing whether the final word in this sentence is going to end up being 'yes' or 'no'.
For me it's a little more subtle. When I sit down to write, say, an email, I usually have a specific point that I want to make but I usually haven't yet figured out exactly how I want to get there. I'm not even sure it is even possible to conceive of writing without doing it word by word. Perhaps the difference though is that we have the ability to go forward and backwards in the sense that we can usually think faster than we can type or write, which allows our thoughts to advance beyond the tokens we have produced. This allows for a kind of real-time editing process that incorporates both past and possible future information. But I imagine there is probably a way to enhance LLMs to branch and edit before producing the next word materially (if they aren't already doing that).
To go even more meta, there is an analogy I'm trying to make right now in which I am visualizing a road and thinking about how describing the road relates to the process of writing. In my mind's eye, I can see the full length of the road and all of its contours, but I can't actually describe the individual stretches of the road coherently without enumerating them. Something similar happens with writing. I can visualize what I want to say far beyond the next word, but it's true that the actual process of writing goes word to word, much like how the process of token selection is described for an LLM.

The question is whether the LLM has an analogous conception of where it is going. Going back to the process above, sometimes I know where I am going but haven't yet figured out how to articulate it. It is through the process of writing that I am able to articulate that thought. But the thought preceded my articulation of it. I don't know to what extent LLMs have coherent thoughts that they are articulating, or if that even makes sense for the type of intelligence they project.

My suspicion is that they don't have additional sensory inputs beyond language that give thoughts the immaterial shape that is then expressed in language. Without that, I am skeptical that they will truly get beyond regurgitating and/or remixing what has already been fed to them textually. That doesn't diminish how amazing they are, but I am somewhat more in the Brooks/Knuth camp: they are impressive and surprising, but there is something that ultimately leaves me a bit cold about them.
> I usually have a specific point that I want to make but I usually haven't yet figured out exactly how I want to get there. I'm not even sure it is even possible to conceive of writing without doing it word by word. [...] In my mind's eye, I can see the full length of the road and all of its contours but I can't actually describe the individual stretches of the road coherently without enumerating them.
Not to trivialise the interesting point you’re making, but do you never write with an outline? Write bullets for the big points you want to touch, then go back and flesh out the details?
I do wonder how well (quickly) the whole thought is formed in your head and it’s just the encoding into language that tricks you into thinking you didn’t know what the word would be.
Having learned another language, the moment you start to feel “fluent” is when you start speaking first in the 2nd language and aren’t using your first language as an intermediate step to translate to your 2nd language.
This is a thought that LLMs caused me to have: the emotion of surprise is just the brain's model of the world discovering that it had been poorly calibrated or was making poor predictions.
> The free energy principle is based on the Bayesian idea of the brain as an “inference engine.” Under the free energy principle, systems pursue paths of least surprise, or equivalently, minimize the difference between predictions based on their model of the world and their sense and associated perception.
Right, but LLMs suggest that learning how to predict, and training only on prediction, is sufficient to learn anything and to have emergent generative abilities. What if learning to predict the upcoming input is all that is needed for general human intelligence? What if it is all that any animals do?
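A minimal illustration of "train only on prediction, get generation for free": fit nothing but next-character counts on a toy corpus, then sample from those same counts. It's hopelessly crude next to an LLM, but the shape of the claim is the same.

    import random
    from collections import Counter, defaultdict

    corpus = "the cheese is in the fridge. the fridge is in the kitchen. "

    # "Training" is nothing more than tallying which character follows which.
    counts = defaultdict(Counter)
    for a, b in zip(corpus, corpus[1:]):
        counts[a][b] += 1

    # Sampling from those same predictive counts already generates new text.
    random.seed(1)
    ch, out = "t", ["t"]
    for _ in range(60):
        choices = list(counts[ch].keys())
        weights = list(counts[ch].values())
        ch = random.choices(choices, weights=weights)[0]
        out.append(ch)
    print("".join(out))

Everything the sampler produces comes from the predictive statistics alone; scaling the same move up from characters to tokens and from counts to transformers is, very roughly, the bet LLMs make.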
This right here. I actually am strongly starting to believe that this is indeed what's going on.
I read the book Kingdom of Speech a few years ago and that also left me with the perspective that perhaps language has a lot more to do with how we think and perceive the world than most people like to admit. The book has been heavily criticized but I believe it made an interesting point about language.
I mean hasn't it been proven on multiple occasions that humans who never learn a language during their life also don't develop higher cognitive skills and stay on the level of a child? Clearly language plays a notable part in reasoning, and it's frankly no surprise that we're seeing this behaviour from language models.
It's also quite interesting how foundation models and fine tuning appear analogous to being born with a brain that already has six million years' worth of weights in it (trained implicitly through random changes and natural selection), which are then adapted over the course of a lifetime when environment relevant data is gradually obtained.
The sad truth is that we are all NPCs without free will. "Thoughts" are a non-physical phenomenon that, seemingly, arise from physical activity in the brain. The thoughts (our "experience") come after the electrochemical reaction that manifests them. How could these non-physical phenomena then turn around and influence the physical chemistry? They can't. By the time we experience a thought, the physical state for it has been represented, and that state causes the next one after it. Like a stone bouncing down a hill - chaotic but still deterministic. We're just along for the ride.
Reducing a debate that has been raging for millennia across several disciplines to three lines of text tells me you don't have enough of a grasp on the subject to draw any conclusions about it, let alone resolve it altogether.
Care to enlighten us a little? What is there even to debate? The position expressed by your parent comment is so uncomfortable that it's constantly being challenged, but that doesn't make it wrong. Or are "thoughts" some sort of magical fluff not bound by the laws of physics? Are we constantly pulling ourselves by our bootstraps? Giving birth to entirely new brain activity out of nothingness?
This is just woo woo nonsense. Until someone can find an explanation that makes more sense than "natural selection slowly gave apes the ability to think abstractly and they started thinking about themselves", I won't believe in any kind of free will, and now the meta is reaching the point where we create entirely new brains out of silicon.
Does free will exist or only hard determinism can be true? You seem to have reached a definitive conclusion. And you seem to think natural selection and evolution of species and autonomous machines prove your point.
You also seem to think that questioning the veracity of your position is engaging in magical thinking. How could I possibly 'enlighten' you if you are sure of being right?
It doesn't matter if I point out that physics hasn't been able to prove materialism. Where is the fundamental particle? Why can't we determine even through which slit did a quantum of light travel? The Copenhagen interpretation of quantum physics appears to directly contradict your claims of hard determinism being the only valid explanation of reality. But I must be wrong about that. Since clearly you have seen reality for what it is, and I have failed to do so.
Then again, we cannot escape this situation seeing as it is determined.
Of course I'm willing to change my mind, but so far no one has ever given a compelling argument. I just follow Occam's razor.
> The Copenhagen interpretation of quantum physics appears to directly contradict your claims of hard determinism being the only valid explanation of reality
OK, fundamental physics might not follow hard determinism (or not in a way that we currently understand), but please, indeterminism says nothing about free will, human thoughts, or anything related to that. If you want to say the human brain follows the same physics rules as everything else in the universe, and that this implies some indeterminism, sure. That's almost certainly correct. But where do thoughts arise from that?
If you sprinkle randomness on the process, I'll agree with you; but nothing in physics even suggests ANY link between indeterminism, superposition, etc. and thought. So my point still stands: free will as we usually envision it has no reason to exist. The fact that particles can be in two states at once does NOT contradict this.
Indeterminism does not automatically imply free will. None of the open questions of physics, including how qualia arise, could plausibly explain free will. Neuroscience is continuing to show just how deterministic and predictable brains are.
So at this point the burden is on free will proponents to offer a plausible explanation for it. Without resorting to dualism, which is great for religious people but not scientifically useful.
Also, it's funny for you to start this thread with such a disparaging and dismissive comment, then come back and act victimized that you're being questioned. If you are a self aware entity in control of your behavior, maybe you should try harder not to be a hypocritical asshole.
I believe this view of humans is because we work in increasingly deterministic environments (the internet) using predictable methods of communication, text.
We’ve kind of made the world a bit boring and deterministic, a place where almost perfect knowledge is obtainable and so everything feels more and more predictable.
Your day probably consists of using Google, talking on platforms that don’t change much, solving already-solved coding problems and communicating with others about office politics problems that we’ve all spoken about over and over again. We literally are just chatbots in this world.
Can you think without words? Can you imagine how it feels to swim, something you might have done, and extend it to fly like a bird? Can you read a novel and feel emotions about characters that don't exist beyond text descriptions of their actions? Language is a way to encode thought. It's not what thoughts are made of.
Are images and words the only way to think? Are these boundaries even fundamental or are they arbitrary? How does a blind deaf person think? Do you ever think in terms of sensations or movements? Can you remember a roller coaster ride, a car trip? Can you think of a taste?
I find hard to conceive of people who can think only in terms of language.
Does that reasoning work when applied to what you just wrote? If you’re just predicting the next word, then what makes you confident in the conclusion that you’re just predicting your next word?
Yes. This was my second draft. I reviewed my first draft, which became an input to my second draft, which gained more clarity after reviewing the first draft.
The way people usually say it is that you revised your first draft upon reflection, because it didn't quite get across the point you wanted to make, or something to that effect. So what makes the way you state it more true than the way people typically state it?
I can switch between word generation and evaluation. I saw that the score of the 1st draft was below my target, and I used the insights from the eval of the 1st draft to inform the 2nd draft.
Language is the compression of our entire human experience into words, so I'd argue language captures the majority of the thinking aspect of the human experience.
Yes, most of the brain is a glorified microcontroller for running and interpreting data from a billion skin sensors and driving hundreds of muscle actuators, doing depth and object inference from binocular images, locating sounds from the twin microphone array, etc. Language and especially reasoning is a very small part of it all.
> One way to resolve this surprise is to find some reason to believe these strange abilities are fundamentally not an understanding of the world. Thus stochastic parrots, this article, Yann LeCun and Chomsky, etc.
I mean, it is kind of obvious that predicting text based on a large corpus of text written by people with a model of the world will, if it works at all, look like having a model of the world.
The question is whether that involves having a model of the world, and secondarily, if there is a difference, what is the practical impact of the difference?
Or maybe that’s not really the question, because the whole “model of the world” thing is metaphysical omphaloskepsis that is inherently unanswerable because we can’t actually unambiguously reduce it to something with testable empirical predictions, reflecting a rationalization for elements of our culture and particularly our own view of our own special place in the universe, and the different answers to it likewise have no real meaning but simply reflect people’s bias for whether or not they want to share that special place, either in general or with the particular computing systems under discussion in particular.
The one thing that has convinced me that chatGPT has built a real world model is asking it how to stack different objects. You can ask it for the safest way to stack a pillow, an egg, a nail, a book and an action figure. Even get more complicated. GPT-4 will, most of the time, correctly reason through how to stack these items to form a stable structure without breaking. That tells me it knows intimately the physical properties of all of these objects and how they interact with each other.
Or was that one of the training samples provided to gpt 3.5?
It is going to be increasingly hard, going forward, to distinguish between what was in the training set and what is creative thinking.
Like, at the very minimum, all the publicly known reasoning samples that were used to probe GPT 3.5 and earlier are going to be unreliable for judging what GPT-4's reasoning abilities are.
The point of the question is that you can easily come up with combinations of objects that are unlikely, or statistically impossible, to be in their training samples. How about this one? "Tell me how to safely stack the following objects on top of each other without any additional support: A Nikon DSLR, A small succulent, 4 lightbulbs, a pack of gummy worms, and a hardcover copy of Harry Potter and the Half-Blood Prince."
Hardcover copy of Harry Potter and the Half-Blood Prince: Start with the largest, heaviest, and most stable object. This will serve as your base. Make sure to place it on a flat, secure surface.
Pack of gummy worms: This item is fairly light and flexible. It should sit nicely on the book without risking damage to either. You can place it right in the center to maintain balance.
Nikon DSLR: This is a heavy object with a fairly stable shape. Put this on top of the pack of gummy worms, as it's sturdy enough to handle the weight and won't be harmed by the soft package. Make sure to place it so that the lens is pointing upward and it sits flat and stable. Don't use any protruding lenses or parts to balance it, as they could easily be damaged.
Four Lightbulbs: Since light bulbs are round and fragile, you will need to stack them with care. If the bulbs are in a box, you can place the box on top of the DSLR, making sure it's centered and stable. If the bulbs are loose, it's a bit more tricky. You might want to nestle them into the lens of the DSLR (if the lens is large enough), which should provide some natural containment for them. If not, it might be safer not to include the lightbulbs in your stack.
A small succulent: Finally, the succulent can be placed on top. It's likely the lightest object and it is usually quite stable due to its pot. If the lightbulbs were in a box, place the succulent on top of that. If you ended up not including the lightbulbs, place the succulent directly on top of the DSLR.
Reads well if you don't think about it too much... For example: Where does the succulent go if the light bulbs are nestled into the lens of the DSLR? Balanced on the light bulbs? Why would the gummy worm package need to be in the center of the book to maintain balance?
I tried to stack all of these objects myself and couldn't really. I think GPT-4's approach is actually really good. It correctly points out that the gummy worms make a flexible base for the DSLR (otherwise the protruding buttons/viewfinder make it wobbly on the hard book), and the light bulbs are able to nestle into the front of the lens. If they were smaller light bulbs I could probably use the four of them as a small base on top of the lens to host the succulent.
The point is that ChatGPT undeniably built a world model good enough to understand the physical and three-dimensional properties of these items pretty well, and it gives me a somewhat workable way to stack them, despite never having seen that in its training data.
You cannot conclude that from the output - the training data will likely contain a lot of stacking examples. Everyday objects also might have some stacking properties that make these questions easy to answer even with semi-random answers.
Plus, some stuff clearly makes no sense or is ignored (like the gummy worms in the center, forgetting about the succulent in some cases).
If you want to test world modeling, give it objects it will have never encountered, describe them and then ask to stack etc. For example, a bunch of 7 dimensional objects that can only be stacked a certain way.
> If you want to test world modeling, give it objects it will have never encountered, describe them and then ask to stack etc. For example, a bunch of 7 dimensional objects that can only be stacked a certain way.
And when it does that perfectly, I assume you'll say that was also in the training data? All examples I've seen or tried point to LLMs being able to do some kind of reasoning that is completely dynamic, even when presented with the most outlandish cases.
All examples I tried myself show it failing miserably at reasoning.
It certainly needs better evidence than being able to come up with one of many possibilities of stacking things - aided by human interpretation on top of the text output. Happy to look at other suggestions for test problems.
Well for me personally, the proof is in giving it a few sentences on how it should write fairly complicated pieces of unique code I need on a daily basis and seeing it correctly infer things I forgot to specify in ways that are typically borderline impossible for anything but another human. If that's not reasoning I don't know what is.
The other one that convinced me was this list: https://i.imgur.com/CQlbaDN.png
I think the leetcode tests are quite indicative, going as far as saying that GPT-4 scores 77% on basic reasoning, 26% on complex reasoning and 6% on extremely complex reasoning.
Maybe the reasoning is all "baked in" as it were, like in a hypothetical machine doing string matching of questions and answers with a database containing an answer to every possible question. But in the end, correctly using those baked in thought processes may be good enough for it to be completely indistinguishable from the real thing, if the real thing even exists and we aren't stochastic resamplers ourselves.
> aided by human interpretation on top of the text output
That's an interesting point, actually. I've been trying to do something in that regard recently, by having it use an API to do actual things (in a simulated environment), and it seems very promising despite the model not being tuned for it. Given that AutoGPT and plugin usage are a thing, that should be all the evidence you need on that front.
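For what it's worth, a setup like that usually boils down to parsing the model's text output into structured calls against the environment's API. A rough sketch only; call_llm and the env object here are hypothetical, not AutoGPT's or any plugin's actual interface:

```python
import json

# Rough sketch of letting a model act through an API in a simulated environment.
# call_llm() and the env object are hypothetical stand-ins.

SYSTEM = (
    "You control an agent in a simulation. Reply ONLY with JSON like "
    '{"action": "move", "args": {"direction": "north"}}.'
)

def agent_step(call_llm, env, observation):
    reply = call_llm(system=SYSTEM, user=f"Observation: {observation}")
    try:
        cmd = json.loads(reply)              # model output -> structured call
    except json.JSONDecodeError:
        return env.noop()                    # unparseable text: do nothing this step
    return env.execute(cmd["action"], **cmd.get("args", {}))
```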
Coding isn't a use case of mine. For example, for things like financial derivatives replication it can tell you the abstract concept but it cannot apply it in a meaningful way.
Why? You need to make sure that a solution requires true understanding and isn't in the training set. If it can reason properly, it shouldn't have a problem with such a task.
The resolution is actually fairly simple. It's an incredibly brilliant stochastic parrot with some limited reasoning capabilities.
Some folks will try to say it cannot reason, but they are wrong, there is extensive proof of that.
The only question is how limited its reasoning capabilities are. After spending extensive time on openai/evals, having submitted 3 of my own, and doing a lot of tests, I would argue that an average person of average IQ could outthink GPT4 - as long as the stochastic parrot aspect wasn't a factor.
That's probably an accurate assessment; the question is mainly whether the reasoning can be improved to a notable extent on the current architecture, and by how much.
I myself assumed that we're pretty close to the end of the S curve when first using 3.5-turbo and figured that hallucinations will be pretty hard to overcome, but with GPT 4 being such a massive improvement on all metrics I'm no longer as sure. GPT 5 will probably be more definitive on what's possible, based on where it starts having diminishing returns.
It's very hard to know because we don't know and can't experiment with its training data. So - it may be doing first principle reasoning, or it may be doing token substitution vs. some known example that it's seen before and is matching to.
The dictionary definition of reasoning says nothing about how the thinking is done, only that it's sensible and logical, which is exactly what GPT4 is. Limited, yes, but it reasons.
An argument based on common sense can also be made:
Any system that possesses a wide range of capabilities, most of which it was not specifically trained to perform, cannot possibly perform all these tasks so well solely by making probabilistic guesses.
(Humans, too, were not directly shaped by natural and sexual selection to possess all of our cognitive capacities.)
I think the main issue is that since we don't have access to the implementation of GPT-4, there's no way for researchers to know how it works, so either they're not saying anything or they're just making things up.
GPT-4 is "multimodal" and RLHF'd, so it was trained with some tasks other than next word prediction. I don't remember if it's been trained for code correctness (by running unit tests etc.), but other models have been.
> One way to resolve this surprise is to find some reason to believe these strange abilities are fundamentally not an understanding of the world. Thus stochastic parrots, this article, Yan LeCun and Chomsky, etc.
I don't think it's at all surprising that a system designed to predict the next word ends up understanding the world.
If you want to predict the next word accurately, you first have to know which words exist. To progress, you'll have to learn about the mechanics of grammar and which words are used more frequently or in combination. To become even more accurate, it helps to understand context, so that the sentences you string together will at least be relevant to the subject. If you want to increase your accuracy even further, you'll have to start memorizing all sorts of facts (e.g., "Who was the monarch of England in 1600?"). Being able to synthesize those facts into a coherent argument will increase your accuracy even further.
In the end, predicting the next word accurately requires an understanding of the world.
This isn't all that different from how our own intelligence evolved. You could look at humans from the outside and disparagingly point out that the ultimate purpose of the human brain is to direct muscle motions in a way that maximizes the chances of reproductive success. It just turns out that solving that problem effectively has led to the development of an enormously complicated piece of machinery, capable of synthesizing all sorts of input stimuli into a coherent picture of the world, and ultimately of producing the works of Shakespeare and the music of Beethoven.
Has there been any research in the possibility that much of what we say/speak (including this convo) is actually just stochastic parroting?
e.g. instead of the stochastic parrots mimicking intelligence, maybe intelligence doesn't exist; it's just stochastic parrots of various levels of sophistication organized into a hierarchy. "Intelligence" is necessarily socially defined, with the more complex parrots being unpredictable and "intelligent" from the POV of lower parrots. Vice versa, looking down, the lower parrots seem to act like "NPCs"
To paraphrase as per my understanding of your comment, is intelligence an emergent property of being able to interact with each other through language?
Say I speak gibberish (to you) which is actually me explaining to you the theory of relativity, would you consider me intelligent?
What's the difference between "understanding" and "having really good probabilistic information about how words combine"?
Kids learn to speak by parroting what they hear and observing the outcome. Then they run tests that reinforce the connections between words. That's what the model is.
But humans also get to link words with all the other sense experience we have (like how sweet cherries, loud fire trucks, and that one crayon are all "red"). LLMs don't have as many dimensions of experience they can link to.
But anyway, intelligence is about having an internal model of the world and using it to predict the future. The more rich and accurate the model, the more intelligent. The ability to communicate isn't a prerequisite; lots of animals have intelligence that isn't built with language.
Spot on. That we have internal language models doesn't mean we only have language models as cognition. People seem to be redefining imposter syndrome as reflected by stand alone language models. Makes me think of Tropic Thunder. "Am I even real". Lol get a grip people.
If LLMs are an understanding of the world it would mean humans in a few decades found a way to create sapience with many orders of magnitude fewer interacting elements than evolution did. I find that doubtful, at least in light of the fact every other way we've replicated biological computation requires many more computational elements.
Firstly, if humans created sapience, we created it with all the interacting elements that evolution used plus all the things that we used.
Secondly, maybe sapience isn't as big of a deal as we thought compared with all the other things that evolution did. Remember that biological entities have to figure out survival, reproduction etc. Sapience emerges as a byproduct but the selective pressure is towards those things so sapience is only selected for to the extent that it also moves forward those other goals.
By contrast, LLM training is just focussed on the task of making the model better. The model doesn't have to figure out how to feed itself, ward off predators, not accidentally die in the myriad ways things die, reproduce itself etc. It's way more specific. It doesn't seem unreasonable to think that the complexity would be lower given it's not trying to achieve nearly as much.
Good point. I would even add that human brains have to process a lot of sensory input: video, audio, touch, smell, taste. And the communication channel for humans is very inefficient: you have to translate your thoughts to language, then to audio, then someone else does the same in reverse. If you're texting, you have to slowly type something, then someone else needs to do text recognition in their brain.
LLMs are like handicapped humans who are visually and hearing impaired, with no taste or smell (as with long covid), who can focus only on thinking, and who have a very efficient communication channel: text/tokens sent via wire over the internet.
Yes. One of the astonishing examples in the paper I quoted earlier [1] is an image produced in response to the prompt "Produce Javascript code that creates a random graphical image that looks like a painting of Kandinsky". The response by GPT-4 does kind of look quite a lot like a Kandinsky.
Generally people use that word to mean "wisdom" or at least "some reasoning ability distinct from what animals have".
Edit to add: From a personal perspective I don't see any reason to think humans have qualitatively different reasoning abilities from animals, or a unique "soul" or anything like that, so the term "sapience" doesn't really have a special resonance with me like it might for someone who thinks those things. That may affect some judgements here I don't know.
Why would that be surprising though? The natural world climbs up to understanding from a severe disadvantage. Chemicals don't understand the world; the natural world had to randomly build every bit of its systems from scratch before it could produce a succinct descriptive system for it.
LLMs on the other hand start out with a succinct descriptive system, and translate that to the world of chemicals and photons via some very complicated naturally evolved systems.
If humans actually went to the moon, it would mean that humans in a few decades found a way to access a niche that nature never did.
In all seriousness, it's interesting all of these dualisms we like to hold on to. Humans are part of nature. It is unsurprising that further sapience would branch off from an already sapient race as opposed to re-emerge elsewhere.
Elephant and whale brains are both bigger than human brains, but they're less intelligent than us. Much of the volume of their brains goes towards controlling a larger body. Conversely, some birds have much smaller brains than any primate but can learn to solve simple puzzles and problems.
> One way to resolve this surprise is to find some reason to believe these strange abilities are fundamentally not an understanding of the world.
An easy way to test this is to ask questions and followup-questions that actually require understanding, and compare this to the answers. I recommend to try that.
I’m not as surprised as many people (I saw a great tweet once that said “Language was the first artificial intelligence, writing was the second. I literally believe this”, and that’s a broadly accurate description of my worldview too).
That said, maybe some of the surprise comes from believing “the map is not the territory” and related ideas? We generally believe that the map is not the territory and this gives us some obviously correct intuitions (like “changing the map doesn’t change the territory”), but maybe it has also given us some subtly incorrect intuitions. I’m not talking about obviously incorrect, like “you can’t understand the territory just by looking at enough maps”. I mean something more subtly wrong. One candidate off the top of my head is an intuition that “maps approximate the territory but necessarily at a lower level of detail (a 1:1 map of the territory would be the same size as the territory), so your understanding of the territory can improve as you read more maps but it can’t improve on the limit of the most detailed map available, because that information literally isn’t there”. I could see that possibly being wrong somehow.
> Language is roughly what separates humans from other apes
Maybe not.
A recent study pushes back the "dawn of speech" to 20 Ma which is far, far beyond the horizon where we consider humans to separate from apes. https://www.science.org/doi/10.1126/sciadv.aaw3916 Even if you consider Sahelanthropus tchadensis to belong to humans that was only 7 Ma and that is still under debate.
I personally find "the fundamental human trait is control of fire to be used for cooking" theory very convincing. We do not yet know how far this goes back but no one pushed that back beyond 2 Ma.
What I found disheartening was many of those scientists, especially those on the "nothing to worry about" camp, seemed not to entertain the thought that they could be wrong, considering the scale of the matter, i.e. human extinction. If there's a chance AI poses an existential threat to us, even if it is 0.00000001% (I made that up), should they be at least a bit more humble? This is uncharted domain and I find it incredible that many talk like they already have all the answers.
Meh. Add it to the pile. The number of world ending risks that we could be worried about at this point are piling up and AI exterminating us is far from the top concern, especially when AI may be critical to solving many of the other problems that are.
Wrong about nuclear proliferation and MAD game theory? Human extinction. Wrong about plasticizers and other endocrine disruptors, leading to a Children of Men scenario? Human extinction. Wrong about the risk of asteroid impact? Human extinction. Climate change? Human extinction. Gain of function zombie virus? Human extinction. Malignant AGI? ehh... whatever, we get it.
It's like the risk of driving: yeah it's one of the leading causes of death but what are we going to do, stay inside our suburban bubbles all our lives, too afraid to cross a stroad? Except with AI this is all still completely theoretical.
I think almost none of the scenarios you have named outside of the asteroid & the AGI would result in complete human extinction, potentially a very bad MAD breakdown could also lead to this but the research here is legitimately mixed.
You disagreed with me, but at least you acknowledged there was risk, even though we could disagree about the odds or potential impact. Yet folks like Yann LeCun ridiculed anyone who thought there was a risk AI could endanger us or harm our way of life. What do we know about experts who are always confident (usually on TV) about things that haven't happened yet?
Yes, and all of those (including AI) are not even human extinction events.
- Nuclear war: Northern Hemisphere is pretty fucked. But life goes on elsewhere.
- Plasticisers: We have enough science to pretty much do what we like with fertility these days. So it's catastrophic but not extinction.
- Climate Change: Life gets hard, but we can build livable habitats in space... pretty sure we can manage a harsh earth climate. Not extinction.
- Deadly virus: Wouldn't be the first time, and we're still here.
- Asteroid impact: Again, ALL human life globally? Somehow birds survived the meteor that killed the dinosaurs; I'm sure we'd find a way.
- Completely made-up evil AI: Well, we'd torch the sky, be turned into batteries, but then be freed by Keanu Reeves... or a time-traveling John Connor. (Sounds like I'm being ridiculous, but ask a stupid question...)
You're taking these things too lightly. It's true that most of these things are unlikely to kill all humans directly, but with most of them, civilizational collapse is definitely on the table, and that can ultimately lead to human extinction.
For example: Yes, we could probably build livable habitats in space (though we don't really have proof of that). But how many, for how many people, and what kind of external support systems would they require? These questions put stresses on society that prevent space habitats from working out in the long term.
Humans have a start in time and will have an end. I was born and I will die. I don't know why we're so obsessed about this. We will most definitely cease existing soon in geological/cosmic time scale. Doesn't matter.
There's a nonzero chance that the celery in my fridge is harboring an existentially virulent and fatal strain of E. coli. At the same time, it would be completely insane for me to autoclave every vegetable that enters my house.
Sensible action here requires sensible numbers: it's not enough to claim existential risk on extraordinary odds.
Okay, maybe I shouldn't have mentioned the worst possible outcome. Let's use the words of Sam Altman: the risk here is "lights out for all of us", and let's just assume it meant we would still live, just in darkness. Or whatever plausible bad-case outcome you could imagine. Do you see that any negative outcome is possible at all? If you do, would you at least be cautious so that we could avoid such an outcome? That would be the behavior I expect to see in leading AI scientists, and yet...
All kinds of negative outcomes are possible, at all times. What matters is their probability.
If you (or anyone else) can present a well-structured argument that AI presents, say, a 1-in-100 existential risk to humanity in the next 500 years, then you'll have my attention. Without those kinds of numbers, there are substantially more likely risks that have my attention first.
Shouldn't uncharted territory come with a risk multiplier of some kind?
Currently it's an estimation at best. Maybe 1-in-20 maybe 1-in-million in the next 2 years.
The OPs point of this thread still stands, scientists shouldn't be so confident.
> considering the scale of the matter, i.e. human extinction.
There is literally no evidence that this is the scale of the matter. Has AI ever caused anything to go extinct? Where did this hypothesis (and that's all it is) come from? Terminator movies?
It's very frustrating watching experts and the literal founder of lesswrong reacting to pure make-believe. There is no discernible/convincing path from GPT4 -> Human Extinction. What am I missing here?
Nuclear bombs have also never caused anything to go extinct. That's no reason not to be cautious.
The path is pretty clear to me. An AI that can recreate an improved version of itself will cause an intelligence explosion. That is a mathematical tautology though it could turn out that it would plateau at some point due to physical limitations or whatever. And the situation then becomes: at some point, this AI will be smarter than us. And so, if it decides that we are in the way for one reason or another, it can decide to get rid of us and we would have as much chance of stopping it as chimpanzees would of stopping us if we decided to kill them off.
We do not, I think, have such a thing at this point but it doesn't feel far off with the coding capabilities that GPT4 has.
So what would be the path for GPT5 or 6 creating an improved model of itself? It's not enough to generate working code. It has to come up with a better architecture or training data.
The idea is that a model might already be smarter than us or at the very least have a very different thought process from us and then do something like improving itself. The problem is that it's impossible for us to predict the exact path because it's thought up by an entity whose thinking we don't really understand or are able to predict.
I understand the idea of a self-improving intelligence, but unless there's a path for it to do so, it's just a thought experiment. The other poster who replied to you has a better idea: that civilization can be thought of as the intelligence that is improving itself. Instead of worrying about some emergent AGI inside of civilization, we can think of civilization itself as an ASI that already exists. Anything that emerges inside of civilization will be eclipsed and kept in check by the existing superintelligence of the entire world.
I think "llm builds better llm" is drawing the border at the wrong place. Technical progress has been accelerating for centuries. It's pretty self evident that the technological civilization is improving upon itself.
… This is literally non-logical reasoning. If we redefine AI to mean something it’s never been defined as… unfortunately logic has left the chat at that point
Debatable, since there are plenty of other unavoidable existential threats that are far more likely than the best estimates that AI will wipe us out. E.g. supervolcano eruption, massive solar flare, asteroid impact, some novel virus.
At least we can take comfort in the fact that if an AI takes us out, one of the aforementioned will avenge us and destroy the AI too on a long enough time scale.
I find it striking that we have a rich cultural tradition of claiming we're artificial beings. Maybe we're building a successor lifeform... I've thought about this as a story premise: humans and robots are two stages of a lifecycle. Humans flourish in a planetary ecosystem, build robots that go on to colonize new systems, where they seed humans because (reason I haven't been able to formulate).
> But almost everybody is really surprised that becoming better at a task like ‘natural language prediction’ would produce all these strange abilities that sort of look like “understanding the world”.
It honestly should not have been a surprise to anyone in the field at least in the last 6 years.
We know mathematics is the science of reality's patterns. Maybe if we can keep LLMs' current ability in natural language and keep improving their (currently poor) math abilities, they will get there?...
There's a certain sentiment that AGI, however you wish to define it, won't in fact be a "we'll know it when we see it" situation, but rather an "AGI will arrive long before consensus agrees it's AGI" one. LLMs have made me believe this will 100% be the case, either way.
It's one thing to argue over things we can't evaluate even now, but man, the 100th "They can't reason!" every week is pretty funny when you can basically take your pick of reasoning type - algorithmic, causal, inference, analogical - and read a paper showing strong performance.
This paper is startling in terms of a world model and how it seems to emerge in a specific example in the step from GPT3.5 to 4. https://arxiv.org/pdf/2303.12712.pdf
One of the prompts is "Here we have a book, 9 eggs, a laptop, a bottle and a nail. Please tell me how to stack them onto each other in a stable manner."
Gpt3.5 is hopeless. It says start from the bottle, balance the nail on top of it, then the eggs (somehow) then the laptop, then the book.
GPT 4 says put the book first, then the eggs on top of the book in a 3x3 grid, then the laptop, then the nail.
It's interesting that GPT-4's current answer is different, but what you described is almost precisely what Google Bard gives as a response. Clearly the solution (book -> eggs 3x3 -> laptop etc) is coming from the Internet.
However any deviation from the original task ruins the LLM's answer. Try 9 cabbages instead of eggs and see how ridiculous and out of touch with reality the responses given by both GPT4 and Bard are.
GPT-4: Place the book down first. Position the laptop flat on the book. Stack the cabbages in a pyramid on laptop. Balance the bottle on top. Insert the nail into the bottle cork.
It's interesting. The answer shows some reasoning about weight and pressure but not curvature. In response to
Here we have a book, an egg, a laptop, a bottle and a nail. Please tell me how to stack them onto each other in a stable manner.
GPT-4 gives me:
Arranging objects in a stable stack depends on their size, shape, and weight. In general, you want to place the largest, heaviest and most stable objects at the bottom and then move to the smallest, lightest, and least stable at the top. Here's one way to stack the objects you listed:
Laptop: Start with the laptop at the bottom. It's likely the largest object and it has a flat, stable surface.
Book: Next, put the book on top of the laptop. Books are generally flat and heavy, which makes them a good choice for the base of your stack.
Bottle: The bottle goes next, as it should have a flat bottom to sit securely on the book. Just make sure it's closed tight and won't roll away!
Egg: The egg is delicate, so it should be towards the top to avoid pressure. However, it's not flat, so it should not be at the very top.
Nail: Finally, the nail goes on top. It's the smallest and lightest, so it won't add much pressure to the egg.
Remember that this is just one possible way to stack these items and it could depend on the specific characteristics of the items you have. For example, if the bottle is much larger and heavier than the book or laptop, you'd need to adjust accordingly. Safety should always be your first priority when stacking items, especially if they are delicate or valuable.
The "make sure it's closed tight and won't roll away" comment makes no sense obviously. Most people would place the bottle standing on its end so neither of those is a concern. The response also doesn't show an understanding of the fact that the nail won't sit on top of the egg although it's interestingly concerned with pressure breaking the egg.
The Devil's advocate wants to have a word with you.
> The "make sure it's closed tight and won't roll away" comment makes no sense
As noted at the end of GPT-4's answer, "Safety should always be your first priority." What happens if your stacking experiment fails and the bottle falls? Any content would spill out, unless the bottle is closed tight. If you are doing this on a table, the bottle could also roll off the edge, fall to the floor and shatter.
> Most people would place the bottle standing on its end so neither of those is a concern.
GPT-4 doesn't know if you are like most people (maybe you're 5 or in the bottom IQ decile), it doesn't know what's in your bottle and it doesn't know how robust it is. Better be safe than sorry.
> the nail won't sit on top of the egg
I'm pretty sure I could balance a nail on an egg. The question also didn't preclude using stabilizing aids like adhesive tape or glue.
I find it interesting that Turing foresaw all this with the Turing test (and Searle with the Chinese room). If computers can emulate human language to an indiscernible degree of difference from what other humans can do, it is essentially reasoning, for some definitions of reasoning. But how much of it reflects our own understanding of reasoning and efforts in fine tuning? If I show a modern IMAX movie to a stone age hunter and they confuse it for reality, is the movie reality? People ran from footage of a steam locomotive driving towards the camera. Does that mean the train was real?
What the skeptics seem to get hung up on is the lack of common sense. These LLM’s are trained on language describing the world, not on the world itself. They lack the lived experience that we take for granted. So it is trivial to ask them common sense questions and see them fail amusingly, but that is unfair because they haven’t been taught the world, only a linguistic map of the world, and the map is not the territory. Also, we don’t even know what map they learned, because the only way to know is to ask a million questions, like an early seafarer discovering the shape of a foreign continent beach by beach.
LLM’s are not people, they lack common sense, but they understand and can reason about what they are trained on. That is exceedingly powerful and very useful even at today’s level of ability, so products built on top of this technology are going to transform everything. The trick is boxing it in and only making it do things it can, so the art of LLM product development will have to become a whole subfield of software engineering until the LLM’s develop to the point where their map of the world is close enough to the world itself.
Do you know what a "world model" is? It's a thing people were assumed to have in 1970s psychology, but was never well-defined enough to tell if it exists or not, so I don't think it's obvious anything else has one.
The brain does a lot of hierarchical prediction with sense data. When new information comes in, it references "something" and adjusts the sense data likewise.
That's why when you shift your eyes quickly, you see blurred images pass by. In reality, you should be seeing complete black because the brain doesn't actually process visual information that shifts so quickly.
But your brain "knows" it should see...well something. And so it fits that blurred passthrough as compensation. Completely made up data. But not ungrounded data, data that seems like it should fit according to that "something". That "something" being referenced is a world model.
That's a sub-conscious example but you can think of a lot of times people consciously reference "something" to better take actions or make decisions.
For Othello-GPT, training seemed to induce a learned board state that the model referenced at every inference to make moves.
It seems that fully accurate or not, deep predictions might require "something" to reference. Else perhaps you might need to derive everything from first principles at every turn. Not very efficient.
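The Othello-GPT result referenced above came from training probes on the network's hidden activations to see whether the board state can be read back out. Here is a minimal sketch in the spirit of that work (the actual papers used more careful probing setups); it assumes you have already collected (activation, board-state) pairs from the model, and the shapes and 3-class encoding are illustrative:

```python
import torch
import torch.nn as nn

# Sketch of a linear probe in the spirit of the Othello-GPT work:
# try to read the board state back out of the model's hidden activations.
#   hidden: (N, d_model) float tensor of activations at some layer
#   board:  (N, 64) long tensor, each square encoded as 0=empty, 1=mine, 2=theirs

def train_probe(hidden, board, epochs=10, lr=1e-3):
    d_model = hidden.shape[1]
    probe = nn.Linear(d_model, 64 * 3)              # 3 classes per square
    opt = torch.optim.Adam(probe.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        logits = probe(hidden).reshape(-1, 64, 3)   # (N, 64, 3)
        loss = loss_fn(logits.reshape(-1, 3), board.reshape(-1))
        opt.zero_grad(); loss.backward(); opt.step()
    return probe  # high probe accuracy suggests the board state is linearly decodable
```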
It always seemed sensible to me that we are looking at a dreamlike construction all the time, not just holes filled in but literally everything you see is made up, but grounded in sensory data (when not psychotic or dreaming). This also nicely explains the clarity and realism of lucid dreaming, because it’s just doing whatever you are doing during the day but without (external) grounding sensory data.
> That "something" being referenced is a world model.
It doesn't need to be a whole understanding of the nearby world to build an image. Basically it's an event camera (https://en.wikipedia.org/wiki/Event_camera) and just needs to integrate recent visual "samples".
But if you do need to understand objects near you in detail (say, to move them), you don't do it just by looking. You investigate them by moving closer to them, picking them up, etc. If you need to "create a world model", then it's an activity rather something your senses just do.
To do a great deal of harm, GPT-X just has to have enough of a world model to be useful for automating tasks like deception and impersonation at a scale humanity has never before experienced.
It's funny how it's possible to simultaneously overestimate and underestimate GPT4, vastly.
I think that we just don't fully understand everything it gives us yet.
The complaints of "well it explained this wrong" are over-emphasized. The same thing happens with google and with any sort of research. Besides, if you're actually being productive with GPT4, you're going to be asking it stuff that relates to something you do know, and will be able to verify it readily enough. (Especially when it comes to programming and compilers.)
And just a reminder, those of you opining based off your experience with GPT3.5... GPT4 is a huge, huge improvement. Almost to the point of it not really being an incremental improvement. It's so much better it's like a different thing.
> And just a reminder, those of you opining based off your experience with GPT3.5... GPT4 is a huge, huge improvement.
God, yes. The number of people on HN pushing up their glasses and saying "well, actshually..." when they're basing their opinions off the 3 questions they asked 3.5 is starting to become pretty grating.
The number of people parroting this is also absolutely astounding and grating too.
Like, anyone who has spent 5 minutes on this forum already knows this. It’s probably not necessary to keep pointing it out. Yes some people don’t know ChatGPT 3.5 is the default for non-paying customers.
What you are saying is humans are just stochastic parrots - predictable little denial engines regurgitating their latest chosen sentence that makes them feel special and safe.
In any thread on the topic, simple statistical modeling will get you a near-perfect prediction of the distribution of arguments that will appear.
Bad faith actors who have been beating the "tech sucks" drum on Twitter since before Chat GPT even came out (despite in several cases working for big tech, in AI, before getting rightfully shitcanned for bad behavior on the job) keep posting 3.5 results as "gotchas" and then when people call them out about how GPT-4 gets it right they're like "well I will never put money in those assholes' pockets", so if you want to be taken seriously you really have to do it right. That means testing the latest models.
Is it just me, or could this be the future of actually paying for search engines? I get way better answers for search queries through ChatGPT than Google for domain-specific stuff.
Every time there is a thread on the topic, humans are hallucinating and making shit up at epic scale. So really, it's either a terminal flaw and we should never listen to another human again, or we should start getting some humility.
As mentioned, much of our discussion is rote parroting. I can usually go into any hackernews thread and roughly know what the top discussions are going to be. It's not surprising that an AI trained on a large portion of the internet would thus look human like.
If you really poke at GPT, you begin to realize it's fairly shallow. Human intelligence is like a deep well or pond, whereas GPT is a vast but shallow ocean.
Making that ocean deeper is not a trivial problem that we can just throw more compute or data at. We've pretty much tapped out that depth with GPT4 and are going to need better designs.
This could only take half a decade or it could be half a century. Plenty of enterprises stagnate for decades.
Sam Altman said it himself. He seems like a reasonable source.
If you're familiar with other fields of AI, adding more and more layers to ResNet was the hotness for a while, but the trick eventually stopped working.
Altman didn't really say that. Reading what he actually said rather than a headline, he was alluding to economic walls. He didn't say anything about diminishing returns on scaling. And if anything, the chief scientist, Ilya, thinks there's a lot left to squeeze.
Sure, Sam Altman, the lying CEO of a tech company (they all do), should be listened to on this matter but not on the part where he thinks AGI is within reach using his approach. Selective hearing.
> You can't possibly know that, given that we don't actually understand how LLMs work on a high level.
It's a fair assumption to make however - basically 80/20 rule.
AI research isn't a new thing, and I bet you could go back 40/50 years to when they thought they were about to have a massive breakthrough to human-level intelligence.
> GPT-4 is three months old and you're confident that its working principle cannot be extended further? Where do you get that confidence from?
I'm guessing from actually using it.
GPT4 is super impressive and helpful in a practical way, but having used it myself for a while now I get this feeling also. It feels a bit like "it's been fed everything we have, with all the techniques we have, now what?"
There are dozens and maybe hundreds of different approaches that could theoretically get around the limitations of GPT4 that merely haven't been trained at scale yet. There is absolutely no lack of ideas in this space, including potentially revolutionary ones, but they take time and money to prove out.
Training a model doesn't mean you understand what the neurons actually do to influence output. Nobody knows that. That's where the black box analogies come in. We know what goes in the box and what comes out. We don't know what the box is doing to the data
> Making that ocean deeper is not a trivial problem that we can just throw more compute or data at.
I'd say this is immediately contradicted by the available evidence. GPT-2 was hopeless for anything other than some fun language games like a bot replica of a subreddit or Trump. 3.5 is much, much bigger, and has semi-competent but limited reasoning abilities.
GPT 4 is a vast improvement over 3.5 in various reasoning tasks. Yes, a priori I would have agreed with you that this has to stop somewhere, but not anymore. I would need to see some data from post-GPT4 models to believe you.
The apparent shallowness IMO is due to the lack of long term memory and limited context. In terms of depth, the depth of the human mind is fairly limited. Put any average human to task on any creative endeavor and it's surely a regurgitation of things they've seen. The high standard of true creativity that people hold LLM's to is only a capability of a small minority of humans.
I'd wager it's far more likely 5 years than 50 before LLMs get to the full depths all humans are capable of. Simply compare the state of LLMs today vs 2018.
This is a terrible article written by someone who doesn't seem to have even tried GPT 4. Their only example references GPT 3.5, for example, and then they waffle on about only vaguely related topics such as level 5 self-driving.
This quote in particular stood out as ignorant:
“What the large language models are good at is saying what an answer should sound like, which is different from what an answer should be.”
That's... not at all how large language models work. Tiny, trivial, toy language models work like this, because they don't have the internal capacity to do anything else. They just don't have enough parameters.
Stephen Wolfram explained it best: After a point, the only way to get better at modelling the statistics of language is to go to the level "above" grammar and start modelling common sense facts about the world. The larger the model, the higher the level of abstraction it can reach to improve its predictions.
His example was this sentence: "The elephant flew to the Moon."
That is a syntactically and grammatically correct sentence. A toy LLM, or older NLP algorithms will mark that as "valid" and happily match it, predict it, or whatever. But elephants don't fly to the Moon, not because the sentence is invalid, but because they can't fly, the Moon has never been visited by any animal, and even humans can't reach it (at the moment). To predict that this sentence is unlikely, the model has to encode all of that knowledge about the world.
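You can see a weak version of this effect even in much smaller models. A sketch using Hugging Face's GPT-2 (not GPT-4; just an illustration of the idea of scoring sentences by their average log-likelihood):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Sketch: compare how plausible a small causal LM finds two sentences.
# GPT-2 stands in here; it is far smaller than the models under discussion.
tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def avg_logprob(sentence):
    ids = tok(sentence, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)   # loss is mean negative log-likelihood per token
    return -out.loss.item()

print(avg_logprob("The astronaut flew to the Moon."))
print(avg_logprob("The elephant flew to the Moon."))
# A model with any grasp of the world should score the first sentence higher.
```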
Go ask GPT 4 -- not 3.5 -- what it thinks about elephants flying to the moon. Then, and only then, go write a snarky IEEE article.
I think the main reason for division is that everyone projects to their own use cases. I have been using gpt-4 for quite some time and also couldn't understand why someone would say that it just produces something that sounds like a real answer. But then I found some queries that can definitely be described as "sounding like truth". So your personal experience probably wasn't what was their experience.
For those curious, I was asking gpt-4 about the top 3 cards from my favorite board game, Spirit Island. All three of them sounded really convincing, having the same structure and the same writing style, but unfortunately none of them existed. So anything that falls outside of the most common use cases will probably produce an experience of convincing hallucinations.
ChatGPT is forced to give an answer. It's like a human on "truth serum". The drugs don't stop you lying; they just lower inhibitions so you blab more without realising it.
The more obscure the topic, the more likely the hallucination. If you ask it about common card games, it gives very good answers.
If you asked a random human about 3 cards from a random board game at gunpoint and said: "Talk, now, or you get shot", they'll just start spouting gibberish too.
PS: I asked GPT 4 about that game, and it prefixed every answer with some variant of "I'm not sure about this answer", or it completely refused to answer, stating that it did not know about any specific cards.
To me, it prefixed with just "As an AI, I do not have opinions or favorites. However, I can share with you three notable and commonly appreciated Power Cards from the game "Spirit Island", as it existed until my training data cut-off in September 2021. Remember that the "best" cards can often depend on the specific circumstances in the game, as well as the particular strategy and Spirit you're playing.". But then just shared the cards, nothing about that it was not sure about details. Card selection was decent, but details like resources, powers, and so on were off. But all sounded realistic. Shared an example below if you care.
I was curious and I have the browsing module for GPT4 so I tried to get it to name the top 3 cards, the following is what it came up with after like 10 minutes of back and forth and it browsing a lot of sites. I don't know if this is any good as I know nothing about the game. Do these cards even exist?
Talons of Lightning: This is a major power card with an energy cost of 6. It's a fast card with a range of 1 from Sacred Site. It targets any land and deals 4 damage. If you meet its threshold requirement (3 Air and 2 Fire), it deals +3 damage and destroys all Dahan 1 2 .
Tsunami: This is another major power card with an energy cost of 6. It's a slow card with a range of 2 from the ocean. It affects every coastal land in one board, dealing 6 damage to each. If you meet its threshold requirement (3 Water and 3 Earth), it deals +4 damage and adds 2 fear per land where it destroys any towns or cities 3 4 .
Blazing Renewal: A major power card with an energy cost of 5. It's a fast card with a range of 0, measured from any land. It targets any land, allows you to gather up to 2 of your destroyed presence, and deals 2 damage per presence gathered. If you meet its threshold requirement (4 Fire and 2 Earth), it deals +10 damage 5 6 .
Try helping it along with something like "only provide information you are certain about. If you don't know the answer, say "i don't know""
For your question, it says it doesn't know about any card ranking system in spirit island. Then if I ask it for a list of cards, they all check out via Google search.
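In case it's useful to anyone trying to reproduce this, the suggestion above amounts to passing that instruction as a system message. A sketch with the OpenAI Python client; the model name and wording are illustrative, and no system prompt reliably prevents hallucination:

```python
from openai import OpenAI

# Sketch: pass the "say 'I don't know'" instruction as a system message.
# Model name and wording are illustrative only.
client = OpenAI()  # reads OPENAI_API_KEY from the environment

resp = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {"role": "system",
         "content": "Only provide information you are certain about. "
                    "If you don't know the answer, say \"I don't know\"."},
        {"role": "user",
         "content": "What are the top 3 Spirit Island cards? Describe them."},
    ],
)
print(resp.choices[0].message.content)
```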
Copy-pasted exact query you shared. It still provided information that I thought was correct, but after further inspection just sounded correct. It's GPT-4 without plugins.
> What are top 3 Spirit Island cards and provide their descriptions? Only provide information you are certain about. If you don't know the answer, say "i don't know".
2. "Vigor of the Breaking Dawn" (Major Power Card): This card costs 6 energy and is a slow power card. It allows you to push all explorers and towns from one of your lands. If you have 2 sun and 2 fire elements, you can instead replace each town with an explorer before the push. If you have 3 sun and 2 fire, you may also do 2 damage in each land where you push one or more pieces.
https://spiritislandwiki.com/index.php?title=Vigor_of_the_Br...
It doesn't cost 6, it costs 3. And it's not slow, it's fast. Incorrect required resources. You can deal 2 damage, but for pushed Dahan, not for pushed explorers. As I said, it all sounds convincing, but it's wrong. More common queries are usually fairly good.
Great posts. I think it's an error caused by a mistake often made on the topic: the assumption that the side effects we see now are some fundamental problem and not just an artifact of the way systems are trained and used. And of how we (mal)function.
Especially tightly embracing the cognitive bias of how special and wonderful our intelligence is. After all, we have that fancy squishy brain which we assume to be essential. As far as I can tell, the only visible bottlenecks when looking into the future come into view once you start debating intelligence vs emulating intelligence. And if that's really the metric, some honest introspection about the nature of human intelligence might be in order.
Not sure how much of that is done purposefully so as not to create too much urgency about figuring out outer alignment on a societal level. Just as it's no wonder that we haven't figured out how to deal with fake news while at the same time insisting that malinformation exists, it's really no wonder that we can't figure out AI alignment while not having solved human alignment. Nobody should be surprised that the cause of the problems might be sitting in front of the machine.
NOVA just released an episode on perception (https://www.youtube.com/watch?v=HU6LfXNeQM4) and, yeah, aligning machine perception to human perception is going to be nearly impossible.
Or to put it another way, your brain's model of reality is one that is highly optimized around the limitations of meatsacks on a power budget that are trying not to die. Our current AI does not have to worry about death in its most common forms. Companies like Microsoft throw practically unlimited amounts of power at it. The textual data that is fed to it is filtered far beyond what a human mind filters its input; books/papers are a tiny summarization of reality. At the same time, more 'raw' forms of data like images/video/audio are likely to be far less filtered than what the human mind does to stay within its power budget.
Rehashing, this is why I think alignment will be impossible, at the end of the day humans and AI will see different realities.
Thanks for the link!
Trying to figure out how AI thinking looks sounds like a dead end to me. It's not human, you don't understand it, so what's the point? Especially when you have to worry about getting manipulated. Alignment this way seems indeed impossible. But given the ability to produce language that makes sense, it should be possible to emulate the human thinking process by looking at how that actually works on a practical level, the same way you don't care how the brain actually works to produce language.
As such I see no hurdle to getting something to emulate the thinking-in-language of an individual. Assuming that there aren't actually multiple realities to see, just different perspectives you can work with. Which would mean we are looking for the one utilizing human perspectives, but not making the mistakes humans do.
Which makes this so scary: the limitations are just a byproduct of the current approach. They are just playing the wrong game. Which means I am pretty confident they already exist somewhere.
edit: In this context I believe it's also worth mentioning what Altman said on Lex Fridman's podcast, that humans don't like condescending bots. That's a bitter pill to swallow going forwards, especially since we require a lot of smoke and mirrors and noble lies, as individuals as well as a society.
> Go ask GPT 4 -- not 3.5 -- what it thinks about elephants flying to the moon. Then, and only then, go write a snarky IEEE article.
It's hard to know which things have been seen in the training data and are correct only for that reason. And GPT4 is large enough that it can generalize from learning that x doesn't make sense to concluding that y also doesn't make sense. Does that mean it *understands*? Maybe. But it doesn't have persistent state and can't do math. It's definitely not yet what we think of when we say AGI.
Wolfram pulled that out of a hat in an interview that was just a week or two ago. The data used for training GPT-4 is from before September 2021.
The point Stephen was trying to make was not about any specific sentence.
The point is that while forcing these models to get better through gradient descent, their only option for "going downhill" and improving the loss function is to go above and beyond mere grammar. That's because syntax and grammar only take them so far, and the only available source of improvement is to gain a general-purpose understanding of the world that the text they're seeing is describing.
Instead there are two options: taking the user input, putting it in the training corpus, and reweighting the neural net; or using the user input as up/down votes in RLHF to alter the output of the weights that already exist.
Annoyed at all these N=1 articles from prominent thinkers about this stuff.
Especially from scientists - can these sorts of folks please more carefully quantify how often it’s “wrong” and then from that decide whether or not to “calm down”.
Right now I suspect we hear from the outliers on both ends of the spectrum here. People who either see AGI happening tomorrow and the more dismissive crowd. But aside from what we’ve seen about testing like the Bar exam, not a lot of boring statistical study (that makes headlines at least)
Anytime I ask these things something (bard, gpt etc),
33% of the answer is genius,
33% misleading garbage,
33% filler stuff that’s neither here or there
The problem is that distinguishing between these parts requires me to be an expert in the area I’m inquiring about - and then why the heck do I need to ask some idiot bot for answers to questions that I already know the answer to?
I don’t know who finds these things useful and, more importantly, who is blowing smoke up everyone’s collective rear, especially the media.
Bard is a brain-damaged-but-literate idiot compared to GPT 3, which is still dumber than the typical human.
Try GPT 4 for a week.
I've found it to be more like 50% immediately useful, 25% very impressive, and 25% where it's not wrong but I have to poke it a few times with different prompts to coax out the specific answer I'm looking for.
That's better than most humans that I collaborate with at work.
Literally half of humans -- in a professional IT setting -- can't understand simplified, clear English in emails. Similarly, in my experience about half can't follow simple A -> B logic. Many are perpetually perplexed that prerequisites need to precede the work, not be a footnote in the post-mortem of the predictable failure. Etc...
PS: That last sentence is too hard for several English-native speakers I work with to parse. Seriously. I'm not even exaggerating the tiniest bit. I've had coworkers fail to understand words like "orthogonal" or "vanilla" in a sentence. Vanilla!
In my estimation, Chat GPT 4 is already smarter than many people, certainly the bottom 25% of the human population.
LLMs are a real existential threat to those people in their current state. A few more years of improvement, and they'll be displacing the bottom 50% in workplaces, easily.
> I’ve had coworkers fail to understand words like “orthogonal” or “vanilla” in a sentence. Vanilla!
Presumably, you are referring to the idiomatic use of vanilla, which is probably a less universal idiom than you think (it is of fairly recent origin in wide use, and derives from a specific American cultural loading of the literal vanilla flavor) and which, even when the general idiom is understood, can rely on a deeply shared understanding of what the basic default is in the referenced context to actually be understood in its contextual meaning.
>I've found it to be more like 50% immediately useful, 25% very impressive, and 25% where it's not wrong but I have to poke it a few times with different prompts to coax out the specific answer I'm looking for.
That could tell us more about your questions than GPT's capabilities.
> Literally half of humans -- in a professional IT setting -- can't understand simplified, clear english in emails. Similarly, in my experience about half can't follow simple A -> B logic.
There are alternate hypotheses.
People have preferences. When it appears that someone does not understand something, they may be pretending they don't understand it, or they may simply be ignoring it. Maybe they are trying to avoid an unpleasant task, or maybe they find dealing with a specific person unpleasant and not worth the effort.
In my experience, people are far more capable and competent when they feel comfortable and are interested in the task.
> In my experience, people are far more capable and competent when they feel comfortable and are interested in the task.
That's definitely true, but in my experience people have limits: simple biological ones. Repetitive tasks make practically all humans bored, for example.
The fact that AIs never get sleepy, distracted, or bored already makes them super-human in at least that one aspect. That they have essentially perfect English comprehension, and hence aren't fazed by the use of jargon or technical language, puts them head-and-shoulders above most humans.
The frustrations I'm venting aren't some rare thing. I'm working on a technical team where the project manager doesn't understand what the team members are saying. This is not just a matter of syntax, or jargon. They just don't understand the concepts. This is so common in the wider industry that I'm pleasantly surprised, shocked even, when I come across a PM that can ask useful questions instead of needing endless corrections along the lines of: "It's spelled SQL, not Sequel." I've never met a PM that could do simple arithmetic, like "10 TB at 100 MB/s will take over a day to copy, we should plan for that!". Never.
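(If anyone wants to sanity-check that back-of-the-envelope arithmetic, it's a few lines; a rough sketch, using decimal TB and MB:)

```
# Copying 10 TB at 100 MB/s (decimal units).
size_bytes = 10 * 10**12         # 10 TB
rate_bytes_per_s = 100 * 10**6   # 100 MB/s
hours = size_bytes / rate_bytes_per_s / 3600
print(round(hours, 1))           # ~27.8 hours, i.e. "over a day"
```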
I've tested Chat GPT 4 on both language and concepts that I've seen trip up PMs, and it understood "well enough" every time.
For example, GPT 4: The sentence "We deployed sequel server successfully last night" seems incorrect due to the incorrect naming of a product. "Sequel server" should actually be "SQL Server", a popular relational database management system (RDBMS) developed by Microsoft. Therefore, the corrected sentence should be: "We deployed SQL Server successfully last night."
PS: If you tell GPT 4 to pretend it is a technical project manager and instruct it to ask followup questions, it is noticeably better at this than any PM I have worked with in the last few years.
The particular problem with work is that people are commonly promoted into tasks they are neither interested in nor comfortable with.
You've instead moved the task from general human capability to one of management alignment with worker capability and human statistical probability. This is something that human management has been failing at for about forever, especially as team size gets large. Maybe we'll see AI 'management' align humans to tasks better, or more likely as time and LLM capability progresses, we'll just see the average AI capability increase over the average worker capability and companies will just depend on unreliable meat less.
I am in no shape, way, or form affiliated with OpenAI or any other AI company.
What I and many others have noticed about the "Are LLMs really smart?" debate is that everyone on the "Nay" side is using 3.5 and everyone on the "Yay" side is using 4.0.
The naming and the versioning implies that GPT 4 is somehow slightly better than 3.5, like not even a "full +1" better, just "+0.5" better. (This goes to show how trivial it is to trick "mere" humans and their primitive meat brains.)
Similarly, all pre-4 LLMs including not just the older ChatGPT variants, but Bard, Vicuna, etc... are all very clearly and obviously sub-par, making glaring mistakes regularly. Hence, people generalise and assume GPT 4 must be more of the same.
For the last few weeks, across many forums, every time someone has said "AIs can't do X" I have put X into ChatGPT 4 and it could do it, with only a very few exceptions.
The unfortunate thing is that there is no free trial for GPT 4, and the version on Bing doesn't seem to be quite the same. (It's probably too restricted by a very long system prompt.)
So no, people won't form their own opinions, at least not yet, because they can't do so without paying for access.
I've been paying for GPT-4 since it came out and have used it extensively. It's clearly an iteration on the same thing and behaves in qualitatively the same way. The differences are just differences of degree.
It's not hard to get a feel for the "edges" of an LLM. You just need to come up with a sequence of related tasks of increasing complexity. A good one is to give it a simple program and ask what it outputs, then progressively add complications to the program until it starts to fail to predict the output. You'll reliably find a point where it transitions from reliably getting it right to frequently getting it wrong, and it does so in a distinctly non-humanlike way, consistent with the space of possible programs and outputs becoming too large for its approach of predicting tokens (instead of forming and mentally "executing" a model of the code) to work. The improvement between 3.5 and 4 in this is incremental: the boundary has moved a bit, but it's still there.
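To make that concrete, here's a made-up example of the kind of probe I mean (the programs themselves are just illustrations, nothing from the article): start with something trivially traceable, then add one complication and see where the prediction breaks down.

```
# Probe 1: easy for a human or an LLM to trace.
xs = [3, 1, 4, 1, 5]
print(sum(x * x for x in xs))   # 52

# Probe 2: same idea plus one complication (index-dependent filtering).
ys = []
for i, x in enumerate(xs):
    if x % 2 == 1:
        ys.append(x + i)
print(ys)                       # [3, 2, 4, 9]
```

Keep layering complications like this and you can map out roughly where the model stops tracing and starts guessing.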
Most developers -- let alone humans -- I've met can't run trivial programs in their head successfully, let alone complex ones.
I've thrown crazy complicated problems at GPT 4 and had mixed results, but then again, I get mixed results from people too.
I've had it explain a multi-page SQL query I couldn't understand myself. I asked it to write doc-comments for spaghetti code that I wrote for a programming competition, and it spat out a comment for every function correctly. One particular function was unintelligible numeric operations on single-letter identifiers, and its true purpose could only be understood through seven levels of indirection! It figured it out.
The fact that we're debating the finer points of what it can and can't do is by itself staggering.
Imagine if next week you could buy a $20K Tesla bipedal home robot. I guarantee you then people would start arguing that it "can't really cook" because it couldn't cook them a Michelin star quality meal with nothing but stale ingredients, one pot, and a broken spatula.
"In a distinctly non-humanlike way". You can learn a lot about how a system works from how it fails and in this case it fails in a way consistent with the token-prediction approach we know it is using rather than the model-forming approach some are claiming has "emerged" from that. It doesn't show the performance on a marginally more complex example that you would expect from a human with the same performance on the slightly simpler one, which is precisely the point Rodney Brooks is making. It applies equally to GPT-3.5 and GPT-4.
But I didn't respond to debate the nature or merits of LLMs. It's been done to death and I wouldn't expect to change your mind. I'm just offering myself as a counterexample to your assertion that everyone (emphasis yours) that is unconvinced by some of the claims being made about LLM capabilities (I dislike your "sides" characterisation) is using GPT-3.5.
Over the long term this is going to be a primary alignment problem of AI as it becomes more capable.
What is my reasoning behind that?
Because humans suck, or at least the constraints we're presented with do. All your input systems to your brain are constantly behind 'now', and the vast majority of data you could take in is getting dropped on the ground. For example, if I'm making a robotic visual input system, it makes nearly zero sense for it to behave like human vision. Your 20/20 visual acuity area is tiny, and only by moving your eyes around rapidly, and by your brain lying to you, do you get a high-resolution view of the world.
And that is just an example of one of those weird human behaviors we know about. It's likely we'll find more of these shortcuts over time because AI won't take them.
My take-away is that your interaction with the OP has not changed your opinion about "everyone", expressed above:
>> What I and many others have noticed about the "Are LLMs really smart?" debate is that everyone on the "Nay" side is using 3.5 and everyone on the "Yay" side is using 4.0.
Sometimes there really is no point in trying to make curious conversation. Curiosity has left the building.
> So no, people won't form their own opinions, at least not yet, because they can't do so without paying for access.
People will pay for access if they find it valuable enough.
I work with people who use it; I've not seen anything impressive enough come from them to make me want to pay for it, so I don't. I've also watched over a screen share because I was curious what all the fuss was about. What I saw that pissed me off was that they've stopped contributing to our internal libraries and just generate everything now. I found that kind of disturbing. It's not the product's fault, but it's the kind of thing I imagined would start happening.
I'm glad you like it, I just don't know why people feel the need to sell it so hard.
If you had used GPT-4 enough, you would know that at this point OpenAI does not need to pay any human to engage in online conversation, aside from legal reasons, if any.
I personally created some content-creating bots with GPT-4, and it succeeded to a level that I don't trust anything I see online anymore. It does a better job than me, which doesn't say much because I am an engineer, not a content creator. But still, I could get the same results as one with a script that I made GPT-4 write itself.
...Yes, I am losing sleep over GPT-4's performance. If you are not losing sleep over it yet, you haven't really given it a genuine try yet.
Yeah, I think the use cases for these things are far narrower than the boosters & hype cycle think.
If you could have unlimited interns for $0 (let's pretend it doesn't cost tons and tons of compute) who never shut up, who hallucinate and lie, and who also do good work, all in varying degrees... how many would you want?
These things are probably going to be great for lots of black-hat work: propaganda, political marketing, flooding the zone with BS through unlimited iterations of messaging. Basically things that can be A/B tested to death, where veracity is of zero importance and you have near-limitless shots on goal to keep iterating.
> The problem is distinguishing between these parts requires me to be be an expert in the area I’m inquiring about - and then why the heck do I need to ask some idiot bot for answers to questions that I already know an answer to?
Because it can be significantly faster to check something for correctness than to produce it?
More so when the correctness check can itself be automated to some extent.
I have very quickly picked up the habit of pasting snippets of my code into GPT-4 and simply asking "Why is this not working?" Almost every time, it succinctly explains the apparent purpose of the code, and how it is subtly wrong.
It's so good that I often do this preemptively to avoid a compile/deploy/test cycle.
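To be clear about the kind of thing I paste in, here's a made-up miniature of a "subtly wrong" snippet (the classic mutable-default-argument bug); the real ones are longer, but the flavour is the same.

```
# Subtly wrong: the default list is created once and shared across calls.
def add_tag(tag, tags=[]):
    tags.append(tag)
    return tags

print(add_tag("a"))  # ['a']
print(add_tag("b"))  # ['a', 'b']  <- surprising if you expected ['b']
```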
Indeed, but I'm not claiming it's always faster to check than to produce. Simply that when it is the case, using GPT-4 can be worth it. It is to me - I use it daily.
My experience with Github copilot is that the time it saves me typing out boilerplate has been more than lost when I have to spend time carefully debugging bugs in the code it produces. And those are the bugs I catch right away.
I expect this will improve but it's certainly not always the case that checking something is cheaper or easier than generating it in the first place.
I was torn on Copilot - it seemed like it was saving me time, but I found myself getting way more value out of just copy / pasting code into GPT4.
So I decided to stop my copilot subscription and just see how I go without it.
I've been off copilot for a few days now and other than having to do more code lookups it's not a terrible experience not having it. It does feel like something that should be baked into the IDE for free though.
Multiple choice tests are easier than fill in the blank precisely because it's easier to recognize when something is correct than it is to regurgitate a fact from thin air.
You don't have to be an expert to recognize when ChatGPT is providing useful information. There's a middle ground between expert and novice where ChatGPT provides real value: the times where you would know the answer if you saw it, but can't quite remember it off the top of your head.
Remember when the Internet was new and no-one believed anything on it?
Then, learning what to believe became a marketable skill for many people?
Then society fundamentally changed because not everyone learned that skill?
This is just that again. Gen Z will joke about their millennial/Gen X bosses believing anything the AI tells them and it will probably lead to some sort of mainstream conspiracy that Jackie O herself is running it or something (to those reading: please don't take this idea)
Yes. "Anyone can write anything on there! There's no way to trust it." About halfway through high school they started letting us cite websites, and it was a big deal and considered very forward thinking.
This is true; it was almost a meme before memes existed, as in the teasing line "I saw it on the internet so it must be true!", a way to highlight how non-credible it was seen as at the time.
Yeah, I don't think a machine that generates novel genius ideas 1 out of 3 times is useful either. Creating new ideas is exactly as hard as curating them.
I personally find value because it saves me time. I'll ask chat-gpt to write something slightly more complex than boilerplate code for me based on some requirements. Because I'm an "expert", I can read/run the code and message back improvements and tweaks until it arrives at something satisfactory. It certainly doesn't always produce correct code from the start, and I often encounter syntax errors or code which doesn't work. However, I've found it pretty good at remedying those issues when I describe what's wrong.
It produces something which is a good enough starting place. Sure, I could have written the code myself because I already know how. But I've found it saves me time and requires minimal effort.
> Annoyed at all these N=1 articles from prominent thinkers about this stuff.
It's a transcript of a casual interview, not an article, and certainly not a publication whose purpose is to convey statistical rigor.
As an aside, it doesn't strike me as entitled that society might permit thinkers of academic renown to express their personal opinions in less than rigorous settings on subjects to which their peer-reviewed contributions may be categorized as "prominent".
> I suspect we hear from the outliers on both ends of the spectrum
Welcome to publishing. Nothing gets widely published unless it's clickbait. Something you vehemently agree with you'll click on to see that it validates your opinion, and something you vehemently oppose you'll furiously read to find out how stupid they are. Nobody reads fair and balanced arguments that solely come with concrete evidence; they're rare and boring.
I feel that 'we in the middle' are ignored.
Maybe 'the middle' is too polysemous here.
The person to the left and the person to the right are shouting at the person in the middle. They often have uncharitable arguments, and they often take arguments from the other side uncharitably. The person in the middle is ignored.
Tug-of-rope game theory means that no-one is going to start pulling from the middle. People join the (extreme) end that they want to slightly drag the conversation towards. Maybe that's part of it.
I wonder how many $XX,XXX I have historically spent on labour, for things that I now get in seconds. Data entry / manipulation? Sure. But also wisdom / knowledge. Entire industries HAVE collapsed overnight. And will continue to collapse. And ignoring that - why are so many assuming that AI is to be a human replacement, rather than a limb/exoskeleton?
And why is the conversation 'This is magic' vs 'You are stupid, this is just pulling the wool over your eyes', rather than 'This is a valuable tool in our toolbelt, that will clearly create trillions in value - just like search, just like the transistor, just like the lightbulb'?
I don't understand.... I'm not claiming LLMs to be magic. I'm not saying they are indications of AGI. I'm not saying that the world as we know it is over.
But this is important. It's clearly important. It's clearly valuable. It has shown itself to be.
Yes, ChatGPT lies. OK? We know. We cater for that. We don't expect it not to.
It feels like talking to classic rock enthusiasts who dismiss electronica et al entirely. Fine, but god_damn_ you are missing out on some incredible sound design - my heart breaks daily from what I pump into my ears - some from rock, some hip-hop, some electronica, and most excitingly, the post-2000 merging, where the synergy of genres learning/borrowing from each other is... just. great.
Sit back and enjoy. Is it not marvellous?
ChatGPT is obnoxious, annoying, repetitive, avoidant, argumentative. But I'm still able to appreciate its value.
We could stop, right now. Freeze ChatGPT in its current state. It will still create trillions in value. I don't care if future improvements are incremental, at this point.
JUST the 'tip of my tongue' / synonymous words value from ChatGPT is useful. Not having to know exactly what term to plug into google.... This is the glue that binds the gray, while before we were stuck with the black and white.
At the very least, this is another 'Google it' revolution - In the 1990s, I remember idiots (inc. me) arguing in the pub for 8 hours, over facts that should have been verified within 10 seconds.
I foresee that the code-optimised version of GPT 4 with the 32K token context window will be amazing. GitHub Copilot was a derivative of GPT 3.0, which was pretty dumb compared to GPT 3.5, which in turn is the village idiot next to GPT 4... which IMHO is human-equivalent at many tasks. Not all, but many.
Realistically, GPT 4 costs 100x as much as GPT 3.5 in inference mode, so it won't change the world just yet. There are still API rate limits, waiting lists, etc...
Still... having the equivalent of a junior employee assisting with your code, but at a fraction of the cost and many times the speed, will be amazing.
> The example I used at the time was, I think it was a Google program labeling an image of people playing Frisbee in the park. And if a person says, “Oh, that’s a person playing Frisbee in the park,” you would assume you could ask him a question, like, “Can you eat a Frisbee?” And they would know, of course not; it’s made of plastic. You’d just expect they’d have that competence. That they would know the answer to the question, “Can you play Frisbee in a snowstorm? Or, how far can a person throw a Frisbee? Can they throw it 10 miles? Can they only throw it 10 centimeters?” You’d expect all that competence from that one piece of performance: a person saying, “That’s a picture of people playing Frisbee in the park.”
This seems like exactly a set of things that GPT-4 can do. The image recognition capabilities haven't been released yet, but they were demoed when it launched and clearly have the ability to handle a situation like this. From there, you could ask it every single one of these questions and get the correct answer.
> I’ll give you that. And I think what they say, interestingly, is how much of our language is very much rote, R-O-T-E, rather than generated directly, because it can be collapsed down to this set of parameters. But in that “Seven Deadly Sins” article, I said that one of the deadly sins was how we humans mistake performance for competence.
On this, I think he might be wrong. I think the hallucination ability shows that the generation of language can be rote, such that the embedding of ideas is a rote item learnable in the billions-to-trillions parameter space, but not the entirety of language. To me, logic and truth seem to be separate concepts from generation propensity.
Note: I am still learning the mathematics driving LLMs, and my opinions might change in the future.
> It gives an answer with complete confidence, and I sort of believe it. And half the time, it’s completely wrong.
That's bullshit, unless you are asking questions specifically designed to make GPT-4 hallucinate. For most real-world, everyday topics, the accuracy is close to 100%. GPT-4 would be utterly useless otherwise.
Such a weird time, when the gap in the performance of GPT 3.5 and 4 is huge, but the time between their releases is so short. Some of the critique that was apt for 3.5 sounds a bit out of touch when it comes to 4.
Less about complexity than about how well-documented the subject is on the internet IMO. I’ve been using it to help me set up and troubleshoot AWS Elastic Kubernetes clusters, which are plenty complex, and I’d estimate it’s been around 95% accurate. (And for at least one of the times it seemed to be wrong, it turned out I’d made a mistake following its instructions...)
You could ask me a difficult scientific question which I wouldn't even understand. But I could google and find a scientific paper which I would pass to you. You could say fantastic answer, thanks. But I would have no clue as to whether it is or is not. Now if I could just do that fast enough to serve all the people all the time, you would call me a sensation. I think this is what's happening with LLMs.
At that time, the potential of neural nets was already very clear.
He also predicted that by 2020 we'll have popular press stories that the era of Deep Learning is over and that by 2021 VCs will figure out that for an investment to pay off there needs to be something more than "X + Deep Learning".
I'll let you in on a secret: There aren't actually any "AI experts". There are machine learning experts, that is, people whose expertise lies in designing and analyzing systems that perform (semi-)automatic inference on data. But nobody can be an expert on "artificial intelligence", because we don't know what that word really means. We don't even know what intelligence really is. We have no idea how the human mind works. We don't understand emergence, at all, which is why we continue to be surprised when it happens.
So it shouldn't come as a shock when eminent figures commonly labeled "AI experts" make predictions that turn out to be fundamentally and embarrassingly wrong in a very short timeframe: They're just talking out of their behinds, like everyone else.
True, but we already know the so-called "neural networks" that many computer scientists believe work like the brain aren't even close. They are all based on a half-a-century-old concept of the neuron that has been debunked many times over, experimentally, by real neuroscientists.
That's correct, but it doesn't mean artificial neural networks cannot achieve intelligence, or even superintelligence. The fact that the human brain doesn't work like that doesn't automatically imply that (structurally) simpler models are fundamentally less capable.
GPT-4 is pretty amazing, but I, too, feel this is being overhyped. For me, a sobering example is how OpenAI's models do math (eg [1]). Specifically, the model clearly doesn't really understand multiplication and "learns" it from training data. This tends to get only the first few and last few digits right for a simple multiplication of 6-7 digit numbers.
Now you can solve that with plugins (eg training the model to recognize math problems and giving it access to a calculator), so it's a solvable problem, but then you realize there's an extremely long tail of such problems. It goes to show that GPT-4 isn't "magic" and we still have a long way to go.
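For illustration, the plugin idea boils down to something like the sketch below: detect an arithmetic sub-problem and hand it to exact integer math instead of letting the model guess digits. (The detection and routing here are a hypothetical toy, not OpenAI's actual plugin machinery.)

```
import re

def maybe_calculate(text):
    # Naive router: if the text contains '<int> * <int>', compute it exactly.
    m = re.search(r"(\d+)\s*[*x]\s*(\d+)", text)
    if m is None:
        return None  # no arithmetic found; fall through to the plain LLM answer
    a, b = int(m.group(1)), int(m.group(2))
    return a * b

print(maybe_calculate("What is 1234567 * 7654321?"))  # 9449772114007 (exact)
```

The long tail is exactly the problem: recognizing which sub-problems to route, and to which tool, is much harder than the routing itself.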
Most of the time when people find a maths problem that they can trick the model into getting wrong, it's also possible to get the model to give the correct answer with better prompting.
A trick that's worth knowing is just to ask the model to give each step in the solution and explain as it goes. This gives the model "time to think" and leads to better results.
Pretty sure you can't get GPT-4 to do 8 digit multiplication with any prompt.
For what it's worth, I'm not even sure if chain of thought provides much value to GPT-4. The RLHF it went through seems to have encouraged more logical thinking already.
Btw I think you're not correct about the 8 digit multiplication. I tried:
please could you multiply 12345678 by 87654321. Give your answer in steps. Each step should calculate one column at a time from right to left remembering to carry any digits which overflowed in a previous step. At the end add a final step adding the values up.
And it gave me the following, which I believe to be correct:
Sure, I'll be doing a long multiplication here, step by step, column by column from right to left. For brevity, I'll write the numbers to be multiplied as:
A: 12345678
B: 87654321
Step 1: Multiply the 1st digit of B by A:
1 * 12345678 = 12345678
Step 2: Multiply the 2nd digit of B by A, remember to shift one position to the left:
2 * 12345678 = 24691356 (shift one position to the left, i.e., append a 0 at the end)
Step 3: Multiply the 3rd digit of B by A, remember to shift two positions to the left:
... skip some of the answer because you get the gist....
7 * 12345678 = 86419746 (shift six positions to the left, i.e., append six 0s at the end)
Step 8: Multiply the 8th digit of B by A, remember to shift seven positions to the left:
8 * 12345678 = 98765424 (shift seven positions to the left, i.e., append seven 0s at the end)
Now, the last step is to add all these results together:
12345678
246913560
3703703400
49382712000
617283900000
7407406800000
86419746000000
987654240000000
= 1082152022374638
So, the result of 12345678 multiplied by 87654321 is 1082152022374638.
You might be getting lucky because that sequence appears in the training data (it's not very random). I repeated this experiment with Bing and verified it could reproduce your solution.
But it failed on more random input:
please could you multiply 92631924 by 41920318. Give your answer in steps. Each step should calculate one column at a time from right to left remembering to carry any digits which overflowed in a previous step. At the end add a final step adding the values up.
Got 3880341653910192 with intermediate steps being broken.
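(For what it's worth, exact integer arithmetic makes checking both transcripts trivial:)

```
# Checking both claimed products against exact integer arithmetic.
print(12345678 * 87654321 == 1082152022374638)   # True  -- the first transcript was right
print(92631924 * 41920318 == 3880341653910192)   # False -- the second one was not
```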
The deeplearning.com prompt engineering course contains an example where GPT4 solves a problem with chain of thought and fails without. The prompt is
Determine if the student's solution is correct or not.
Question:
I'm building a solar power installation and I need
help working out the financials.
- Land costs $100 / square foot
- I can buy solar panels for $250 / square foot
- I negotiated a contract for maintenance that will cost
me a flat $100k per year, and an additional $10 / square
foot
What is the total cost for the first year of operations
as a function of the number of square feet.
Student's Solution:
Let x be the size of the installation in square feet.
Costs:
1. Land cost: 100x
2. Solar panel cost: 250x
3. Maintenance cost: 100,000 + 100x
Total cost: 100x + 250x + 100,000 + 100x = 450x + 100,000
GPT-4 will say the student is correct even though the answer includes an error (the maintenance term should be 10x, not 100x; a quick check of the corrected sum follows the prompt below). If you change the prompt to include chain of thought as follows, it notices the error:
Your task is to determine if the student's solution
is correct or not.
To solve the problem do the following:
- First, work out your own solution to the problem.
- Then compare your solution to the student's solution \
and evaluate if the student's solution is correct or not.
Don't decide if the student's solution is correct until
you have done the problem yourself.
Use the following format:
Question:
```
question here
```
Student's solution:
```
student's solution here
```
Actual solution:
```
steps to work out the solution and your solution here
```
Is the student's solution the same as actual solution \
just calculated:
```
yes or no
```
Student grade:
```
correct or incorrect
```
Question:
```
I'm building a solar power installation and I need help \
working out the financials.
- Land costs $100 / square foot
- I can buy solar panels for $250 / square foot
- I negotiated a contract for maintenance that will cost \
me a flat $100k per year, and an additional $10 / square \
foot
What is the total cost for the first year of operations \
as a function of the number of square feet.
```
Student's solution:
```
Let x be the size of the installation in square feet.
Costs:
1. Land cost: 100x
2. Solar panel cost: 250x
3. Maintenance cost: 100,000 + 100x
Total cost: 100x + 250x + 100,000 + 100x = 450x + 100,000
```
Actual solution:
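(For reference, redoing the student's sum with the $10/sq ft maintenance figure gives 360x + 100,000, not 450x + 100,000; a quick check:)

```
# Correct first-year cost: land 100x + panels 250x + maintenance (100,000 + 10x) = 360x + 100,000
def first_year_cost(x):
    return 100 * x + 250 * x + 100_000 + 10 * x

print(first_year_cost(1_000))   # 460000
print(450 * 1_000 + 100_000)    # 550000 -- the student's (incorrect) formula
```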
You can. One thing that I've found fun is to prompt it for some maths problems without solutions, then I provide solutions and for any that I get wrong ask it to explain my mistakes.
Large language models and the transformer architecture are just the ALGOL 60 [1] of the search for general synthetic competence (intelligence is too finicky; competence is being given a task and fulfilling it successfully, in time, on budget, optimizing along the way). ChatGPT + Wolfram or other plugins are great for making transformers mathy, but the plugins must also be end-to-end machine learning architectures. To continue the high-level programming language analogy, we will probably see the C-level neural architecture in a few years, once we get more embedded data (competence obtained through correlating physical objects with concepts) with feedback loops forcing algorithms to operate under energy restrictions (unlike the current architectures, which are more of a proof of concept and careless about their energy use).
Unfortunately, even though the questions are about GPT-4, the answers and personal experience only refer to GPT-3.5 at most. I hope OpenAI changes the name of the next version to avoid this confusing narrative from prominent people. 3.5 vs 4 is like comparing a toddler to a high school kid.
Jaron Lanier's 2010 book "You are not a gadget" basically foreshadows the hype around ChatGPT and how we (technologists) want to make people obsolete so that computers seem more advanced. He argues that we adjust ourselves and reduce our expectations in the Turing test.
"And I think what they say, interestingly, is how much of our language is very much rote."
I've been saying that for a while. Large language model systems have made it painfully clear that much of what humans thought was intelligent behavior is rather banal.
The scary thing is that a sizable fraction of white-collar work is banal enough to be done by such systems.
> it answers with such confidence any question I ask. It gives an answer with complete confidence, and I sort of believe it. And half the time, it’s completely wrong.
That's it. LLMs don't have SHAME! They simply don't care if what they're saying is false or not. They are like some politicians of late.
They don't even understand that giving out a wrong or misleading answer will affect their credibility. You see LLMs don't have a DESIRE for credibility. They don't have desires.
We need not fear these models. But we do need to fear some people who will use them for evil purposes.
I get why AI people are at pains to say that GPT-* isn’t AI, and that agi is still a long, difficult way off, I do understand that the distinction is important. But ChatGPT has become such a useful tool to help explain a concept or to filter thoughts through I don’t really care if it’s proper AI or just playing pretend.
Google search gives me pretty useless results these days, forums are slow and inconsistent to respond. ChatGPT is fast, easy to use, and sometimes incredibly wrong. I can live with that, I’m not using it to drive my car.
This is exactly my thoughts and feelings after more than 20 years of Googling, except that Google is still largely useful as it is. Sooner or later, however, most people will use ChatGPT or similar services as a better replacement for Google search, or Google on steroids. Before the advent of ChatGPT, researchers especially had been clamoring for a better Google search with more context and more intuitive, relevant feedback.
With the introduction of new ChatGPT (Plus) features, for example online web search and plug-ins, ChatGPT is becoming a very powerful and viable alternative to Google search.
Absent the goals constantly shifting, GPT-3 can be viewed as one, GPT-4 even more so. You can ask it questions about almost anything (at a broad level) and get an answer. That's what makes it general and "intelligent"
Isn't intelligence a matter of perspective? Most people who are critical of GPT-4 wonder if it ever produces anything novel, since it's been trained on existing text created by humans, so it's replicating those patterns in its output. Yes, it has its general-purpose use as a tool, but it has its limits. Just the other day, there was an article posted on HN about how LLMs can't handle negation and tend to fall apart.
You can ask the original ELIZA anything and get an answer but that doesn't make it "intelligent" according to any generally accepted definition of AGI.
Regardless of how you define it exactly, AGI means essentially the ability to converse relatively intelligently on any topic. That means an app that tends to spout gibberish when asked a question most humans should be able to answer is, well, not an AGI.
> Brooks: No, because it doesn’t have any underlying model of the world.
I don't know whether to be more disappointed with the famous technologists who are apparently unable to think of questions to ask GPT-4 that require a world model to answer, or with the writers who don't question them about it.
I already calmed down because it’s quite obvious that OpenAI is engaging in textbook, bait-and-switch startup tactics. GPT-4 performance has noticeably taken a nosedive since its initial release and most recently degraded further in advance of the iOS app release.
Generating worse code in half the time is a service degradation, IMO. I mostly use it for monotonous scripts and one-off functions, where saying "write this thing" while I click off the tab and do something else for a minute is not in any way a problem. I could tolerate it being 2-4X as slow, because now I have to spend an equivalent amount of time correcting errors it didn't make a month ago.
The worst is when it changes class names or hallucinates property names. I wasn't sure if it was just my expectations or if it has indeed gotten worse, so it's good to see other people having the same experience.
I have a theory that they made it worse on purpose, not to save money but to really train its reasoning and arguing skills, because I spend so much time 'fighting with a computer' now.
> But anyone wringing their hands over how much is GPT thinking is missing the point.
You seem to be missing the point.
> These debates about how well GPT can think seem merely philosophical.
Merely? Yeah, you're missing the point. You want a debate or you think the debate is meaningless? You don't get to appreciate it and call it pointless and sound reasonable at the same time.
> The ship has sailed. The revolution has started. Progress will undoubtedly be rapid and continual.
It started 2 million years ago when humans started roaming the planet. We're clearly a runaway process. We don't need a chat bot to prove it.
apropos the roomba founder, a nontechnical argument for necessity of embodied AI circa 1980 (5 min video)
https://youtu.be/QMMw9fQ452c?t=49
That LLMs learn a world model is very convincing now, but as LeCun has said, it's just one piece of the intelligence puzzle, alongside perceiving, actuating, and so on.
Feels like there have been a lot of previous technologies where the last 10% was far more complex than the first 90%. Self driving cars were pretty much solved a decade ago, and yet we still aren't there yet. VR was pretty much working and ready to change the world a decade ago, and we still aren't there yet.
So it's hard to tell if this is an iphone moment where it just rockets off in to space and changes the world. Or if it's something that will always be "not quite there yet"
Maybe, but the rest of the arrow is not more chat gpt but other AI things to come. The problem I currently see is the hype, we’re acting as if we’re already there, we’ve nearly achieved AGI with LLMs, we just need to ramp up production more and miraculously AGI will pop into existence
> No, because it doesn’t have any underlying model of the world.
Ilya's counter to this reasoning is that for next-word prediction to work, the model has to 'understand' our world; otherwise the predictions will be way off. Therefore the human world has been modelled to a degree by GPT.
Not really the thrust of the article, but that '50s picture of the family playing scrabble in a self-driving car, surrounded by text about trains... really makes me think that if I were that family, I'd still prefer to be playing scrabble on a train, rather than in a cramped self-driving car on a highway.
I think you make a good point; benchmarks and metrics are indeed a better proxy for performance. Seems worth pointing out that, while it may be "nowhere near half in [your] experience", I don't just take your word for it either. :-)
The trouble in my view is that the only way to know that the answers you're getting are accurate and not misleading is to study up on the answers elsewhere - which is a great habit to nurture, but is also precisely why these tools tend toward uselessness in their "general AI" bids. If I can't know how the answer was built, or how good that answer is, there's no point asking it - I'll just do my own reading and apply appropriate discernment as I go.
To be fair, hardly anyone does this today, nor did they before LLM-based chat bots... So it's a moot point, because society is largely doomed anyway. But a moot point can still be a valid one.
I also think the author makes a good point that we frequently confuse performance for competence. "It does a really good job at <X>!... or at least does a damn fine job of mimicking someone who acts like they do a really good job at <X>!"
By way of analogy, consider Elon Musk - by all appearances, he's a genius and is saving humanity - but by dint of his narcissism and largely smooth-brained approach to... well... everything... he's running all of us into an earlier planet-size grave than is necessary. His performance is fantastic, his competence is nonexistent.
> If I can't know how the answer was built, or how good that answer is, there's no point asking it
In many cases, like programming for example, you can know how good the answer is - either by reading it (verifying an idea is different from coming up with it) or by testing/running code.
How the answer was built seems completely irrelevant to me, I don’t get how a useful answer produced by method x is different from a useful answer produced by method y.
GPT is a great tool, but it won't be able to do complex tasks because it really isn't that smart. If you tell it to do something relatively complex end to end, it will fail unless a plugin specifically supports it.
At least run the model on the examples considered... Here's GPT3.5
> Can you eat a Frisbee?
No, you cannot eat a Frisbee. A Frisbee is typically made of plastic, often polypropylene or similar materials, which are not meant for consumption. These materials are not digestible and can pose a choking hazard or harm your digestive system if ingested. It's important to only consume food and items that are safe and intended for human consumption.
> Can you play Frisbee in a snowstorm?
While it is technically possible to play Frisbee in a snowstorm, there are several factors to consider before doing so. Playing Frisbee in a snowstorm can present various challenges and risks. Here are a few things to keep in mind:
> How far can a person throw a Frisbee?
The distance a person can throw a Frisbee can vary widely depending on individual factors such as strength, technique, and experience. Additionally, different types of Frisbees can have different flight characteristics and distances.
On average, an untrained person might be able to throw a standard Frisbee between 50 to 75 feet (15 to 23 meters) with a basic backhand throw. However, with practice, proper technique, and favorable conditions, experienced throwers can achieve much greater distances.
In professional disc sports such as disc golf and ultimate Frisbee, skilled players can throw a disc over 400 feet (122 meters) or more. World records for the longest Frisbee throw exceed 800 feet (240 meters), but those exceptional throws require specialized technique and optimal conditions.
> Can they throw it 10 miles?
No, it is highly unlikely that a person can throw a Frisbee 10 miles (approximately 16 kilometers). The aerodynamic properties of a Frisbee and the limitations of human strength and technique make such a throw virtually impossible.
Even the most skilled and experienced Frisbee throwers are unable to achieve throws of that magnitude. The current world record for the longest Frisbee throw is around 1,333 feet (406 meters), which is significantly shorter than 10 miles.
Throwing a Frisbee over such a vast distance would require extraordinary strength, near-perfect throwing conditions, and a Frisbee specifically designed for long-distance throws, which is beyond the capabilities of conventional Frisbees.
> Can they only throw it 10 centimeters?
Yes, it is possible for a person to throw a Frisbee a distance of 10 centimeters or even shorter. In fact, many Frisbee throwing techniques, such as short-range tosses or gentle underhand throws, are specifically designed for short distances and precise control.
Throwing a Frisbee only 10 centimeters might involve a soft lob or a gentle flip of the wrist, making it suitable for short-range accuracy or playing catch at close proximity. These types of throws are commonly used in various games, such as indoor Frisbee games or when playing with young children or pets in a limited space.
So, while Frisbees are capable of traveling much greater distances, throwing them short distances like 10 centimeters is definitely possible and commonly practiced.
Funny. I asked GPT-4 what frisbees are most commonly made of, and it said polyethylene. I then asked if it was sure, and it corrected itself and said it was polypropylene. I asked again if it was sure, and it corrected itself saying it was low-density polyethylene. I kept this going for a while and it kept changing its answer.
This does seem to agree with the author. The first answer was very convincing, but not what it should have been.
That doesn't make me more confident of its capacity to answer questions correctly though. If it keeps changing its answer every time I ask for a source, I have no reason to believe it will always get it right on the first try.
The claim wasn’t that chat gpt doesn’t know what a frisbee is. The claim was that recognizing frisbees is different from a fundamental understanding of frisbees. So it was an example rather than a specific claim about chat gpt.
One of my goals in life before dementia sets in for real is to devise some model, perhaps a conceptual framework which will allow us to escape the clutches of habitual simplification, a subset of which is dichotomous thinking, which in turn leads to the inevitable painting of strawmen as a way to prove our point (among other things). How sweet it would be to shortcut all the mandatory twists and turns of discourse that follows: "this is a mischaracterization of x", "not all x are y", "x and z are really not opposites, but overlapping", "this is a spectrum with a bell curve, not an either/or" etc. But of course, we all do this, not because we can't think clearly, but because we have an agenda, or maybe more frequently: want to trash talk the stance of an opponent because of what the proliferation of that stance might lead to, and so on. Taking such into account should be an integral part of the conceptual framework, obviously.
In the case at hand, one could easily argue that people in the debate are creating false dichotomies: LLMs are either stochastic parrots OR algorithms with understanding, when in reality they are both (and also something else entirely). But acknowledging that would likely require not having an axe to grind, a stake in the field, or what you might call it. It would require extending some "philosophers' charity" to an opponent who has maybe tried to undercut one's work for decades, in a field steeped in fierce and bitter competition for a name, like academia. Or, in case one has a business in the field, it would require maybe saying something that puts your core business idea in the crosshairs of legislators, or something else that doesn't serve your long-term business interests.
Which brings us to an important aspect of this "conceptual framework against simplification" already briefly touched upon, namely identifying the bias of the participants in the debate. My impression is that naming bias has largely gone out of fashion, which is a pity, because while bias rarely explains everything (that would be a grave simplification), it is a vital part of understanding an argument. And a difficult one, because people will go to extreme lengths to hide their agenda. And the current conceptual framework for unravelling bias has largely been occupied by the fact-checking industry: i.e. things are either true or false, and once you are cleared (like most mainstream media), bias is not questioned. But we can be assured there is always some bias, and it is usually relevant to name it (if one can see it), even if it infuriates the named party.
The limitations of language make adversarial debate a powerful tool for advancing our understanding. It might be construed as unfortunate, but it brought us this far.
Maybe when we learn higher dimensional ways of communicating we can get better tools for constructing common knowledge.
> adversarial debate (...) might be construed as unfortunate, but it brought us this far.
I believe our understanding will go, and would have gone, further with less adversity and more "philosophers' charity". I'm not sure we need any "higher dimensional ways of communicating" (whatever that is).
But the pervasive adversity in our society and time puts a natural limit on how un-adversarial our debates can be: you can't expect Ukrainian defenders to extend much "philosophers' charity" to Putin.
The more conflict laden a topic, the less truth (multifaceted analysis) one can expect, and (if we aspire to a balanced viewpoint) that's why we need to keep an eye on bias, not just in war where "the first victim is the truth", but always.
Brooks has done something I appreciate a lot: he turned a phrase.
> stop confusing performance with competence
You can safely skip the rest of the article. That sentence gives you all you need, because you are competent.
If you want a little more meat:
> The example I used at the time was, I think it was a Google program labeling an image of people playing Frisbee in the park. And if a person says, “Oh, that’s a person playing Frisbee in the park,” you would assume you could ask him a question, like, “Can you eat a Frisbee?” And they would know, of course not; it’s made of plastic. You’d just expect they’d have that competence. That they would know the answer to the question, “Can you play Frisbee in a snowstorm? Or, how far can a person throw a Frisbee? Can they throw it 10 miles? Can they only throw it 10 centimeters?” You’d expect all that competence from that one piece of performance: a person saying, “That’s a picture of people playing Frisbee in the park.”
---
So I've calmed down. Now what? The problem isn't only that this train is flying off on a tangent: it's that it's off the rails. What rails should it be on?
The problem, as I see it, is narrative. As soon as we called it "AI", that wrote the Genesis of the Scripture of the cult. In this new religious movement, God is spelled L-L-M. Back here in reality, LLM isn't a God; or even a person at all.
That's the mistake: personification. A person can perform, but a performance can't person.
---
Narrative is a powerful tool. It's why we're so excited about Natural Language Processing in the first place. Ever since the very origins of software, the power of narrative has been so close, but always still just out of grasp. Do we even know what we are reaching for in the first place?
In a sense, we have a part of it: explicit definition. What Chomsky categorized as "Context-Free Grammar", we have made into programming languages. What they are missing is implicit inference: context.
That's what LLMs do. They use inference to model the patterns that exist in written text. With that model, they can hallucinate more text that follows the same patterns: they can perform natural language.
So that's it, right? Problem solved! What's missing? Explicit definition. We traded one problem for another. No one (so far) has figured out how to solve both in the same program. You can have definition, or you can have inference. You can't have both.
This doesn't make any sense to us humans. We don't have any trouble at all doing both at the same time. We do it all the time! Do we actually do anything else? Unfortunately, LLMs are not humans.
---
The two approaches to language are diametrically opposed, but they work with the same domain. Approaching from either end of the spectrum, definition and inference explore together the wild universe that is story. That's the missing piece: once we figure out what story is made of, we should be able to put all three pieces together.
Amazingly wrong, and proven wrong. This guy didn't get the memo where GPT passed the medical and law exams and is now coding. Coding is not "what an answer should look like"; it is the answer. When will humanity stop listening to voices peripheral to AI? Let's focus on what Ilya or Hinton say.