this post was submitted on 01 Jun 2025

261 points (96.1% liked)

Technology

70942 readers

3395 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

261

I'm looking for an article showing that LLMs don't know how they work internally (feddit.it)

submitted 6 days ago* (last edited 6 days ago) by lgsp@feddit.it to c/technology@lemmy.world

72 comments fedilink hide all child comments

I found the aeticle in a post on the fediverse, and I can't find it anymore.

The reaserchers asked a simple mathematical question to an LLM ( like 7+4) and then could see how internally it worked by finding similar paths, but nothing like performing mathematical reasoning, even if the final answer was correct.

Then they asked the LLM to explain how it found the result, what was it's internal reasoning. The answer was detailed step by step mathematical logic, like a human explaining how to perform an addition.

This showed 2 things:

LLM don't "know" how they work
the second answer was a rephrasing of original text used for training that explain how math works, so LLM just used that as an explanation

I think it was a very interesting an meaningful analysis

Can anyone help me find this?

EDIT: thanks to @theunknownmuncher @lemmy.world https://www.anthropic.com/research/tracing-thoughts-language-model its this one

EDIT2: I'm aware LLM dont "know" anything and don't reason, and it's exactly why I wanted to find the article. Some more details here: https://feddit.it/post/18191686/13815095

top 50 comments

sorted by: hot top controversial new old

[–] theunknownmuncher@lemmy.world 73 points 6 days ago (2 children)

https://www.anthropic.com/research/tracing-thoughts-language-model its this one

[–] lgsp@feddit.it 20 points 6 days ago

Oh wow thank you! That's it!

I didn't even remember now good this article was and how many experiments it collected

[–] anzo@programming.dev 5 points 5 days ago

Here's a book for a different audience. Explains in layman terms why to be wary about this tech, https://thebullshitmachines.com/

[–] glizzyguzzler@lemmy.blahaj.zone 63 points 6 days ago (19 children)

Can’t help but here’s a rant on people asking LLMs to “explain their reasoning” which is impossible because they can never reason (not meant to be attacking OP, just attacking the “LLMs think and reason” people and companies that spout it):

LLMs are just matrix math to complete the most likely next word. They don’t know anything and can’t reason.

Anything you read or hear about LLMs or “AI” getting “asked questions” or “explain its reasoning” or talking about how they’re “thinking” is just AI propaganda to make you think they’re doing something LLMs literally can’t do but people sure wish they could.

In this case it sounds like people who don’t understand how LLMs work eating that propaganda up and approaching LLMs like there’s something to talk to or discern from.

If you waste egregiously high amounts of gigawatts to put everything that’s ever been typed into matrices you can operate on, you get a facsimile of the human knowledge that went into typing all of that stuff.

It’d be impressive if the environmental toll making the matrices and using them wasn’t critically bad.

TLDR; LLMs can never think or reason, anyone talking about them thinking or reasoning is bullshitting, they utilize almost everything that’s ever been typed to give (occasionally) reasonably useful outputs that are the most basic bitch shit because that’s the most likely next word at the cost of environmental disaster

[–] peoplebeproblems@midwest.social 17 points 5 days ago (2 children)

People don't understand what "model" means. That's the unfortunate reality.

[–] adespoton@lemmy.ca 14 points 5 days ago (1 children)

They walk down runways and pose for magazines. Do they reason? Sometimes.

[–] IncogCyberspaceUser@lemmy.world 11 points 5 days ago (1 children)

But why male models?

load more comments (1 replies)

[–] random_character_a@lemmy.world 7 points 5 days ago

Yeah. That's because peoples unfortunate reality is a "model".

[–] Treczoks@lemmy.world 9 points 5 days ago (1 children)

I've read that article. They used something they called an "MRI for AIs", and checked e.g. how an AI handled math questions, and then asked the AI how it came to that answer, and the pathways actually differed. While the AI talked about using a textbook answer, it actually did a different approach. That's what I remember of that article.

But yes, it exists, and it is science, not TicTok

[–] lgsp@feddit.it 5 points 5 days ago

Thank you. I found the article, linkin the OP

[–] AnneBonny@lemmy.dbzer0.com 7 points 5 days ago (1 children)

How would you prove that someone or something is capable of reasoning or thinking?

[–] glizzyguzzler@lemmy.blahaj.zone 6 points 5 days ago (2 children)

You can prove it’s not by doing some matrix multiplication and seeing its matrix multiplication. Much easier way to go about it

[–] theunknownmuncher@lemmy.world 19 points 5 days ago* (last edited 5 days ago) (7 children)

Yes, neural networks can be implemented with matrix operations. What does that have to do with proving or disproving the ability to reason? You didn't post a relevant or complete thought

Your comment is like saying an audio file isn't really music because it's just a series of numbers.

load more comments (7 replies)

[–] whaleross@lemmy.world 7 points 5 days ago (2 children)

People that can not do Matrix multiplication do not possess the basic concepts of intelligence now? Or is software that can do matrix multiplication intelligent?

[–] glizzyguzzler@lemmy.blahaj.zone 2 points 2 days ago (1 children)

So close, LLMs work via matrix multiplication, which is well understood by many meat bags and matrix math can’t think. If a meat bag can’t do matrix math, that’s ok, because the meat bag doesn’t work via matrix multiplication. lol imagine forgetting how to do matrix multiplication and disappearing into a singularity or something

load more comments (1 replies)

[–] WolfLink@sh.itjust.works 6 points 5 days ago (1 children)

The environmental toll doesn’t have to be that bad. You can get decent results from single high-end gaming GPU.

load more comments (1 replies)

[–] just_another_person@lemmy.world 3 points 6 days ago

It's a developer option that isn't generally available on consumer-facing products. It's literally just a debug log that outputs the steps to arrive at a response, nothing more.

It's not about novel ideation or reasoning (programmatic neural networks don't do that), but just an output of statistical data that says "Step was 90% certain, Step 2 was 89% certain...etc"

[–] AnneBonny@lemmy.dbzer0.com 2 points 5 days ago (2 children)

Who has claimed that LLMs have the capacity to reason?

[–] theparadox@lemmy.world 12 points 5 days ago (2 children)

More than enough people who claim to know how it works think it might be "evolving" into a sentient being inside it's little black box. Example from a conversation I gave up on... https://sh.itjust.works/comment/18759960

[–] theunknownmuncher@lemmy.world 6 points 5 days ago

I don't want to brigade, so I'll put my thoughts here. The linked comment is making the same mistake about self preservation that people make when they ask an LLM to "show it's work" or explain it's reasoning. The text response of an LLM cannot be taken at it's word or used to confirm that kind of theory. It requires tracing the logic under the hood.

Just like how it's not actually an AI assistant, but trained and prompted to output text that is expected to be what an AI assistant would respond with, if it is expected that it would pursue self preservation, then it will output text that matches that. It's output is always "fake"

That doesn't mean there isn't a real potential element of self preservation, though, but you'd need to dig and trace through the network to show it, not use the text output.

[–] AnneBonny@lemmy.dbzer0.com 2 points 5 days ago

Maybe I should rephrase my question:

Outside of comment sections on the internet, who has claimed or is claiming that LLMs have the capacity to reason?

[–] adespoton@lemmy.ca 6 points 5 days ago (1 children)

The study being referenced explains in detail why they can’t. So I’d say it’s Anthropic who stated LLMs don’t have the capacity to reason, and that’s what we’re discussing.

The popular media tends to go on and on about conflating AI with AGI and synthetic reasoning.

[–] theunknownmuncher@lemmy.world 5 points 5 days ago (3 children)

You're confusing the confirmation that the LLM cannot explain it's under-the-hood reasoning as text output, with a confirmation of not being able to reason at all. Anthropic is not claiming that it cannot reason. They actually find that it performs complex logic and behavior like planning ahead.

load more comments (3 replies)

load more comments (13 replies)

[–] BodilessGaze@sh.itjust.works 40 points 5 days ago (17 children)

I don't know how I work. I couldn't tell you much about neuroscience beyond "neurons are linked together and somehow that creates thoughts". And even when it comes to complex thoughts, I sometimes can't explain why. At my job, I often lean on intuition I've developed over a decade. I can look at a system and get an immediate sense if it's going to work well, but actually explaining why or why not takes a lot more time and energy. Am I an LLM?

load more comments (17 replies)

[–] JackGreenEarth@lemm.ee 18 points 6 days ago (1 children)

By design, they don't know how they work. It's interesting to see this experimentally proven, but it was already known. In the same way the predictive text function on your phone keyboard doesn't know how it works.

[–] lgsp@feddit.it 17 points 6 days ago (1 children)

I'm aware of this and agree but:

I see that asking how an LLM got to their answers as a "proof" of sound reasoning has become common
this new trend of "reasoning" models, where an internal conversation is shown in all its steps, seems to be based on this assumption of trustable train of thoughts. And given the simple experiment I mentioned, it is extremely dangerous and misleading
take a look at this video: https://youtube.com/watch?v=Xx4Tpsk_fnM : everything is based on observing and directing this internal reasoning, and these guys are computer scientists. How can they trust this?

So having a good written article at hand is a good idea imho

[–] Blue_Morpho@lemmy.world 2 points 5 days ago* (last edited 5 days ago)

I only follow some YouTubers like Digital Spaceport but there has been a lot of progress from years ago when LLM's were only predictive. They now have an inductive engine attached to the LLM to provide logic guard rails.

[–] KeenFlame@feddit.nu 13 points 5 days ago

It's the anthropic article you are looking for, where they performed open brain surgery equivalent to find out that they do maths in very strange and eerily humanlike operations, like they will estimate, then if it goes over calculate the last digit like I do. It sucks as a counting technique though

[–] tal@lemmy.today 11 points 6 days ago* (last edited 5 days ago)

Define "know".

An LLM can have text describing how it works and be trained on that text and respond with an answer incorporating that.
LLMs have no intrinsic ability to "sense" what's going on inside them, nor even a sense of time. It's just not an input to their state. You can build neural-net-based systems that do have such an input, but ChatGPT or whatever isn't that.
LLMs lack a lot of the mechanisms that I would call essential to be able to solve problems in a generalized way. While I think Dijkstra had a valid point:

The question of whether a computer can think is no more interesting than the question of whether a submarine can swim.

...and we shouldn't let our prejudices about how a mind "should" function internally cloud how we treat artificial intelligence...it's also true that we can look at an LLM and say that it just fundamentally doesn't have the ability to do a lot of things that a human-like mind can. An LLM is, at best, something like a small part of our mind. While extracting it and playing with it in isolation can produce some interesting results, there's a lot that it can't do on its own: it won't, say, engage in goal-oriented behavior. Asking a chatbot questions that require introspection and insight on its part won't yield interesting result, because it can't really engage in introspection or insight to any meaningful degree. It has very little mutable state, unlike your mind.

[–] franzcoz@feddit.cl 3 points 5 days ago (1 children)

There was a study by Anthropic, the company behind Claude, that developed another AI that they used as a sort of "brain scanner" for the LLM, in the sense that allowed them to see sort of a model of how the LLM "internal process" worked

[–] lgsp@feddit.it 4 points 5 days ago

Yes, that's it. I added the link in the OP,

load more comments