

Except LLM output is largely gibberish. Just confident gibberish. There’s a reason we call it “AI slop”.
LLM responses are only ever “sound” when they’re regurgitating existing information they were trained on. Beyond some simple transformations, they are unable to create original ideas. They very frequently break down on somewhat unique tasks, as evidenced by the ever-prevalent code-slop which is eroding our software.
They don’t have a memory of previous conversations (unless you literally copy-paste it into the prompt), they don’t learn (Claude “memories” is literally just copy-pasting a summary into the prompt, only automatically). They don’t have any “thoughts” of their own between prompts (OpenClaw just keeps prompting them to pretend they are autonomous).
The underlying implementation of “reasoning” in LLMs is literally “hallucinate some more text which vaguely looks like thoughts and hope that influences the answer”. LLMs are probabilistic models which we figured out how to make so they produce somewhat correct-looking answers at a rate a little higher than chance.
Magic 8-balls sometimes give sound responses. Do they think? Where do we draw the line with this interpretation of “thinking”?



Good for you. Have a cookie, I guess?
Citation needed.
A whole book of citations needed. That claim is wildly inconsistent with the consensus about AI hallucinations.
You mean like how LLMs keep hallucinating the same passwords and nonexistent dependencies to the point that bad actors are using that fact to compromise vibe coded systems via techniques like slopsquatting?
In fact, I keep experimenting with frontier models (including Fable when it was available) just so that the “but we’ve made so much progress in the past few months” argument can’t be used against me. You’re wildly overselling their capabilities.