this post was submitted on 22 Jun 2025

106 points (73.7% liked)

Programming Humor

3184 readers

12 users here now

Other Programming Communities [email protected] [email protected] [email protected] [email protected] [email protected] [email protected] [email protected]

founded 2 years ago

MODERATORS

[email protected]

106

AI will replace us all (lemmy.world)

submitted 2 days ago by [email protected] to c/[email protected]

38 comments fedilink hide all child comments

story: https://www.pcmag.com/news/chatgpt-gets-absolutely-wrecked-in-chess-match-with-1978-atari

all 42 comments

sorted by: hot top controversial new old

[–] [email protected] 10 points 1 day ago (1 children)

What exactly is it that makes the image generating AI use the ugliest colors for backgrounds? This one is like the stained walls in chain-smoker's house.

[–] [email protected] 3 points 1 day ago

Cross contamination from all the ai generated ghibli images, ai eats it's own shit and it is showing.

[–] [email protected] 26 points 1 day ago (1 children)

Everyone here saying "no shit, LLMs were not even designed to play chess" are not the people who this is directed at.

Multiple times at my job I have had to explain, often to upper management, that LLMs are not AGIs.

Stories like these help an under informed general public to wrap their heads around the idea that a "computer that can talk" =/= a "computer that can truly think/reason".

[–] [email protected] 6 points 1 day ago* (last edited 1 day ago)

They say LLMs can “reason” now, but they obviously can’t. At best, they could be trained to write a code snippet and run the code to get the answer. I’ve noticed when asked to do math ChatGPT will now translate my math question into python and run that to get the answer, since l it can’t do math itself reliably.

There are algorithms for playing chess that win by analyzing every possible move for 5, 10, 100, or more moves in advance and choosing the one most likely to lead to an optimal outcome. This is essentially what the Atari game is probably doing. LLMs could probably be given the tools to run that algorithm themselves. However, the LLM itself can’t possibly do the same thing.

[–] [email protected] 49 points 2 days ago* (last edited 1 day ago) (1 children)

This picture is AI generated

Edit: OP removed the picture from the post

[–] [email protected] 4 points 2 days ago

I know right, the Atari is really pulling it's weight.

[–] [email protected] 10 points 1 day ago

for some reason it reminds me of a quote from friends: "voice recognition is gonna be pretty much standard on any computer you buy. So you can be like 'wash my car', 'clean my room'. You know it's not gonna be able to do any of those things, but it'll understand what you're saying"

[–] [email protected] 33 points 2 days ago (1 children)

Can the Atari answer my questions incorrectly with confidence?

Checkmate.

[–] [email protected] 4 points 2 days ago

Check

[–] [email protected] 14 points 2 days ago* (last edited 2 days ago) (1 children)

"Everybody is a genius. But if you judge a fish by its ability to climb a tree, it will live its whole life believing that it is stupid." attributed to Einstein but I read he didn't say it.

[–] [email protected] 2 points 1 day ago

I'm not sure I've ever read an actual quote by Einstein at this point.

Or Thomas Jefferson for that matter

I do like the quote regardless

[–] [email protected] 11 points 2 days ago

To be fair, an Atari from 1977 would also outplay me in chess 😅

[–] [email protected] 8 points 2 days ago* (last edited 2 days ago) (1 children)

I’d like to see the Atari write a shitty article about the seven best and worst kinds of moviegoers. Or role play with me that they are a 300 year old sparkly vampire and I am an insufferable teenage girl with zero friends or ability to emote proper human emotions.

[–] [email protected] 3 points 1 day ago

I'm sure you could run Eliza in the Atari...

[–] [email protected] 4 points 2 days ago (3 children)

No shit. Chess programs are specifically built and optimised to the nth degree for this specific use case and nothing else. They do not share the massive compute overhead and convoluted nondeterministic nature of an LLM.

This is like drag racing an F1 car and a Camry and being surprised at the result.

[–] [email protected] 3 points 1 day ago

This is like drag racing an F1 car and a Camry

More like racing a Reliant Robin and an answering machine.

[–] [email protected] 3 points 1 day ago (1 children)

I don't think the Atari Chess program is as optimized as you think.

[–] [email protected] 3 points 1 day ago

1.19 MHz, 1/8 kB RAM

so no transposition tables, no endgame databases, nothing that requires pretty much any memory.

[–] [email protected] 1 points 1 day ago

Or have a real engine designer design a moderately powerful engine vs a computer throws together a blob of metal that looks kinda like an engine

[+] xor 4 points 2 days ago* (last edited 10 hours ago) (2 children)

[deleted]

[–] [email protected] 8 points 2 days ago

My computer beat me at chess but I beat it at kickboxing.

[–] [email protected] 1 points 1 day ago (1 children)

In that analogy, billions would be being spent by exotic car manufacturers saying they will replace all vehicules: airplanes, boats, scooters, bicycles, rockets....

Also inexplicably the Lamborghini sometimes just throws itself into reverse and insists that it's moving forward.

[+] xor 2 points 1 day ago* (last edited 10 hours ago) (1 children)

[deleted]

[–] [email protected] 1 points 1 day ago (2 children)

LLM’s are not meant to do anything like playing a chess game

They're being sold as being able to play chess, though.

[–] [email protected] 1 points 1 day ago (1 children)

Nobody is making those claims. Or they would get sued for false advertising.

People are interpreting as if chatbots are capable of playing chess, but that isn't on the company.

The claims those companies make are real. It are dumb features that are mostly useless, but the claims are not fake.

[–] [email protected] 1 points 1 day ago (1 children)

Every single one of those scammers is making those claims, and worse, starting with calling their shit AI.

They're selling useless and harmful shit, they know they're selling useless and harmful shit, and they plan to run away with the money the second the bubble starts to burst, without a single care for the state they'll leave the internet and the economy in.

[–] [email protected] 3 points 2 days ago (1 children)

In all fairness chessbots are REALLY REALLY good. Like incredibly good at chess. I am not shocked the guessing machine lost to one.

[–] [email protected] 7 points 2 days ago (1 children)

You really don't understand how little processing a 2600 had, whoever wrote that chess algorithm is a fucking coding god

[–] [email protected] 1 points 2 days ago (1 children)

There are chess engines floating around out there under 4kb, probably even less if you were mainlining the thing in assembly for a 6502 instruction set, still 128 bytes of RAM to work with is a punish.

But chess is mostly a 'solved problem' computationally, it's impressive constrained to the hardware but this whole Atari vs ChatGPT thing is like a grandmaster in a Mechanical Turk vs a toddler

[–] [email protected] 1 points 1 day ago

The story says the game is also from the '70s. So that would be before we were anywhere near video game chess being good. Much less a solved problem.

[–] [email protected] 2 points 2 days ago

really funny video about chatgpt being horrible at chess https://youtu.be/l_wOsSda3Us

[–] [email protected] 3 points 2 days ago (1 children)

4O got wrecked. My ai fan friend said O3 is their reasoning model so it means nothing. I don't agree but can't find proof.

Has someone done this with O3?

[–] [email protected] 18 points 2 days ago (3 children)

It’s a fundamental limitation of how LLMs work. They simply don’t understand how to follow a set of rules in the same way as a traditional computer/game is programmed.

Imagine you have only long-term memory that you can’t add to. You might get a few sentences of short-term memory before you’ve forgetting the context of the beginning of the conversation.

Then add on the fact that chess is very much a forward-thinking game and LLMs don’t stand a chance against other methods. It’s the classic case of “When all you have is a hammer, everything looks like a nail.” LLMs can be a great tool, but they can’t be your only tool.

[–] [email protected] 5 points 2 days ago (1 children)

Or: If it's possible to create a simple algirithm, that will always be infinitely more accurate than ML.

[–] [email protected] 9 points 2 days ago

That is because the algorithm has an expected output that can be tested and verified for accuracy since it works consistently every time. If there appears to be inconsistency, it is a design flaw in the algorithm.

[–] [email protected] 2 points 2 days ago (3 children)

MY biggest disappointment with how AI is being implemented is the inability to incorporate context specific execution if small programs to emulate things like calculators and chess programs. Like why does it doe the hard mode approach to literally everything? When asked to do math why doesn't it execute something that emulates a calculator?

[–] [email protected] 2 points 1 day ago

That's definitely being done. It's referred to as "tool calling" or "function calling": https://python.langchain.com/docs/how_to/tool_calling/

This isn't as potent as one might think, because:

each tool needs to be hooked up and described extensively.
the naive approach where the LLM generates heaps of text when calling these tools, for example to describe the entire state of the chessboard as JSON or CSV, is unreliable, because text generation is unreliable.
smarter approaches, like having an external program keeping track of the chessboard state and sending it to a chess engine, so that the LLM only has to forward the move that the user described, don't really make sense to incorporate into a general-purpose language model. You can find chess chatbots on the internet, though.

But all-in-all, it is a path forward where the LLMs could just do the semantics and then call a different tool for each thinky job, serving at least as a user interface.
The hope is for it to also serve as glue between these tools, automatically calling the right tools and passing their output into other tools. I believe, the next step in this direction is "agentic AI", but I haven't yet managed to cut through the buzzword soup to figure out what that actually means.

[–] [email protected] 3 points 2 days ago

I’ve been waiting for them to make this improvement since they were first introduced. Any day now…

[–] [email protected] 2 points 2 days ago

Chatgpt definitely does that. It can write small python programs and execute them, but it doesn't do it systematically, you have to prompt for it. It can even use chart libs to display data.

[–] [email protected] 0 points 2 days ago

It’s a fundamental limitation of how LLMs work.

LLMs have been adding reasoning front ends to them like O3 and deep seek. That's why they can solve problems that simple LLM's failed at.

I found one reference to O3 rated at chess level 800 but I'd really like to see Atari chess vs O3. My telling my friend how I think it would fail isn't convincing.