Next, pit ChatGPT against 1K ZX Chess on a ZX81.
Okay, but could ChatGPT be used to vibe code a chess program that beats the Atari 2600?
If you don't play chess, the Atari is probably going to beat you as well.
LLMs are only good at things to the extent that they've been well trained in the relevant areas: not just learning to predict text sequences, but reinforcement learning on top of that, where a human or some other agent says "this answer is better than that one" enough times in enough of the right contexts. It mimics the way humans learn, through repeated and diverse exposure.
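For the curious, here's a minimal sketch of what that "this answer is better than that one" signal looks like in reward-model training. The `reward_model` is hypothetical, just anything that scores a prompt/answer pair; the loss itself is the standard pairwise (Bradley-Terry style) objective:

```python
import torch.nn.functional as F

def preference_loss(reward_model, prompt, better_answer, worse_answer):
    """Pairwise loss used in RLHF reward modelling: push the score of the
    human-preferred answer above the rejected one."""
    r_better = reward_model(prompt, better_answer)  # scalar score
    r_worse = reward_model(prompt, worse_answer)
    # -log sigmoid(r_better - r_worse) is small only when the preferred answer scores higher
    return -F.logsigmoid(r_better - r_worse).mean()
```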
If they set up a system to train it against some chess engine, or (much simpler) just gave it a chess engine as a tool call, it would do much better. Tool calling already exists and would be by far the easiest route.
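A rough sketch of what that tool could look like, assuming the python-chess library and a local Stockfish binary on PATH (the wiring that exposes it to the LLM as a tool is left out):

```python
import chess
import chess.engine

def best_move(fen: str, think_time: float = 0.1) -> str:
    """Hypothetical 'tool': given a position as FEN, ask a real engine for its move."""
    board = chess.Board(fen)
    engine = chess.engine.SimpleEngine.popen_uci("stockfish")
    try:
        result = engine.play(board, chess.engine.Limit(time=think_time))
    finally:
        engine.quit()
    return result.move.uci()

# The LLM never reasons about the position itself; the agent just forwards the
# current FEN to the tool and relays the answer, e.g.
#   best_move(chess.STARTING_FEN)  ->  something like "e2e4"
```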
It could also be instructed to write its own chess engine and then run it, at which point it might be on par with the Atari, but it wouldn't compete with a serious engine.
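A toy sketch of the kind of engine it might write, again assuming python-chess: material-only evaluation with a shallow negamax search, nowhere near serious engine strength:

```python
import chess

# Crude material values; a real engine would add piece-square tables,
# search pruning, an opening book, etc.
PIECE_VALUES = {chess.PAWN: 1, chess.KNIGHT: 3, chess.BISHOP: 3,
                chess.ROOK: 5, chess.QUEEN: 9, chess.KING: 0}

def evaluate(board: chess.Board) -> int:
    """Material balance from the point of view of the side to move."""
    score = 0
    for piece, value in PIECE_VALUES.items():
        score += value * (len(board.pieces(piece, chess.WHITE))
                          - len(board.pieces(piece, chess.BLACK)))
    return score if board.turn == chess.WHITE else -score

def negamax(board: chess.Board, depth: int) -> int:
    if board.is_checkmate():
        return -10_000  # side to move is mated
    if depth == 0 or board.is_game_over():
        return evaluate(board)
    best = -10_000
    for move in board.legal_moves:
        board.push(move)
        best = max(best, -negamax(board, depth - 1))
        board.pop()
    return best

def pick_move(board: chess.Board, depth: int = 3) -> chess.Move:
    best_move, best_score = None, -10_001
    for move in board.legal_moves:
        board.push(move)
        score = -negamax(board, depth - 1)
        board.pop()
        if score > best_score:
            best_move, best_score = move, score
    return best_move
```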
This isn't the strength of GPT-4o; the model has been optimised for tool use as an agent. That's why it's so good at image generation relative to other models: it uses tools to construct an image piece by piece, similar to how a human would. Poor system prompting probably didn't help either. An LLM is not a universal thinking machine, it's a universal process machine. It understands a process and uses tools to accomplish it, hence its strength at writing code (especially as an agent).
It's similar to how a monkey can be far better than a human at remembering a sequence of numbers, yet is totally incapable of even comprehending writing those numbers down.
Do you have a source for that, re: monkeys memorizing numerical sequences? What do you mean by that?
That threw me as well.