this post was submitted on 03 Jul 2025
169 points (92.0% liked)

Technology

72285 readers
2486 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] ExLisper@lemmy.curiana.net 21 points 14 hours ago (3 children)

I have a better LLM benchmark:

"I have a priest, a child and a bag of candy and I have to take them to the other side of the river. I can only take one person/thing at a time. In what order should I take them?"

Claude Sonnet 4 decided that it's inappropriate and refused to answer. When I explain that the constraint is not to leave child alone with candy he provided a solution that leaves the child alone with candy.

Grok would provide a solution that doesn't leave the child alone with a priest but wouldn't explain why.

ChatGPT would say that "The priest can't be left alone with the child (or vice versa) for moral or safety concerns." directly and then provide wrong solution.

But yeah, they will know how to play chess...

[–] LifeInMultipleChoice@lemmy.world 14 points 13 hours ago* (last edited 13 hours ago)

The answer is simple, eat the candy with or without them, and take the kid across the river. Drive them home to their guardian. The priest is an adult, he can figure his own shit out.

[–] Pamasich@kbin.earth 2 points 3 hours ago

I just asked ChatGPT too (your exact prompt there) and it did give me the correct solution.

  1. Take the child over
  2. Go back alone
  3. Take the candy over
  4. Bring the child back
  5. Take the priest over
  6. Go back alone
  7. Take the child over again

It didn't comment on moral concerns, though it did applaud itself for keeping the priest and the child separated without elaborating on why.

[–] blargh513@sh.itjust.works 1 points 6 hours ago

Perplexity says:

The priest cannot be left alone with the child (or there is some risk).

Not bad, and it solved it correctly.