this post was submitted on 18 Jul 2025

Technology


OpenAI launched ChatGPT Agent on Thursday, its latest effort in the industry-wide pursuit to turn AI into a profitable enterprise—not just one that eats investors' billions. In its announcement blog, OpenAI says its Agent "can now do work for you using its own computer," but CEO Sam Altman warns that the rollout presents unpredictable risks.

[...]

OpenAI research lead Lisa Fulford told Wired that she used Agent to order "a lot of cupcakes," which took the tool about an hour, because she was very specific about the cupcakes.

[–] Evotech@lemmy.world -2 points 1 day ago (1 children)
[–] wise_pancake@lemmy.ca 2 points 23 hours ago (2 children)

I use agents a lot and have written several MCP servers now. The tasks I automate aren't things like ordering cupcakes; they're mainly the glue between complex things.

I still can't get Claude to nicely open a JIRA ticket for me, but I can get it to read through a sequence of connected documents and filter that into.

I don't think agents are ready for the main event and these are some poor examples of their power.

I'm not saying they won't improve, but using the right tool for the right job is critical. An hour to order cupcakes is silly even for an LLM.

[–] Evotech@lemmy.world 3 points 22 hours ago

They're examples for the common guy on the street who doesn't know what an MCP server is.

[–] Eyekaytee@aussie.zone 1 points 11 minutes ago

Yes, in the Wired article one of them says they would like an agent replay feature so they could find out where it got stuck during that hour.