this post was submitted on 02 Oct 2025
92 points (91.8% liked)

Technology

76672 readers
1890 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] InEnduringGrowStrong@sh.itjust.works 91 points 1 month ago (6 children)

Microsoft says its Agent Mode in Excel has an accuracy rate of 57.2 percent in SpreadsheetBench, a benchmark for evaluating an AI model’s ability to edit real world spreadsheets.

It generates 42.8% bullshit.

[–] jubilationtcornpone@sh.itjust.works 40 points 1 month ago (1 children)

They probably view that as a statistic worth bragging about. It's not. If Excel got calculations right 57.2% of the time it would be completely worthless.

[–] PerogiBoi@lemmy.ca 2 points 1 month ago (1 children)

I asked copilot to look through my every spreadsheet and find how many instances of a category occurred. I was curious to see if it was any good. Gave me 2 different numbers. Neither were correct.

[–] jubilationtcornpone@sh.itjust.works 4 points 1 month ago (2 children)

Copilot: Putting the "Artificial" in Artificial Intelligence.

[–] sirboozebum@lemmy.world 2 points 1 month ago

Fartificial Intelligence

[–] PerogiBoi@lemmy.ca 2 points 1 month ago

The tech behind LLMs could have just been Clippy and everyone would be happy.

[–] MadMadBunny@lemmy.ca 21 points 1 month ago (1 children)

So it achieved the actual proficiency of a middle manager…

[–] MonkderVierte@lemmy.zip 3 points 1 month ago

Decades ago. The company that replaced it's CEO with a LLM thrives.

[–] potoo22@programming.dev 11 points 1 month ago

Just keep regenerating data until it's something the stock holders like. Doesn't matter if it's BS. They're already accustomed to that.

[–] SkaveRat@discuss.tchncs.de 11 points 1 month ago (1 children)

Slightly better than Vegas. Unfortunately, plenty of people are okay with Vegas odds.

[–] Imgonnatrythis@sh.itjust.works 8 points 1 month ago

Not enough accuracy to be useful. Not enough bullshit for politics.