this post was submitted on 19 Nov 2025

The issue was not caused, directly or indirectly, by a cyber attack or malicious activity of any kind. Instead, it was triggered by a change to one of our database systems' permissions which caused the database to output multiple entries into a “feature file” used by our Bot Management system. That feature file, in turn, doubled in size. The larger-than-expected feature file was then propagated to all the machines that make up our network.

The software running on these machines to route traffic across our network reads this feature file to keep our Bot Management system up to date with ever changing threats. The software had a limit on the size of the feature file that was below its doubled size. That caused the software to fail.
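
To make the failure mode described above concrete, here is a minimal sketch of a loader with a hard cap on feature entries. It is not the vendor's actual code: the limit value, the names `parse_features` and `load_with_fallback`, and the one-feature-per-line file format are all assumptions for illustration. Falling back to the last known good file is just one possible mitigation, not something the quoted summary says was in place.

```python
# Hypothetical cap on feature entries; the real limit is not given in the quoted text.
MAX_FEATURES = 200


class FeatureFileTooLarge(Exception):
    """Raised when a feature file holds more entries than the loader allows."""


def parse_features(lines, limit=MAX_FEATURES):
    """Parse one feature per line and fail hard if the cap is exceeded."""
    features = [line.strip() for line in lines if line.strip()]
    if len(features) > limit:
        raise FeatureFileTooLarge(
            f"{len(features)} features exceeds limit of {limit}"
        )
    return features


def load_with_fallback(lines, last_known_good, limit=MAX_FEATURES):
    """Keep serving the previous feature set when the new file is rejected."""
    try:
        return parse_features(lines, limit)
    except FeatureFileTooLarge:
        return last_known_good


# Duplicated entries double the file and push it past the cap.
good = [f"feature_{i}" for i in range(150)]
doubled = good + good

active = load_with_fallback(doubled, last_known_good=good)
print(len(active))
```

With only `parse_features`, the doubled file raises and the caller goes down with it. With `load_with_fallback`, the process keeps running on the older feature set, at the cost of slightly stale bot detection data.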

[–] unexposedhazard@discuss.tchncs.de 5 points 18 hours ago* (last edited 18 hours ago) (1 children)

How about an hour? 10 minutes? Would have prevented this. I very much doubt that their service is so unstable and flimsy that they need to respond to stuff on such short notice. It would be worthless to their customers if that were true.

Restarting and running some automated tests on a server should not take more than 5 minutes.

[–] SMillerNL@lemmy.world 10 points 16 hours ago (2 children)

5 minutes of uninterrupted DDoS traffic from a bot farm would be pretty bad.

[–] ramble81@lemmy.zip 11 points 14 hours ago* (last edited 13 hours ago) (1 children)

5 hours of unintended downtime from an update is even worse.

Edited for those who didn’t get the original point.

[–] SMillerNL@lemmy.world 5 points 13 hours ago (1 children)

It wasn’t an unintentional update though, it was an intentional update with a bug.

[–] ramble81@lemmy.zip 1 points 13 hours ago

Edited. My point still stands.

[–] dafta@lemmy.blahaj.zone 7 points 14 hours ago (1 children)

Significantly better than several hours of most of the internet being down.

[–] SMillerNL@lemmy.world 4 points 13 hours ago

Maybe not updating bot mitigation fast enough would cause an even bigger outage. We don’t know from the outside.