this post was submitted on 07 Feb 2025
0 points (NaN% liked)

Fediverse

31432 readers
2621 users here now

A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).

If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!

Rules

Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)

founded 2 years ago
MODERATORS
 

We have paused all crawling as of Feb 6th, 2025 until we implement robots.txt support. Stats will not update during this period.

top 4 comments
sorted by: hot top controversial new old
[–] Blaze@lemmy.dbzer0.com 1 points 1 month ago

Forced to use https://lemmy.fediverse.observer/list to see which instances are the most active

[–] hendrik@palaver.p3x.de 0 points 1 month ago (1 children)

Did someone complain? Or why stop?

[–] mesamunefire@lemmy.world 1 points 1 month ago (1 children)

No idea honestly. If anyone knows, let us know! I dont think its necessarily a bad thing, If their crawler was being too aggressive, then it can accidentally DDOS smaller servers. Im hoping that is what they are doing and respecting the robot.txt that some sites have.

[–] ada@lemmy.blahaj.zone 1 points 1 month ago

Gotosocial has a setting in development that is designed to baffle bots that don't respect robots.txt. FediDB didn't know about that feature and thought gotosocial was trying to inflate their stats.

In the arguments that went back and forth between the devs of the apps involved, it turns out that FediDB was ignoring robots.txt. ie, it was badly behaved