this post was submitted on 23 Mar 2025
1247 points (98.7% liked)

Technology

72017 readers
3126 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] aramis87@fedia.io 154 points 3 months ago (43 children)

The biggest problem with AI is that they're illegally harvesting everything they can possibly get their hands on to feed it, they're forcing it into places where people have explicitly said they don't want it, and they're sucking up massive amounts of energy AMD water to create it, undoing everyone else's progress in reducing energy use, and raising prices for everyone else at the same time.

Oh, and it also hallucinates.

[–] pennomi@lemmy.world 29 points 3 months ago (9 children)

Eh I’m fine with the illegal harvesting of data. It forces the courts to revisit the question of what copyright really is and hopefully erodes the stranglehold that copyright has on modern society.

Let the companies fight each other over whether it’s okay to pirate every video on YouTube. I’m waiting.

[–] naught@sh.itjust.works 12 points 3 months ago (4 children)

AI scrapers illegally harvesting data are destroying smaller and open source projects. Copyright law is not the only victim

https://thelibre.news/foss-infrastructure-is-under-attack-by-ai-companies/

[–] interdimensionalmeme@lemmy.ml 0 points 3 months ago (1 children)

In this case they just need to publish the code as a torrent. You wouldn't setup a crawler if there was all the data in a torrent swarm.

[–] untakenusername@sh.itjust.works 1 points 3 months ago

I've heard stuff like bittorent doesn't work well when the data is often updated or changed

I might be totally wrong, I've only ever used it once when downloading Wikipedia

load more comments (2 replies)
load more comments (6 replies)
load more comments (39 replies)