this post was submitted on 27 Dec 2025
402 points (98.8% liked)

Technology

78002 readers
2319 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] tacosanonymous@mander.xyz 90 points 3 hours ago (2 children)

That’s disgusting. Where would you find such torrents?

[–] LodeMike@lemmy.today 50 points 3 hours ago (2 children)
[–] errer@lemmy.world 38 points 2 hours ago (1 children)

The album art torrent is a goldmine. Such a pain in the ass sometimes to find high quality album covers.

load more comments (1 replies)
[–] BigTurkeyLove@lemmy.world 7 points 2 hours ago* (last edited 2 hours ago) (2 children)

People are saying it's 300TB but this link is only 200GB why?

[–] floquant@lemmy.dbzer0.com 26 points 2 hours ago (1 children)

The 200GB is the metadata sqlite database only

[–] Kolanaki@pawb.social 9 points 1 hour ago* (last edited 1 hour ago)

God damn! That's essentially just text, right? Or would it also include album cover art?

[–] LodeMike@lemmy.today 5 points 2 hours ago

Not released yet

[–] randomuser38529@lemmy.world 9 points 2 hours ago* (last edited 2 hours ago)

cue Padme

‘And avoid it?’ — ‘To avoid it, right?!’

[–] blitzen@lemmy.ca 58 points 5 hours ago (2 children)

As far as I’ve read, the database is largely low bitrate files, and some AI. The value here is metadata and preservation of “rare” music.

[–] GraveyardOrbit@lemmy.zip 44 points 5 hours ago

All tracks within the top 99.6% of listens are supposed to be high quality

[–] hietsu@sopuli.xyz 13 points 2 hours ago* (last edited 2 hours ago) (1 children)

Nope, I would not call 160kbps Vorbis low bitrate, it’s roughly quality of 192kbps MP3. Only the ”popularity=0” stuff (so stuff with so few listens that Spotify does not keep record of) were re-encoded to 75kbps Opus, which as a modern codec is much better than it sounds like but of course re-encode is not great for already lossless stuff.

For purists there are those Tidal downloader sites available everywhere for free lossless music, even 24-bit hires FLAC.

[–] blitzen@lemmy.ca 2 points 45 minutes ago

Opus is what I’m encoding my working library to. I like ripping to flac (and archiving them as such), but the advantages to smaller file sizes for the working library are worth it for me. So far, I’m really liking the format.

I keep the archive on spinning hard drives, but the opus library on ssd (which makes browsing much quicker, and no unnecessary spinning up the hard drives.)

[–] ramenshaman@lemmy.world 37 points 1 hour ago (1 children)

Did not see this coming when I built my 40TB NAS

[–] commander@lemmy.world 10 points 1 hour ago* (last edited 1 hour ago) (1 children)

Get to acquiring Seagate external HDDs and shucking them for your own 3.5" drive bays before the data centers get them

[–] ramenshaman@lemmy.world 9 points 1 hour ago

Sadly my wallet is on time out

[–] sbv@sh.itjust.works 34 points 6 hours ago (3 children)

Is this new? Aren't most tracks already available in torrents?

[–] borokov@lemmy.world 73 points 5 hours ago (1 children)

Yep, most of tracks were already available on "various" sources, but this time they directly scraped the whole Spotify database.

It's really nice from them to backup Spotify database on a distributed system, and for free ! This ensure Spotify business won't be endanger in case of critical hardware failure.

[–] halcyoncmdr@lemmy.world 1 points 1 hour ago

So nice of them to help with Spotify's off-site backup.

[–] JASN_DE@feddit.org 28 points 5 hours ago (1 children)

It's new insofar as this is one big scrape. About 300TB iirc.

[–] HeyJoe@lemmy.world 15 points 5 hours ago (6 children)

300tb is a lot, but its kind of crazy to think this entire company only needs 300tb storage arrays to function. I wonder how they handle things internally. I would imagine at least 1 backup server ready to go in HA. I wonder if they have multiple regions across the country that also serves up the same setup.

[–] capuccino@lemmy.world 33 points 4 hours ago (1 children)

They need other 300TB to store all the ads.

[–] mojofrododojo@lemmy.world 3 points 2 hours ago

"Are you an incel with few friends, no job, and a deep seated hate for melanin? COME JOIN ICE!"

[–] Duke_Nukem_1990@feddit.org 3 points 2 hours ago* (last edited 2 hours ago)

Afaik 300 TB is just the most popular music and around a third of all tracks. The blog post on anna's is quite entertaining tho.

[–] JohnEdwa@sopuli.xyz 1 points 2 hours ago

IIRC there's still like 700TB of low popularity music missing, but it is only something like 0.4% of listens.
And they need a more storage overall because they have to set up datecenters around the world - doesn't make sense to stream tens of millions of connections across the ocean. But that also gives all the backups one would need for "free".

[–] OrganicMustard@lemmy.world 1 points 15 minutes ago

There are 245 TB ssd drives now. You can almost fit that in a single drive.

load more comments (2 replies)
[–] navigator@piefed.zip 4 points 2 hours ago (1 children)

Not mine, because I’m not famous enough for people to pirate my music lol. It would be flattering for me to be included in this batch of scraped music.

[–] FatVegan@leminal.space 6 points 1 hour ago

I'd steal your music

[–] fluffykittycat@slrpnk.net 20 points 3 hours ago

Anna's the GOAT

[–] RabbitBBQ@lemmy.world 12 points 1 hour ago

That's nothing compared to my old Napster collection

[–] mrmaplebar@fedia.io 10 points 5 hours ago (10 children)

I wish I could think anything positive about this, but I can't imagine anyone who actually cares about music needs or wants this. Instead it'll almost certainly be used as an illegal and unethical dataset to further train bullshit AI to make slop songs. As easy as it is for people to claim "preservation", I do have to question the motives of stuff like this...

Fuck AI. Support your favorite human artists.

[–] llama@lemmy.zip 7 points 4 hours ago (1 children)

Same it seems useless to me. The real value is knowing how songs relate to each other in terms of being played before/after other songs, and that's only available via internal datasets that they could never scrape anyway.

[–] Telorand@reddthat.com 2 points 2 hours ago
[–] Seasm0ke@lemmy.world 7 points 4 hours ago

I mean, it seems like the perfect avenue to replicate Stremio / Kodi but for music

[–] paper_moon@lemmy.world 4 points 4 hours ago* (last edited 4 hours ago)

I guess it's easier packaged in a torrent rather than individual downloads, but I do question why anyone would need this if all they were doing was training AI, as everything is already available on YouTube for free. You don't need to hack a company to get the audio. Now if you're a human trying to actually listen to the songs, obviously the Spotify torrent will sound much better, assuming the hack captured the higher quality audio streams from Spotify. All the youtube downloaders sound like crap because its like 128kbps m4a (at least on newpipe, though you can do 160kbps opus if you want)

[–] paraphrand@lemmy.world 3 points 1 hour ago

Yeah, the people who are racing to download it all want to use it for profit. AI companies, companies that run databases, etc.

[–] Grimy@lemmy.world 1 points 45 minutes ago

It's a good thing if you are smart enough to understand that AI isn't going away. Universal bought udio, the "legal" variant of the dataset will be used to train models, only they will be closed source, censored and come with a ToS that gives all the rights from the generated music to the record companies from the get go.

At least this gives open source a chance.

load more comments (5 replies)
[–] Eryn6844@piefed.blahaj.zone 9 points 4 hours ago

not sure why you want that much music most of it garbage. i would like some of the podcasts that people dont post anywhere else though. all hail the data hoaders.

[–] morto@piefed.social 8 points 1 hour ago (1 children)

It would be awesome if we had an app that allowed to stream directly from such torrents, and had a user-made recommendation system to replace the discovery algorithm :D

[–] bizzle@lemmy.world 4 points 1 hour ago

Stremio + Torrentio does this for TV but I haven't found an equivalent for music. Hoping to be proven wrong 🤞

[–] ZombieCyborgFromOuterSpace@piefed.ca 7 points 1 hour ago (1 children)

Let's put it all on a Funkwhale server.

[–] roofuskit@lemmy.world 9 points 1 hour ago

Sure, you set it up.

[–] SeventySeven@sh.itjust.works 6 points 1 hour ago (1 children)

Datahoarders are going to go WILD over this

[–] nutsack@lemmy.dbzer0.com 4 points 55 minutes ago* (last edited 54 minutes ago) (1 children)

data hoarders already have everything in here and far more, and the web release versions are a lower priority. thinking of red.sh here

[–] HereIAm@lemmy.world 2 points 44 minutes ago

I don't think many data hoarders are sitting on the AI generated stuff 😁

[–] jaybone@lemmy.zip 4 points 50 minutes ago (1 children)

Wasn’t all of this shit already available as torrents?

[–] FlashMobOfOne@lemmy.world 1 points 34 minutes ago* (last edited 33 minutes ago)

No.

Not everything got torrented after music streamers came into prominence. (Though chances are pretty good you could rip an MP3 off Youtube for whatever you're looking for.)

load more comments
view more: next ›