Technology
This is a most excellent place for technology news and articles.
lol i think that might be the worst/best thing I have seen in a long time
Cuck boy getting pegged by post top op Garfield is definitely not something I had jotted down in my day-at-a-glance.
fuck spez
Art.
What a terrible day to have eyes.
Given that the Internet Archive is the de facto standard way to cite material as seen on a given date (they're a trustworthy party that will probably persist for a long time), that's going to make it harder to cite content on Reddit.
As somebody who often ends up using Reddit like Stack Overflow, and in some cases needs the Internet Archive (IA) to find the original post after it’s been deleted or garbled, I think this is a wake-up call for those who go to Reddit both to get technical help and to post it. More than ever, Reddit is becoming an unreliable place to find answers for old, obscure issues, and if they are going to lock out places like the IA, then I think it’s time people stopped contributing their solutions to Reddit.
Searching anywhere in general is getting shittier and shittier by the day. Web searches are riddled with hallucinated, AI-generated garbage pages. Finding the right answer for difficult problems is getting worse and worse. We are sliding rapidly into Idiocracy.
yup. continuing to feed them traffic after their repeated attacks on the userbase is just sad. stop using them. yeah it sucks the info is gone, but acting like they'll wake up and change is absurd.
It’s another move to protect against AI scraping that isn't paying them for access.
I gave up on Reddit a long time ago. Deleted all.
As long as the previous collections of archives are still intact, we probably don’t need all of their new spam posts in the Wayback Machine anyway.
It is my understanding that if you block the Wayback Machine from indexing your site, it will also delist the history.
They do archive sites against the owner's wishes when they consider it an important site for public archiving, like some news sites. They are under no obligation to delete the archives, and I hope they don’t.
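For anyone wondering what "blocking a crawler" usually looks like in practice: one common mechanism is a robots.txt rule aimed at a specific user agent. Here's a rough sketch of how a well-behaved crawler would check such a rule, using Python's standard `urllib.robotparser`. The rules, the `example.com` URL, and the `ia_archiver` agent string are all assumptions for illustration; nothing in this thread says this is how Reddit is actually doing it, and a block can just as easily be enforced server-side.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, made up for illustration.
# This is NOT Reddit's actual file, and "ia_archiver" is an assumed agent name.
HYPOTHETICAL_RULES = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(user_agent: str, url: str, rules: str = HYPOTHETICAL_RULES) -> bool:
    """Return True if the given robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())  # parse rules from memory, no network access
    return parser.can_fetch(user_agent, url)


# The archive crawler is shut out, everyone else is let in.
print(is_allowed("ia_archiver", "https://example.com/r/technology/"))   # False
print(is_allowed("SomeOtherBot", "https://example.com/r/technology/"))  # True
```

Note that robots.txt is only a request: it works on crawlers that choose to honor it, which is part of why what happens to existing archives is a policy question rather than a technical one.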
Parties have archived the data from pushshift, which cover a lot of Reddit history.
kagis
https://academictorrents.com/details/1614740ac8c94505e4ecb9d88be8bed7b6afddd4
Subreddit comments/submissions 2005-06 to 2024-12
This is the top 40,000 subreddits from reddit's history in separate files. You can use your torrent client to only download the subreddits you're interested in.
I mean, that won't have the past half year or some low-traffic subreddits, but...
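If you grab one of those per-subreddit files and want to pull your own history out of it, something like the following might be a starting point. It's a minimal sketch that assumes the file, once decompressed, is newline-delimited JSON with one record per line, and that each record has `author` and `body` fields; both the format and the field names are assumptions, so check them against the actual dump. Decompression is left out, and the sample data is made up.

```python
import io
import json
from typing import Iterable, Iterator


def iter_matching(lines: Iterable[str], author: str) -> Iterator[dict]:
    """Yield records whose (assumed) "author" field equals the given author."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not valid JSON
        if isinstance(record, dict) and record.get("author") == author:
            yield record


# Made-up sample standing in for a decompressed dump file.
sample = io.StringIO(
    '{"author": "alice", "body": "try reinstalling the driver"}\n'
    '{"author": "bob", "body": "same problem here"}\n'
    'not json\n'
    '{"author": "alice", "body": "fixed in the next release"}\n'
)

matches = list(iter_matching(sample, "alice"))
print(len(matches))  # 2
```

Reading line by line keeps memory use flat, which matters if the real files are large; for an actual dump you would pass an open text stream over the decompressed data instead of the `StringIO` sample.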
People who posted on Reddit ( speaking in the past tense, because who would continue to do so now that we have better things? ) never intended for it to be of limited access. Reddit was a publicly accessible place, and people shared their thoughts and comments on it because it was the frontpage of the internet, so the place of choice to share things with the world. That being scraped should not be a problem. But clearly Reddit didn't want to give you a platform to share your thoughts with the world, they wanted you to donate your thoughts and take it as their property so that they can capitalize on it.
That place is becoming more and more of a shithole. Bots, Ads, trolls, garbage mods… deleted the app last month.
I quit reddit, cold turkey, the day they shut off free API access for 3rd parties. Except for a couple of fairly niche subs I haven't missed it at all.
The company says that AI companies have scraped data from the Wayback Machine, so it’s going to limit what the Wayback Machine can access.
Yeah, wouldn't want those AI companies to get all that data for free. Gotta make 'em pay for it.
This is a huge blow to archivism, thanks to corporate greed and the enshittification of Reddit. Worst MBA-filled POS.
Oh no, someone might not be paying them for their user generated content (!)
To be fair, it's probably best that history forgets this period of the web...
Damn you Spez.
So reddit will become even less valuable
Good plan. Keep locking down your big tech platforms, and we'll all be over here letting folks know where they can find freedom.
Careful. Lemmy is too small to draw the attention of sophisticated, persistent abuse. As a company, Reddit has struggled with revenue and we've all seen those struggles quite publicly. Lemmy instances with those same challenges would probably just fold and close up.
Federated networks give you freedom but the potential for abuse is proportional to that freedom while at the same time, federation is far more expensive taken as a whole.
I am new to Lemmy, is there a fuckreddit sub?
In a way, the entire lemmy community is the fuckreddit sub
Why would you want to spend more time thinking about a dead site?
I just like to laugh at things I dislike. And I also like to see how bad it's getting. I was in the undelete sub and it was amazing.
Yes.
Hi welcome to Lemmy, we hate reddit here.
Fuck Reddit and Fuck Spez.
Fuck Reddit
They can keep their shit for themselves, stopped caring a long time ago.
fucking reddit...
Time to just ignore them and scrape it anyways
OK, I stopped posting on Reddit but left my account and comments in place because I considered them part of the public record. If Reddit is taking that record private, it’s time for me to start removing my content from the platform.
Does anyone know if historical Reddit content will remain in IA? If not, I’m going to have to back up years of content somewhere else.
In lieu of an IPO, u/spez has actively destroyed everything that made Reddit good! Gatekeeping the API, thinking it'll help with making some bigshot LLM some day lol
Once Reddit has mutated a few more times, they'll start erasing stuff themselves. It will be lost to time, and that fills me with hope.
This company limited search crawlers to google, why are you surprised?
Is that even possible?
Technologically, no. Reddit sends out the data to tens of millions of users as part of their normal operations. They would need to block those who collect that data for the IA. Reddit has the very short end of the stick.
The problem is that evading such counter-measures may be criminal in the US. Obviously, EU laws are much harsher.