this post was submitted on 25 Aug 2025
People Twitter
I once asked ChatGPT for an opinion on my blog and gave it the web address. It summarized some older posts accurately enough, so it was definitely making use of the content and not just my prompt. It flattered me by saying "the author shows a curious mind". ChatGPT is good at flattery (in fact, it seems to be trained specifically to do it, and this is part of OpenAI's marketing strategy).
For the record, yes, this is a bit narcissistic, just like googling yourself. Except you do need to google yourself every once in a while to know what certain people, like employers, are going to see when they do it. Unfortunately, I think we're going to have to start doing the same with ChatGPT and other popular models. No, I don't like that, either.
...
OK, being simplistic about the actual workings: anything an LLM outputs is based only on the training data or the prompt; an LLM does not "create" anything.
I really doubt your blog is represented in the training data to a statistically significant degree. So I can only assume that yes, the blog URL you gave was scraped by ChatGPT, along with any other URLs linked from that page that the scraper deemed relevant to the prompt, and all that text was in fact added to the full internal prompt that the actual LLM processed.
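To make the "fetch the page and stuff it into the prompt" idea concrete, here is a rough sketch. It is only an illustration of the general pattern, not how ChatGPT actually does it (that pipeline isn't public as far as I know). The names `build_augmented_prompt`, `html_to_text`, `fetch_url`, and the `max_chars` cutoff are all made up for this example, and following linked URLs is left out.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def html_to_text(html):
    """Reduce an HTML document to its plain text, joined by single spaces."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def fetch_url(url):
    """Plain HTTP fetch (assumed default; a real scraper would be fancier)."""
    with urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")


def build_augmented_prompt(user_prompt, url, fetch=fetch_url, max_chars=4000):
    """Prepend the scraped page text to the user's prompt.

    The model only ever sees the returned string, so whatever it "knows"
    about the page comes from this text, not from its training data.
    max_chars is an arbitrary illustrative limit, not a real product setting.
    """
    page_text = html_to_text(fetch(url))[:max_chars]
    return (
        f"Content retrieved from {url}:\n{page_text}\n\n"
        f"User request:\n{user_prompt}"
    )
```

Passing your own `fetch` function (for example one that returns a canned HTML string) lets you see exactly what text would end up in front of the model without hitting the network.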