this post was submitted on 07 Dec 2025
1012 points (98.0% liked)
Technology
77090 readers
3519 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Yeah, context management is one big key. The "compacting conversation" hack is a good one, you can continue conversations indefinitely, but after each compact it will throw away some context that you thought was valuable.
The best explanation I have heard for the current limitations is that there is a "context sweet spot" for Opus 4.5 that's somewhere short of 200,000 tokens. As your context window gets filled above 100,000 tokens, at some point you're at "optimal understanding" of whatever is in there, then as you continue on toward 200,000 tokens the hallucinations start to increase. As a hack, they "compact the conversation" and throw out less useful tokens getting you back to the "essential core" of what you were discussing before, so you can continue to feed it new prompts and get new reactions with a lower hallucination rate, but with that lower hallucination rate also comes a lower comprehension of what you said before the compacting event(s).
Some describe an aspect of this as the "lost in the middle" phenomenon since the compacting event tends to hang on to the very beginning and very end of the context window more aggressively than the middle, so more "middle of the window" content gets dropped during a compacting event.