this post was submitted on 27 May 2025
1950 points (99.5% liked)
Programmer Humor
23554 readers
2034 users here now
Welcome to Programmer Humor!
This is a place where you can post jokes, memes, humor, etc. related to programming!
For sharing awful code theres also Programming Horror.
Rules
- Keep content in english
- No advertisements
- Posts must be related to programming or programmer topics
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I don't know, some of these guys have acccess to a LOT of code, and even more debate about what those good codebases entail.
I think the other issue is more relevant. Even 128K tokens is not enough for something really big, and the memory and processing costs for that do skyrocket. People are trying to work around it with draft models and summarization models, so they try to pick out the relevant parts of a codebase in one pass and then base their code generation just on that, and... I don't think that's going to work reliably at scale. The more chances you give a language model to lose their goddamn mind and start making crap up unsupervised the more work it's going to be to take what they spit out and shape it into something reasonable.