this post was submitted on 01 Nov 2025
19 points (88.0% liked)
Hacker News
2916 readers
544 users here now
Posts from the RSS Feed of HackerNews.
The feed sometimes contains ads and posts that have been removed by the mod team at HN.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Thing is Deepseek didn't have any new technology insights or "special sauce"
They just took all the current best practices at the time (high quality machine curated data sets, MoE architecture, etc) and did them as fully and rigorously as possible
It's not like they invented chain of thought/Large Reasoning Model or state-space or anything new at all