this post was submitted on 21 Nov 2025
823 points (98.2% liked)

Technology

76945 readers
4642 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] brucethemoose@lemmy.world 2 points 6 hours ago* (last edited 6 hours ago) (1 children)

Most aren't really running Deepseek locally. What ollama advertises (and basically lies about) is the now-obselete Qwen 2.5 distillations.

...I mean, some are, but it's exclusively lunatics with EPYC homelab servers, heh. And they are not using ollama.

[–] DandomRude@lemmy.world 2 points 5 hours ago (2 children)

Thx for clarifying.

I once tried a community version from huggingface (distilled), which worked quite well even on modest hardware. But that was a while ago. Unfortunately, I haven't had much time to look into this stuff lately, but I wanted to check that again at some point.

[–] brucethemoose@lemmy.world 2 points 4 hours ago* (last edited 3 hours ago) (1 children)

Also, I’m a quant cooker myself. Say the word, and I can upload an IK quant more specifically tailored for whatever your hardware/aim is.

[–] DandomRude@lemmy.world 1 points 3 hours ago (1 children)

Thank you! I might get back to you on that sometime.

[–] brucethemoose@lemmy.world 2 points 2 hours ago

Do it!

Feel free to spam me if I don’t answer at first. I’m not ignoring you; Lemmy fails to send me reply notifications, sometimes.

[–] brucethemoose@lemmy.world 2 points 4 hours ago

You can run GLM Air on pretty much any gaming desktop with 48GB+ of RAM. Check out ubergarm's ik_llama.cpp quants on Huggingface; that’s state of the art right now.