this post was submitted on 21 Nov 2025

Technology

[–] FauxLiving@lemmy.world 1 points 17 minutes ago (1 children)

Thanks for the recommendation, I'll look into GLM Air. I haven't looked into the current state of the art for self-hosting in a while.

I just use this model to translate natural language into JSON commands for my home automation system. I probably don't need a reasoning model, but it doesn't need to be super quick. A typical query uses very few tokens (like 3-4 keys in JSON).
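For anyone wiring up something similar, here is a rough sketch of the validation step between the model's reply and the automation system. The key names (`device`, `action`, `value`, `room`) and the action list are made up for illustration and are not the actual schema described above; the point is only that a small model's output should be checked before it is executed.

```python
import json

# Hypothetical schema: these keys and actions are assumptions for illustration.
ALLOWED_ACTIONS = {"turn_on", "turn_off", "set_level"}
REQUIRED_KEYS = {"device", "action"}
OPTIONAL_KEYS = {"value", "room"}


def parse_command(model_output: str) -> dict:
    """Parse a model's reply into a validated command dict, or raise ValueError."""
    # Models often wrap JSON in prose or code fences, so keep only the
    # span from the first "{" to the last "}".
    start = model_output.find("{")
    end = model_output.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object found in model output")
    try:
        command = json.loads(model_output[start:end + 1])
    except json.JSONDecodeError as exc:
        raise ValueError(f"invalid JSON: {exc}") from exc

    missing = REQUIRED_KEYS - command.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    unknown = command.keys() - REQUIRED_KEYS - OPTIONAL_KEYS
    if unknown:
        raise ValueError(f"unknown keys: {sorted(unknown)}")
    if command["action"] not in ALLOWED_ACTIONS:
        raise ValueError(f"unsupported action: {command['action']!r}")
    return command
```

Rejecting unknown keys and actions outright means a hallucinated command fails loudly instead of reaching a device.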

The next project will be some kind of agent. A 'go and Google this and summarize the results' agent at first. I haven't messed around much with MCP Servers or Agents (other than for coding). The image models I'm using are probably pretty dated too, they're all variants of SDXL and I stopped messing with ComfyUI before video generation was possible locally, so I gotta grab another few hundred GB of models.

It's a lot to keep up with.😮‍💨

[–] brucethemoose@lemmy.world 1 points 10 minutes ago

> It’s a lot to keep up with

Massive understatement!

> The next project will be some kind of agent. A ‘go and Google this and summarize the results’

Yeah, you do want more contextual intelligence than an 8B for this.
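As a very rough sketch of what the simplest version of that agent looks like: one search call, then one model call over the results. The `search` and `summarize` callables are placeholders you would supply yourself (a search API wrapper and a call to your local model); neither refers to a real library, and the prompt wording and limits are just assumptions.

```python
from typing import Callable, List


def search_and_summarize(
    question: str,
    search: Callable[[str], List[str]],   # hypothetical: returns result snippets
    summarize: Callable[[str], str],      # hypothetical: sends a prompt to the model
    max_results: int = 3,
    max_chars: int = 2000,
) -> str:
    """Run one search, then ask the model to summarize the top results."""
    snippets = search(question)[:max_results]
    if not snippets:
        return "No results found."
    # Number the sources so the model can refer to them, and cap the total
    # length so the prompt fits in a small local model's context window.
    sources = "\n".join(
        f"[{i}] {text}" for i, text in enumerate(snippets, start=1)
    )
    prompt = (
        f"Question: {question}\n\n"
        f"Sources:\n{sources[:max_chars]}\n\n"
        "Summarize the sources in a few sentences, answering the question."
    )
    return summarize(prompt)
```

The model has to weigh several noisy snippets against the question in a single prompt, which is where the extra contextual intelligence pays off.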

> The image models I’m using are probably pretty dated too

Actually SDXL is still used a lot! Especially for the anime stuff. It just got so much finetuning and tooling piled on.