this post was submitted on 18 Nov 2025
107 points (93.5% liked)

Linux

10165 readers
661 users here now

A community for everything relating to the GNU/Linux operating system (except the memes!)

Also, check out:

Original icon base courtesy of lewing@isc.tamu.edu and The GIMP

founded 2 years ago
MODERATORS
 

As Snowden told us, video and audio recording capabilities of your devices are NSA spying vectors. OSS/Linux is a safeguard against such capabilities. The massive datacenter investments in US will be used to classify us all into a patriotic (for Israel)/Oligarchist social credit score, and every mega tech company can increase profits through NSA cooperation, and are legally obligated to cooperate with all government orders.

Speech to text and speech automation are useful tech, though always listening state sponsored terrorists is a non-NSA targeted path for sweeping future social credit classifications of your past life.

Some small LLMs that can be used for speech to text: https://modal.com/blog/open-source-stt

top 19 comments
sorted by: hot top controversial new old
[–] grue@lemmy.world 55 points 11 hours ago (1 children)

Home Assistant has been heavily working on that sort of functionality lately.

[–] 9point6@lemmy.world 14 points 10 hours ago (1 children)

Home assistant continues to be fantastic, I remember it was what felt like fairly recently that all we had was OpenHAB and although it was fine, it was a bit of an uphill struggle to do anything.

[–] fonix232@fedia.io 6 points 8 hours ago (1 children)

There were like, about two years between OpenHAB and HA being released. Former debuted in 2011, HA saw first release in 2013.

[–] 9point6@lemmy.world 2 points 7 hours ago

Oh really? I could have sworn HA was a fair bit later than that

I think I used OpenHAB between about 2013 and 2018, then switched to HA around then after discovering it and reading about it for a couple of weeks.

Must have just had my head in the sand then!

[–] brucethemoose@lemmy.world 24 points 11 hours ago* (last edited 10 hours ago) (1 children)

I mean, there are many. TTS and self-hosted automation are huge in the local LLM scene.

We even have open source "omni" models now, that can ingest and output speech tokens directly (which means they get more semantic understanding from tone and such, they 'choose' the tone to reply with, and that it's streamable word-by-word). They support all sorts of tool calling.

...But they aren't easy to run. It's still in the realm of homelabs with at least an RTX 3060 + hacky python projects.


If you're mad, you can self-host Longcat Omni

https://huggingface.co/meituan-longcat/LongCat-Flash-Omni

And blow Alexa out of the water with a MIT-licensed model from, I kid you not, a Chinese food delivery company.


EDIT

For the curious, see:

Audio-text-to-text (and sometimes TTS): https://huggingface.co/models?pipeline_tag=audio-text-to-text&num_parameters=min%3A6B&sort=modified

TTS: https://huggingface.co/models?pipeline_tag=text-to-speech&num_parameters=min%3A6B&sort=modified

"Anything-to-anything," generally image/video/audio/text -> text/speech: https://huggingface.co/models?pipeline_tag=any-to-any&num_parameters=min%3A6B&sort=modified

Bigger than 6B to exclude toy/test models.

[–] fonix232@fedia.io 2 points 8 hours ago

I do wish there was a smaller LongCat model available. My current AI node has a hard 16GB VRAM limit (yay AMD UMA limitations), so 27B can't really fit. An 8B dynamically loaded model would fit, and run much better.

[–] moodwrench@lemmy.world 13 points 10 hours ago (2 children)

It's not lack of software, it's lack of hardware. Home assistant is ready as are others, but there's no good cheap mic/speaker/esp in a box hardware

[–] fonix232@fedia.io 6 points 8 hours ago (1 children)

The HA Voice Preview is a pretty solid device, but you're right, there isn't really any ready made Echo/Google Home Mini replacement device - primarily because all those devices are generally sold at a loss, or at cost at best, and subsidised by your data being sold.

You won't be able to make a Google Home Mini contender for below $50, and at that price most people will opt for the former. Good quality speakers, microphones, local processing (like the XMOS chip in the Voice Preview) all cost money, and there's no subsidy to be made. Some older Echo devices are rootable, but the hardware tends to be somewhat exotic (meaning no open source support for specialised components), and there's little ongoing third party support (focus has been on the display-equipped models, and to run Android on them).

All in all, "cheap" and "fully local open source voice assistant" don't really coexist.

[–] TechLich@lemmy.world 1 points 2 hours ago

The issue with that is there isn't an expensive option either. The only thing close is the home assistant voice preview and it's still very "preview". There's not really any way to do it well at any price point right now.

[–] Beacon@fedia.io -2 points 10 hours ago (3 children)

No, home assistant very much is not ready to replace an Alexa device. Home assistant mainly only does automation of smart devices, and as far as i can see from their website it does nothing else. One of the main things people use Alexa for is to play music from services like Spotify, and home assistant doesn't appear to do that.

[–] moodwrench@lemmy.world 3 points 10 hours ago (1 children)

Sorry.. my experience has been trying to move my google home to something open with no cloud... it's not been perfect for me after moving. Definitely things missing, but lots of things are better. Spotify does work with home assistant.. maybe look again or send a pr

[–] Beacon@fedia.io 0 points 9 hours ago (1 children)

It isn't listed anywhere on their homepage or example demos or anywhere listing its capabilities, so i did a web search to find it and I found that it sorta just kinda can do Spotify, but (1.) that isn't listed anywhere on the home assistant abilities listing pages, which shows just how not ready for the mass market it is, and (2.) takes a ridiculous amount of very techie setup just to get it to work

https://www.home-assistant.io/integrations/spotify/

And also, out of the box can i ask it to:

  • tell me the weather?

  • set a timer?

  • set an alarm?

I don't see anything on the website that says it can do these things. And even if it can (which doesn't appear to be the case from their website) then the fact that the website doesn't say it can do these things is a problem in itself that shows it isn't ready for the mass market

Just look at the webpage for Alexa vs. Home Assistant and it's clear that Alexa has a very wide variety of abilities and is designed to be easy to use by anyone, while the home assistant website only shows it doing smart device automation and looks like it's not for regular folks

https://www.amazon.com/dp/B0DCCNHWV5

https://www.home-assistant.io/

I would LOVE to replace my Alexa devices with a local FOSS system, but unfortunately home assistant isn't close to being able to do that yet

[–] fonix232@fedia.io 2 points 8 hours ago

I'm sorry, what?

Googling "home assistant Spotify" results in the very link you've provided.

And you can hardly expect a project like Home Assistant, with THOUSANDS of first party integrations, to cater to your specific needs, or to provide preferential treatment to companies like Spotify, who provide absolutely no support to the project.

It also doesn't require a "techie setup", but following a quite straightforward guide, that culminates in clicking about maybe a dozen buttons (most of them being "I accept" to various terms and policies), then copying a handful of readily provided strings into the right fields. It's simple enough that even my tech illiterate father can do it.

Home Assistant at the end of the day is NOT an Alexa (or other voice assistant) replacement, but a smarthome control hub OS. That it provides a voice assistant interface is quite secondary to its main mission.

[–] possiblylinux127@lemmy.zip 2 points 9 hours ago (1 children)

Home assistant has a voice assistant feature

[–] Beacon@fedia.io 1 points 9 hours ago

It does, but it still has the same inabilities as the screen interface has

[–] tyler@programming.dev 1 points 1 hour ago

You very clearly dont understand home assistant.

[–] thatradomguy@lemmy.world 4 points 7 hours ago

There also used to be an open source Alexa-like kind of smart speaker that went by Mycroft AI. They were doing crowd funding I believe but that didn't go anywhere and so they eventually stopped working on it. You can still find their stuff on YouTube though: https://www.youtube.com/@MycroftAIForEveryone/videos

[–] rimu@piefed.social 3 points 11 hours ago

Time to get a mic for my home server!

[–] data1701d@startrek.website 1 points 1 hour ago

I need to play with HomeAssistant more. My last bit of hesitation was I was struggling to find a replacement for the announcement and intercom functionality, which is half of what my family uses Alexa for.

It looks like it got announcements with the "broadcast" intent in February; for the intercom, there may be a plugin. This seems like it might have me covered on the intercom front: https://github.com/JoeHogan/ha-intercom

Perhaps I'll mess around with it again once the semester's over; a lot of my family would really like to jump the Amazon ship and certainly be willing to try it if I give them the option.