If you’re using the Home Assistant voice assistant mechanism (not Alexa/Google/etc.) how’s it working for you?
Given there’s a number of knobs that you can use, what do you use and what works well?
- Wake word model. There’s the default models and custom
- Conservation agent and model
- Speech to text models (e.g. speech-to-phrase or whisper)
- Text to speech models


There seem to be some 3rd party music capable devices in the works but nothing ready made in production.
I am waiting for a better overall speaker and I need a new server that can at least run a small lllm to run the assistant as intents are too ridged