August 22, 2025
This is great.
Relatedly, I'm convinced that the next big step forward in AI application architecture is figuring out good developer tools for doing some inference locally and some in the cloud. With this stack, you can mix and match local and cloud @pipecat_ai services. There's a lot more to do, in this direction, but it's a start!
Thank you to @joshwhiton for the PRs that massively improved Kokoro TTS stability here.
Five neural nets, achieving completely local voice AI, no internet, on an M1 with only 16GB ram.
Neural-based voice activity detection and turn detection means it's interruptible, but never interrupts me, and is able to sit idle and waiting. It's been flawless so far.
12B
