June 25, 2025
We're adding 14 languages to the @pipecat_ai native audio, open source, open data, smart turn (semantic VAD) model.
The new languages are: ๐ซ๐ท French, ๐ฉ๐ช German, ๐ช๐ธ Spanish, ๐ต๐น Portuguese, ๐จ๐ณ Chinese, ๐ฏ๐ต Japanese, ๐ฎ๐ณ Hindi, ๐ฎ๐น Italian, ๐ฐ๐ท Korean, ๐ณ๐ฑ Dutch, ๐ต๐ฑ Polish, ๐ท๐บ Russian, and ๐น๐ท Turkish. There are also additional samples for English.
You can use this model with no restrictions, contribute data sets or code, play conversation games online to contribute data, or help us clean the raw data sets!
The model is hosted on @FAL, too.
The model overview and training code: https://t.co/YbiYc7Y8VT
The data classifier app. Help clean and label the open source data sets:
https://t.co/DGJ5wapfMl
If you want to contribute your own voice to the model, go here:
https://t.co/MKhihgtRDQ
I tell everybody, if your voice is in these data sets, voice agents will work better for you than for all your friends, forever and ever. ๐
You can use the model on @FAL completely free, if you're running your Pipecat voice agents on Pipecat Cloud. Docs are here: