We're adding 14 languages to the @pipecat_ai native audio, open source, open…

June 25, 2025

We're adding 14 languages to the @pipecat_ai native audio, open source, open data, smart turn (semantic VAD) model.

The new languages are: 🇫🇷 French, 🇩🇪 German, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇨🇳 Chinese, 🇯🇵 Japanese, 🇮🇳 Hindi, 🇮🇹 Italian, 🇰🇷 Korean, 🇳🇱 Dutch, 🇵🇱 Polish, 🇷🇺 Russian, and 🇹🇷 Turkish. There are also additional samples for English.

You can use this model with no restrictions, contribute data sets or code, play conversation games online to contribute data, or help us clean the raw data sets!

The model is hosted on @FAL, too.

The model overview and training code: https://t.co/YbiYc7Y8VT

The data classifier app. Help clean and label the open source data sets:
https://t.co/DGJ5wapfMl

If you want to contribute your own voice to the model, go here:
https://t.co/MKhihgtRDQ

I tell everybody, if your voice is in these data sets, voice agents will work better for you than for all your friends, forever and ever. 😆

You can use the model on @FAL completely free, if you're running your Pipecat voice agents on Pipecat Cloud. Docs are here:

^[4]