← kwindla hultman kramer

Voice AI hardware + WebRTC + Pipecat

July 1, 2025

Voice AI hardware + WebRTC + Pipecat

Here's a @pipecat_ai SmallWebRTCTransport client for the ESP32-S3 family of embedded devices.

The SmallWebRTCTransport is a serverless WebRTC connection designed for voice AI. Link to code and resources below ...

@aconchillo wrote the ESP32 client code, building on work by @_pion, @pbbakkum, and sepfy's libpeer library.

The device in the video is the ESP32-S3-BOX-3B developer kit, with a microphone, speaker, and touch screen. It's connecting to a voice agent running on my laptop. The voice agent uses @openai GPT-4o and the @cartesia_ai Sonic voice model.

Some ideas for things to do next:
- write all the voice agent output to the screen as text
- write LLM-generated images to the screen
- design a little micro-UI kit library for the touch screen and dynamically generate the user interface

ESP32 SmallWebRTCTransport client code is here:

https://t.co/ImPAPH6KrF

You'll also need this Pipecat branch to run the examples/foundational demos with the ESP32 client (until the next Pipecat release):

https://t.co/jmgeXmpNns

Here's the hardware device in the video. (It should be fairly easy to support any device in the S3 family.)

https://t.co/cPERpT65iw

The voice bot in the video is just one of the standard Pipecat test examples:

https://t.co/I74ikUEhEI

  1. https://github.com/pipecat-ai/pipecat-esp32
  2. https://github.com/pipecat-ai/pipecat/pull/2095