October 9, 2025
Starter kit for a web app that can see your screen and talk to you.
- @GoogleDeepMind Gemini Live API for voice conversation and vision.
- React front end built with Next.js and voice-ui-kit.
- Deploy to Pipecat Cloud or anywhere you can run @pipecat_ai code.
Come build multimodal, realtime AI stuff with us this Saturday at YC in San Francisco. We're doing an all-day Gemini x Pipecat hackathon.
GitHub repo[1]
Hackathon application[2]
Open source voice-ui-kit docs[3]