Integrating Heygen interactive avatar (lip-sync) with Gemini TTS (gemini-2.5-flash-preview-native-audio-dialog) in TypeScript
7 months ago by Ahmad
Hello,
I’m building an interactive avatar using Heygen. I want to keep Heygen for the avatar video and lip-sync but use Google’s gemini-2.5-flash-preview-native-audio-dialog for text-to-speech.
Questions:
- Can the avatar accept externally generated audio for lip-sync (instead of Heygen’s built-in voices)?
- If yes, what is the correct API/SDK entry point to stream or upload TTS audio so the avatar lip-syncs to it?
- Are there recommended audio formats, sample rates, or buffering requirements?
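To make the format and buffering question concrete, here is a minimal sketch of the kind of client-side preparation that might be needed before any audio is handed to the avatar. It only splits raw mono 16-bit PCM into fixed-duration chunks. The helper name `chunkPcm16`, the 24 kHz sample rate, and the 20 ms chunk size are illustrative assumptions, not confirmed Heygen or Gemini requirements, and no Heygen or Gemini API call is shown because the correct entry point is exactly what is being asked.

```typescript
// Hypothetical helper (not part of any SDK): split mono 16-bit PCM samples
// into chunks of a fixed duration, e.g. for streaming to a lip-sync endpoint.
function chunkPcm16(
  pcm: Int16Array,
  sampleRate: number, // samples per second; 24000 below is an assumption
  chunkMs: number     // chunk duration in milliseconds; 20 below is an assumption
): Int16Array[] {
  if (sampleRate <= 0 || chunkMs <= 0) {
    throw new RangeError("sampleRate and chunkMs must be positive");
  }
  // Number of samples in one chunk, never less than one.
  const samplesPerChunk = Math.max(1, Math.floor((sampleRate * chunkMs) / 1000));
  const chunks: Int16Array[] = [];
  for (let start = 0; start < pcm.length; start += samplesPerChunk) {
    const end = Math.min(start + samplesPerChunk, pcm.length);
    // subarray creates a view on the same buffer, so no samples are copied.
    chunks.push(pcm.subarray(start, end));
  }
  return chunks;
}

// Usage: one second of silence at an assumed 24 kHz, cut into 20 ms chunks.
const oneSecond = new Int16Array(24000);
const frames = chunkPcm16(oneSecond, 24000, 20);
console.log(frames.length, frames[0].length);
```

The final chunk may be shorter than the others. Whether it should be padded with silence, and whether the avatar expects a different sample rate or encoding, depends on the answers to the questions above.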
If anyone has implemented this with Gemini or another external TTS provider, guidance or example code would be very helpful.