Repository navigation
talking-avatar
- Website
- Wikipedia
Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
[NeurlPS-2024] The official code of MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Explore the power of Azure Text-to-Speech with interactive talking avatar, Lisa 👩🏻🦱. Choose from multiple languages and avatar styles to bring your text to life.
Interactable AI that have control over your frontend website, It guides your user walk around your website its a salesman / supports.
Talking Avatar: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.
AI Avatar/Anchor: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.
Animated Characters: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.
The text to speech avatar system is a text to speech feature with vision capabilities, that allow customers to create synthetic videos of a 2D photorealistic avatar speaking. The Neural text to speech Avatar models are trained by deep neural networks based on the human video recording samples, and the voice of the avatar .
The official main page of "EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion".
These are output demos of different models for Talking Avatar Generation (TAG).
Talking avatars created using Leonardo.ai, VoiceOverMaker, and D-ID.
A classic layered sprite avatar that listens to your voice, processes your speech with OpenAI's GPT, and responds with high-quality text-to-speech using OpenAI's premium voices.