Default voice is hi-IN-SwaraNeural (female, Microsoft Edge TTS). Flow: text → Hindi speech → MP4. Static = your photo + voice (needs FFmpeg). D-ID = lip-sync talking head (API key in pipeline/.env).
pipeline/.env
assets/anchor.jpg
config.php