Hacker News new | past | comments | ask | show | jobs | submit login

currently we are using silero VAD to detect speech: https://github.com/livekit/agents/blob/main/livekit-plugins/...

it works well for voice activity; though it doesn't always detect end-of-turn correctly (humans often pause mid-sentence to think). we are working on improving this behavior.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: