
Considering that Whisper is the best open-weights voice recognition model by a very wide margin if you DO use VAD (voice activity detection), it's quite good. I'd be willing to bet that commercial products that "don't have this problem" are using VAD as well, and that this is well known to them. But Whisper is just the weights and, I suppose, a simple reference implementation, not a full product.

