
They do collect other languages - there’s a setting for it in the annotation section, and the dataset downloads let you choose other languages.

e.g.: https://commonvoice.mozilla.org/nl/listen



Although English is the most-contributed language, one of the goals of Common Voice is to support languages that wouldn’t normally receive attention from commercial providers.


The most-contributed language is Catalan, with 3678 hours recorded vs. 3395 hours in English: https://commonvoice.mozilla.org/en/languages (The language list sorts your browser's UI languages ahead of all others, which is why English may appear on top for you.)


Woops! Thanks :-)


Don’t feel bad - it’s not especially obvious. I only thought about it because I’m already familiar with the project.



