Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
mandeepj
18 days ago
|
parent
|
context
|
favorite
| on:
Launch HN: Uplift (YC S25) – Voice models for unde...
> just the process of data gathering and maintaining high quality of data is what we have to figure out as we scale across languages.
À crawler and data ingestion pipeline will not help with that?
zaidqureshi
18 days ago
[–]
Gathering audio data online is not that hard, but getting it accurately labelled is challenging, as the speech understanding systems for those languages aren't there either, so we can't automatically do that
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
À crawler and data ingestion pipeline will not help with that?