It took some time, but we finally got Kokoro TTS (v1.0) running in-browser w/ We...

amelius · 2025-02-07T18:23:51 1738952631

Now that's what I call "server-less" computing!

deivid · 2025-02-07T17:00:56 1738947656

Amazing! I'm interested in models running locally and Kokoro seems amazing. Are you aware of similar models but for Speech to text?

xenova · 2025-02-07T17:31:05 1738949465

We have released a bunch of speech recognition demos (using whisper, moonshine, and others). For example:

- https://huggingface.co/spaces/Xenova/whisper-web

- https://huggingface.co/spaces/Xenova/whisper-webgpu

- https://huggingface.co/spaces/Xenova/realtime-whisper-webgpu

- https://huggingface.co/spaces/webml-community/moonshine-web

alex_young · 2025-02-10T19:27:18 1739215638

The realtime Whisper demo is amazing.

How can I understand what's in the compiled JS though? Is there some source for that?

Ono-Sendai · 2025-02-07T17:25:52 1738949152

whisper

sebastiennight · 2025-02-07T22:26:38 1738967198

This is brilliant. All we need now is for someone to code a frontend for it so we can input an article's URL and have this voice read it out loud... built-in local voices on MacOS are not even close to this Kokoro model

satvikpendem · 2025-02-07T23:07:03 1738969623

There are a few already, I assume MacWhisper will add it. That being said, I am also working on a (crossplatform, in Flutter) UI for this.

sebastiennight · 2025-02-08T22:31:37 1739053897

My understanding is that MacWhisper is a front-end for Whisper.cpp so... it does Speech-to-text? (transcribing what you dictate)

Here I'm talking about the model shared in this thread, which is text-to-speech (reading out loud content from the web)

satvikpendem · 2025-02-08T22:43:41 1739054621

Yes, I am saying they might include features for TTS in addition to their current STT feature set. Seems like many of these sorts of apps are looking to add both to be more full fledged.