polygondragons retweetledi

Supertonic just killed ElevenLabs.
A text-to-speech model that runs entirely on your device. No cloud. No API key. No per-character pricing.
2,700 GitHub stars. 100% open source. MIT licensed.
The numbers are wild:
→ 167x faster than real-time on an M4 Pro
→ Only 66M parameters
→ 1,263 chars/sec vs ElevenLabs Flash at 287
→ 1,048 chars/sec vs OpenAI TTS-1 at 55
→ Runs on a Raspberry Pi. Runs on an e-reader in airplane mode.
Reads currency, dates, phone numbers, and technical units correctly without preprocessing. ElevenLabs fails these. OpenAI fails these. Gemini fails these.
Supports 11 platforms and 5 languages. Chrome extension turns any webpage into audio in under a second.
I've watched on-device models lose to cloud APIs for years. This one doesn't lose.
The cloud TTS business just got cooked.

English

























