Toby Kim

38 posts

@_doyeob_

Building Nari Labs.

San Francisco · Joined September 2023
222 Following · 6K Followers
Pinned Tweet
Toby Kim
Toby Kim@_doyeob_·
Two undergrads. One still in the military. Zero funding. One ridiculous goal: build a TTS model that rivals NotebookLM Podcast, ElevenLabs Studio, and Sesame CSM. Somehow… we pulled it off. Here’s how 👇
222
578
5.4K
723.3K
Toby Kim
Toby Kim@_doyeob_·
@fabianstelzer Valid point. But there are so many great models on Hugging Face that are being used thousands of times a day - which aren’t from top 10 labs. I just think consumer adoption is lagging behind
1
0
2
410
fabian
fabian@fabianstelzer·
@_doyeob_ correct me if I’m wrong but there is very little demand for non top 10 foundation models so I’m not sure the metaphor makes sense?
1
0
1
467
Toby Kim
Toby Kim@_doyeob_·
Model Training is the new Building Apps. 15 years ago, building a mobile app was hard. But a small technical team could pull it off. Same for model training nowadays. Only blocker is compute. But programs like TRC and other grants are fixing that. Excited for the future of AI.
13
2
80
6.3K
NotebookLM
NotebookLM@NotebookLM·
This just in... the @NotebookLM hosts have some rather exciting news they'd like to share with you all:
269
612
4.2K
998.9K
Toby Kim
Toby Kim@_doyeob_·
@nirajshah Thank you for the kind words! We’re going to keep on building and building :)))
1
0
1
174
Toby Kim
Toby Kim@_doyeob_·
@Sathees89347227 We’re currently working with the @huggingface team to bring Dia to transformers for training / finetune / fast inference.
1
0
2
118
Toby Kim
Toby Kim@_doyeob_·
@anonomize we're going to ship something much greater than a GUI
1
0
3
141
Py Man
Py Man@PyMan_Official·
@_doyeob_ speech is too fast, fix that also
1
0
0
98
Toby Kim
Toby Kim@_doyeob_·
Dia hit +6.5k stars on Github, #1 trending on Hugging Face - in under 48 hours. Thank you for all the support. But we won't stop here. We are building a product to change the future of audio entertainment. Curious? Join the waitlist 👇
19
10
372
27.3K
Toby Kim
Toby Kim@_doyeob_·
+3.4k stars on Github. #2 trending on Hugging Face. All in under 24 hours. Thanks for all the support <3 Here's Dia speaking about our launch:
29
70
1.1K
88.5K
Toby Kim
Toby Kim@_doyeob_·
2k stars in just over 12 hours! thanks for the huge support <3 (also top 5 on huggingface trending!) will be delivering big upgrades to the repo throughout this week.
9
12
254
25K
Toby Kim
Toby Kim@_doyeob_·
People seemed to be enjoying the fire example, so I'm also adding it to the thread here. Thanks for all the support :)))
16
10
260
26.8K
Toby Kim
Toby Kim@_doyeob_·
A fun comparison between Dia-1.6B versus Sesame's CSM-1B on emotional conversations. (contains strong language) 👑Who won?
13
11
174
18.3K
Toby Kim retweeted
camenduru
camenduru@camenduru·
🎧 Dia is a 1.6B parameter text to speech model created by Nari Labs. (Apache 2.0) 🔊 Jupyter Notebook 🥳 Thanks to @_doyeob_ ❤ Nari Labs ❤ 🌐page: tally.so/r/meokbo 🧬code: github.com/nari-labs/dia 🍊jupyter: github.com/camenduru/dia-…
Toby Kim@_doyeob_

Two undergrads. One still in the military. Zero funding. One ridiculous goal: build a TTS model that rivals NotebookLM Podcast, ElevenLabs Studio, and Sesame CSM. Somehow… we pulled it off. Here’s how 👇

3
27
158
11.3K
Toby Kim
Toby Kim@_doyeob_·
3 months later, we had a fully trained 1.6B model. It took longer and was harder than expected, but totally worth it. The best time to start is today!
1
3
199
24.7K
Toby Kim
Toby Kim@_doyeob_·
DeepMind’s How to Scale and HuggingFace’s Ultra-Scale Playbook were super helpful. If you are interested in training large models, go read them now!
2
27
435
39.3K
Toby Kim
Toby Kim@_doyeob_·
But we wanted more. More control over the voices. More freedom in the script. We tried every TTS API on the market — none of them sounded like real human conversation.
2
3
155
29.9K