Amphion

59 posts


@realamphion

Amphion is a toolkit for Audio, Music, and Speech Generation. GitHub: https://t.co/SDsdCO4M29 HF: https://t.co/3VJSoumiFS

Joined December 2023
54 Following · 388 Followers
Amphion (@realamphion)
🚀 Introducing Emilia-Large: 200K+ Hours of Open-Source Speech Data! We’re excited to release Emilia-Large, the largest TTS pretraining dataset! With 200K+ hours of fully open-source multilingual speech data, it is ready to use for #TTS and #SpeechLM.
Amphion (@realamphion)
✨ What’s New?
- 2x Scale: Expanded the original Emilia dataset from 101K to 200K+ hours with the new Emilia-YODAS dataset.
- Low-Resource Boost: Enhanced support for languages like German, French, and Japanese.
- Commercial Use: Emilia-YODAS is released under CC-BY
Amphion reposted
Amphion (@realamphion)
🔥🔥🔥 MaskGCT is hot, putting Amphion on the GitHub Trending list again!
> SoTA TTS model
> Zero-shot cloning
> Emotional TTS
> Multilingual, now supporting English and Chinese
> Fully non-autoregressive and duration controllable
Try it on HF and discord.gg/fRaQpH7s
Vaibhav (VB) Srivastav (@reach_vb)
Fuck yeah! MaskGCT - New open SoTA Text to Speech model! 🔥
> Zero-shot voice cloning
> Emotional TTS
> Trained on 100K hours of data
> Long form synthesis
> Variable speed synthesis
> Bilingual - Chinese & English
> Available on Hugging Face
Fully non-autoregressive architecture:
> Stage 1: Predicts semantic tokens from text, using tokens extracted from a speech self-supervised learning (SSL) model
> Stage 2: Predicts acoustic tokens conditioned on the semantic tokens.
Synthesised: "Would you guys personally like to have a fake fireplace, an electric one, in your house? Or would you rather have a real fireplace? Let me know down below. Okay everybody, that's all for today's video and I hope you guys learned a bunch of furniture vocabulary!"
TTS scene keeps getting lit! 🐐
lux (@Alice2848126245)
@realamphion In speech conversion, where does the target speaker's timbre information come from?
Satheesh kola (@satkola)
@realamphion Can you release the training/fine-tuning scripts and instructions, so that we can train on an Indian language (Telugu)?
Houdini (@D3crypTor_X)
@realamphion Does it support podcast-like conversations, i.e. more than 2 speakers?
Amphion (@realamphion)
@mohamed17381489 Yes, it is multilingual. We are going to release another checkpoint that supports 6 languages soon.
Love TTS (@mohamed17381489)
@realamphion YEEEEES!!!!! Thank you... Does it support multiple languages? Can we train it to speak Arabic?