Ampixa Labs

23 posts

Ampixa Labs banner
Ampixa Labs

Ampixa Labs

@__ampixa__

for Nepal

Katılım Nisan 2026
11 Takip Edilen97 Takipçiler
Sabitlenmiş Tweet
Ampixa Labs
Ampixa Labs@__ampixa__·
🇳🇵 kala-tts : नेपालमै बनेको, पहिलो आफ्नै देवनागरी G2P सहितको खुला-स्रोत नेपाली VITS आवाज। cloud छैन · तपाईंकै CPU मा चल्छ। pip install kala-tts · 🎧 tts.ampixa.com/kala
NE
10
21
126
34K
Ampixa Labs
Ampixa Labs@__ampixa__·
अन्दाज गर्नुहोस्, हामी केमा काम गरिरहेका छौँ? hint : यो लिम्बू(ᤕᤠᤰᤌᤢᤱ ᤐᤠᤴ) लिपि, अर्थात् सिरिजंगा, का लागि बनाइएको Validation Dashboard हो। PDF बाट निकालिएको image मा bounding box लगाइएको छ। दायाँपट्टि देखिएका blocks मा cropped image र त्यसको Noto Sans Devanagari मा equivalent Unicode राखिएको छ।
0
3
12
2.5K
Ampixa Labs
Ampixa Labs@__ampixa__·
@AsimPaudel4 Better model dropping soon with good prosody. you can literally hear the model breathing :)
English
1
0
2
39
Ampixa Labs
Ampixa Labs@__ampixa__·
Well, thanks for the suggestion but there is a better path forward. - download all the CC0 videos like pratinidhi sabha sessions and on youtube cc0 videos - voice activity detection code to identify where people start speaking along with diarization models to identify multiple speakers - run a noise artifact remover like noisereduce, deepfilternet or demucs etc to remove background noise - Build a ASR(speech recognition) model over it - emotion labelling models like emotion_top - human listening on samples we are doing that rn. The hardest part is ASR...
English
1
0
1
12
🔻☭★
🔻☭★@Communist977·
@__ampixa__ @aabhpsy You can hire people to record voice and train with them. I'm sure Nepalese will volunteer to do it for free.
English
1
0
1
32
Ampixa Labs
Ampixa Labs@__ampixa__·
🇳🇵 kala-tts : नेपालमै बनेको, पहिलो आफ्नै देवनागरी G2P सहितको खुला-स्रोत नेपाली VITS आवाज। cloud छैन · तपाईंकै CPU मा चल्छ। pip install kala-tts · 🎧 tts.ampixa.com/kala
NE
10
21
126
34K
Ampixa Labs
Ampixa Labs@__ampixa__·
@razaanstha Please follow. Next week there will be another natural tts based on styleTTS2
English
0
0
1
41
Ampixa Labs
Ampixa Labs@__ampixa__·
@Communist977 @aabhpsy IndicVoices has 23k hours of speech text pairs. Emilia is 46k hours Mls is 44k hours Nepali doesn't have that kind of speech -> text pairs Running ASR is also not viable with CER of around 12 to 18 % and hallucinations on open source ASRs like whisper for nepali
English
1
0
1
26
Ampixa Labs
Ampixa Labs@__ampixa__·
@kingofknowwhere sure, please keep looking. we plan to cover the whole 18 languages. Maithili ra nepali ko root sajilo bhayera yo sajilo huncha nai. G2P banauna parcha. If you know a linguist or prof who is fluent in maithali. please let us know
English
1
0
1
40
Ampixa Labs
Ampixa Labs@__ampixa__·
@pranayaratnasha Well, can you defer some time for the extensive test? there is a better model dropping soon based on updated/evolved styleTTS2 architecture.
English
0
0
0
94
Pranaya Ratna Shakya
Pranaya Ratna Shakya@pranayaratnasha·
@__ampixa__ great love the initial demos seen here going to give it an extensive test to see how far it will reach. Great going on tihs. May be soon we will have a native speaking assitant in our phones rather than english speaking ones. Kudos looking forward to future updates.
English
1
0
2
101
Ampixa Labs
Ampixa Labs@__ampixa__·
@aabhpsy Because of scarcity of Nepali data you will get Hindi prosodies with Nepali speech. The best one so far to start working on is dots.tts by rednote social media team.
English
1
0
1
46
Aabhash Ghimire
Aabhash Ghimire@aabhpsy·
Any already available options even on GPU? Can we continue work from what you have done so far and use GPUs to get production grade cloned audio in natural Nepali like we speak? I don't understand all but I can dig in and learn from 0 but since you guys have pioneered, it would be nice to get insights.
English
2
0
0
48
Ampixa Labs
Ampixa Labs@__ampixa__·
Problem is gathering thousands of hours of tts data, diarize them, run background noise remover models... We first plan to at least have 5000 hour (silver + gold) Nepali speech + text pair db and we are 30% there. We will open source that too.. So maybe within this year we will have multishot voice cloner atleast... But the goal is naturalness and real time inference on cpu, with limited voice
English
1
0
1
181
Ampixa Labs
Ampixa Labs@__ampixa__·
@DemonXnomeD The g2p part is the new architecture. We are also working on upgraded styleTTS2 architecture for more naturalness.
English
1
0
0
165
DemonX
DemonX@DemonXnomeD·
@__ampixa__ Any New architecture?? Or Just training any speech model?
English
1
0
1
178
Ampixa Labs
Ampixa Labs@__ampixa__·
How Kala reads Nepali text: नेपालमा → /ne.pal.ma/ रामले → /ram.le/ (not /raː.mə.leː/ like eSpeak) Akshara parse → schwa-deletion rules → IPA. No black-box character embeddings. Full frontend: github.com/Ampixa/nepa-ne…
Ampixa Labs tweet media
1
0
2
557
Ampixa Labs
Ampixa Labs@__ampixa__·
Dashboard for analyzing Nepal Speech generated from Kokoro. The first of it's kind. Follow to know more
Ampixa Labs tweet media
English
0
0
1
212
Ampixa Labs
Ampixa Labs@__ampixa__·
कि कसो @RabindraMishra ज्यू context: यो नेपालको लागी नेपालमा बनेको Text To Speech प्रणाली बाट बनाइएको हो ।
NE
0
0
0
418
Ampixa Labs
Ampixa Labs@__ampixa__·
Sneak peek
Ampixa Labs tweet media
English
0
0
1
266