Ashinator

567 posts

@ashdebugs

Coding, creating, learning, experimenting, and growing — at the same time

India · Joined August 2024
462 Following · 189 Followers
Pinned Tweet
Ashinator@ashdebugs·
Just shipped HanaVerse after months of late nights! 🚀 It's a totally different take on AI chat - your Ollama models come to life as Hana, an anime character who actually TALKS back to you! Had to see if adding a face + voice would make AI convos feel more... human?
Ashinator@ashdebugs·
@ModelScope2022 I tried the demo; it skips words when given longer text (3-4 sentences).
ModelScope@ModelScope2022·
Say hello to MOSS-TTS-Nano 🚀 0.1B multilingual TTS from MOSI.AI and OpenMOSS. Designed for realtime speech generation without a GPU. Runs directly on CPU, keeping the deployment stack simple enough for local demos, web serving, and lightweight product integration. Part of the MOSS-TTS family alongside the 1.7B and 8B flagship models. 🤖 modelscope.cn/models/openmos… 🌍 modelscope.ai/models/openmos… 💻 github.com/OpenMOSS/MOSS-…
Sudarshan Kamath@kamath_sutra·
Introducing Lightning V3 - it beats every model we tested against. ElevenLabs, Cartesia, OpenAI. Lightning sets a new SOTA with V3 in conversational text-to-speech.
→ Highest MOS score for conversational TTS at 3.9
→ ~76% win rate vs gpt-4o-mini-tts on naturalness
→ 15 languages with mid-sentence code-switching
→ Built from scratch for voice agents, not read-aloud
Every TTS model sounds clean in a demo. You type a sentence and you get beautiful audio. Voice agents don't work that way. They stream. They're generating audio in real-time chunks with half the context missing. That's where everything breaks.
A great reading voice and a great conversational voice are fundamentally different things. A conversational voice has to sound like it's thinking - with the pauses, the rhythm shifts, the reactions. It has to handle the way real people actually talk, including switching languages mid-sentence. That's what V3 does.
V3.1 also ships voice cloning. 5 to 15 seconds of audio, no fine-tuning, production-grade clone across 15 languages. Blog link in the comments.
Ashinator@ashdebugs·
@obeydulX Hello all, let's see who will follow me. I will follow back ASAP.
Ashinator@ashdebugs·
Premium Laptop with 4s of battery
Ashinator@ashdebugs·
Win any game by cheating or via collaboration😝
Ashinator@ashdebugs·
We have to keep trying and never give up.
Ashinator@ashdebugs·
KokoClone: Kokoro, but it clones voices now. I built a lightweight pipeline that adds zero-shot voice cloning to Kokoro TTS while keeping its speed and real-time performance. Multilingual. Fast. Lightweight. Open source. Links in thread 👇
Asish Kumar@asishcodes·
We have entered March. Here is what you should be doing if you are planning to submit a GSoC application:
- Start creating your proposal as early as possible and send it to maintainers for feedback.
- If you haven’t selected organizations yet, it’s late but still possible. I’ve seen people start now and still get selected.
- Make as many PRs as you can during this period.
- Include a video in all your PRs to differentiate them from AI-generated ones.
- Don’t waste too much time contributing to multiple organizations. If you have a solid grasp of one, you can even apply to just that single organization.
Ashinator@ashdebugs·
@kadirnardev How many hours of data counts as a small dataset for you?
Kadir Nar@kadirnardev·
I have added much better features to the Echo-DacVae architecture. I have started training the 300M parameter Echo-Dacvae model with a very small dataset. It will finish in 5 hours.
Kadir Nar@kadirnardev

EchoDac-Vae-3.5B 😍

Ashinator@ashdebugs·
@kaspathon @kaspaunchained Note: The live demo will be slow to load and respond because the application is hosted on a free tier. 😅
Ashinator@ashdebugs·
KaspaStream features 👇
• Create paid micro-tasks in plain English from website or Telegram
• Instant task broadcasting (web + Telegram)
• AI-assisted verification
• Worker ranks (D → A) for trust & progress
• Kaspa-based payments designed for speed