paari_7
400 posts









#JustIn | #IOC Hikes Industrial Fuel Price To ₹109.59/Litre From ₹87.67/Litre



i gave my @openclaw a speaker and a parenting skill.md, check out this sick demo


Qwen3-TTS: 3 voices, 105s audio in <3min (no Flash Attn,cold start). Opt = ~50s. Qwen3-TTS creates voices from text. "Joe Rogan vibes" → instant host. "Witty comedian" → perfect. @modal L4s. Apache 2.0. Self-hosted. No APIs. Thx @Alibaba_Qwen for real OSS 🙏

Qwen3-TTS: 3 voices, 105s audio in <3min (no Flash Attn,cold start). Opt = ~50s. Qwen3-TTS creates voices from text. "Joe Rogan vibes" → instant host. "Witty comedian" → perfect. @modal L4s. Apache 2.0. Self-hosted. No APIs. Thx @Alibaba_Qwen for real OSS 🙏







Qwen3-TTS: 3 voices, 105s audio in <3min (no Flash Attn,cold start). Opt = ~50s. Qwen3-TTS creates voices from text. "Joe Rogan vibes" → instant host. "Witty comedian" → perfect. @modal L4s. Apache 2.0. Self-hosted. No APIs. Thx @Alibaba_Qwen for real OSS 🙏

NVIDIA just removed one of the biggest friction points in Voice AI. PersonaPlex-7B is an open-source, full-duplex conversational model. Free, open source (MIT), with open model weights on @huggingface 🤗 Links to repo and weights in 🧵↓ The traditional ASR → LLM → TTS pipeline forces rigid turn-taking. It’s efficient, but it never feels natural. PersonaPlex-7B changes that. This @nvidia model can listen and speak at the same time. It runs directly on continuous audio tokens with a dual-stream transformer, generating text and audio in parallel instead of passing control between components. That unlocks: → instant back-channel responses → interruptions that feel human → real conversational rhythm Persona control is fully zero-shot! If you’re building low-latency assistants or support agents, this is a big step forward 🔥


