OpenMOSS (@Open_MOSS) - Perfil de Twitter | Zamantika Mersobahis Locabet

Tweet fijado

OpenMOSS@Open_MOSS·11 Şub

🚀 The MOSS-TTS Family is here. From zero-shot cloning to real-time VoiceAgents, we have released our most powerful suite of audio models yet. The Lineup: MOSS-TTS Flagship: The industry's best zero-shot voice cloning. Features precise control over duration & Pinyin, capable of generating 1 hour of speech. MOSS-TTSD-v1.0: A new standard for dialogue generation. Comprehensive optimization for conversational scenes and small languages. Best-in-class performance in all evaluations. MOSS-VoiceGenerator: One-shot timbre generation. Create voices with a single sentence and complex instruction handling. MOSS-TTS-Realtime: Built for the next era of VoiceAgents. Synthesis starts in just 2 characters for instant response. MOSS-SoundEffect: Text-to-Audio sound effects to expand your creative toolkit. 🔥 Try it now: studio.mosi.cn/voice-synthesis 💻 Deploy (GitHub): github.com/OpenMOSS/MOSS-… 🔌 API Docs: studio.mosi.cn/docs/moss-tts Welcome to our demo. The era of 'childhood' for TTS is over. #MOSS #AI #TextToSpeech #TTS #OpenClaw #Agent #OpenMOSS #Opensource #VoiceAgent

English

7

5

21

1.6K

OpenMOSS@Open_MOSS·1 Mar

Our bench can also test image edit models! It's a truly unified multimodal generative reasoning benchmark testing video models, image edit models and VLMs. Results on mini test set: (6/6)

English

0

109

OpenMOSS@Open_MOSS·1 Mar

CVPR2026 🎉 Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm 🌟We use video frames as a unified medium for text and vision reasoning. 🤯 🔥Video model (Sora-2) beats GPT-5 by 10% on Eyeballing Puzzles! 🧵arxiv.org/abs/2511.04570 (1/6) #CVPR2026 #seedance2 #Multimodal #VideoGeneration #Sora2 #Reasoning #LLM #AI

English

5

10

17

1.4K

OpenMOSS@Open_MOSS·1 Mar

What about text-heavy logic? Sora-2 takes a prompt + image, and generates a video "writing" the step-by-step solution. It even reads the answer via audio! 🔊 Staggering results: 🎯 MATH: 92% 🎯 MMMU: 69.2% (5/6)

English

0

96

OpenMOSS@Open_MOSS·1 Mar

Sora-2 solves complex visual puzzles (color filling, shape drawing) by understanding symmetry, gradients, and composition. On Visual-Shape tasks, Sora-2's inductive reasoning actually matches Claude 3.5 Sonnet! 🎨🧩 (4/6)

English

0

94

OpenMOSS@Open_MOSS·1 Mar

We introduce VideoThinkBench to test this. On "Eyeballing Puzzles", Sora-2 reasons by simulating light reflection and manipulating geometry. Result? It outperforms SOTA VLMs and scores 10% higher than GPT-5! 📈🧩 All code and data are open-sourced: github.com/tongjingqi/Thi… (3/6)

English

0

104

OpenMOSS@Open_MOSS·1 Mar

Current LLM/VLM paradigms ("Thinking with Text/Images") have limits: static images lack dynamics, and split modalities hinder understanding. Our fix: Thinking with Video. Video frames as a unified medium to draw/write reasoning steps! ✍️🎥 Project: thinking-with-video.github.io (2/6)

English

0

122

OpenMOSS@Open_MOSS·18 Şub

Happy Chinese New Year 🐎 MOSS-TTSD-v1.0 vs eleven v3 from 11 labs Which is better? Welcome Feedback! github.com/OpenMOSS/MOSS-… #OpenMOSS #OpenSource #AI #LLM #TTS #Speech

English

1

6

494

OpenMOSS@Open_MOSS·13 Şub

We built a Complete Replacement Model (CRM) that fully sparsifies a language model. This brings many changes to circuit tracing and global circuits. Congratulations to Zhengfu and his team!!!

Zhengfu He@ZhengfuHe

We built a Complete Replacement Model (CRM) that fully sparsifies a language model. This brings many changes to circuit tracing and global circuits. (1/n)

English

1

0

6

251

OpenMOSS@Open_MOSS·12 Şub

@Misternab Yeah it supports French. We will launch huggingface space demo recently. We will support email signup ASAP.

English

0

1

108

Nabil Garbi@Misternab·12 Şub

@Open_MOSS Does it support French ? i tried to signup for use by API but i dont find signup button.

English

1

0

66

OpenMOSS@Open_MOSS·11 Şub

🚀 The MOSS-TTS Family is here. From zero-shot cloning to real-time VoiceAgents, we have released our most powerful suite of audio models yet. The Lineup: MOSS-TTS Flagship: The industry's best zero-shot voice cloning. Features precise control over duration & Pinyin, capable of generating 1 hour of speech. MOSS-TTSD-v1.0: A new standard for dialogue generation. Comprehensive optimization for conversational scenes and small languages. Best-in-class performance in all evaluations. MOSS-VoiceGenerator: One-shot timbre generation. Create voices with a single sentence and complex instruction handling. MOSS-TTS-Realtime: Built for the next era of VoiceAgents. Synthesis starts in just 2 characters for instant response. MOSS-SoundEffect: Text-to-Audio sound effects to expand your creative toolkit. 🔥 Try it now: studio.mosi.cn/voice-synthesis 💻 Deploy (GitHub): github.com/OpenMOSS/MOSS-… 🔌 API Docs: studio.mosi.cn/docs/moss-tts Welcome to our demo. The era of 'childhood' for TTS is over. #MOSS #AI #TextToSpeech #TTS #OpenClaw #Agent #OpenMOSS #Opensource #VoiceAgent

English

7

5

21

1.6K

OpenMOSS@Open_MOSS·12 Şub

@mohamed17381489 Yes, it supports!

English

0

1

61

Love TTS@mohamed17381489·11 Şub

@Open_MOSS does it support arabic ? any plans to support the language?

English

1

0

71

OpenMOSS@Open_MOSS·11 Şub

MOSS-TTSD-v1.0 is a brand-new conversational speech generation model, which also supports ultra-long sequences and multilingual synthesis.

English

0

206

OpenMOSS@Open_MOSS·11 Şub

MOSS-TTS is our flagship model, trained on millions of hours of high-quality multilingual speech data. It supports a wide range of languages, including Chinese, English, French, Spanish, German, Portuguese, Japanese, and Korean. The model features fine-grained duration and phoneme control, as well as the generation of ultra-long speech up to one hour.

English

0

1

2

352

OpenMOSS@Open_MOSS·30 Oca

We took iconic screenshots from classic cinema and remake the scene using #MOVA. 🎬✨ github.com/OpenMOSS/MOVA Our focus was on seamless end-to-end audio & video generation. #AIVideo #OpenSource #ClassicMovies #GenAI

English

0

1

8

608

OpenMOSS@Open_MOSS·29 Oca

Huge shoutout to the SGLang community @lmsysorg for their incredible support! 🚀 We are thrilled to announce that MOVA features Day-0 support for SGLang-Diffusion, ensuring high-performance inference right out of the gate.

English

0

9

358

OpenMOSS@Open_MOSS·29 Oca

Sora 2? Closed. Veo 3? Closed. Kling? Closed. 🚫 MOVA? Open. ✅ We’re thrilled to release MOVA (MOSS-Video-and-Audio), a powerhouse foundation model designed for high-fidelity, synchronized video-audio synthesis. ✨ The Magic: Traditional Video model generates sound as an afterthought. MOVA synthesizes sight and sound simultaneously via bidirectional cross-attention. The result? Audio that doesn't just match—it belongs. 18B Active Params (MoE Architecture, 32B in total.) LoRA Support for fine-tuning Production-ready generation pipelines The era of "hollow" AI video is gone. Long live MOVA. 🚀 Star the repo: github.com/OpenMOSS/MOVA #MOVA #SORA2 #Veo3 #OpenSourceAI #VideoGeneration #AI

English

7

4

34

5.1K

OpenMOSS@Open_MOSS·29 Oca

MOVA achieves state-of-the-art (SOTA) performance among open-source models in both human subjective arena evaluations and objective metrics such as lip-sync and audio-visual synchronization, rivaling the capabilities of proprietary closed-source models.

English

0

6

682

OpenMOSS@Open_MOSS·24 Kas

Welcome Nex-N1, a new series of agentic foundational models. Nice Work! 🎉

Tiezhen WANG@Xianbao_QIAN

Welcome Nex-N1, a new series of agentic foundational models, to @huggingface - available in different sizes from 8B, 30B, 32B to 671B - strong in tool-use, web-search and real-world agentic workflow - some SFT dataset has been open sourced Technical report come up soon!

English

0

4

312

OpenMOSS@Open_MOSS·17 Eki

Switch to Libero+ in just a few steps and unlock your VLA’s true generalization ability.

Siyin Wang@wang_siyin

🚀Tired of Libero? Try our Libero-Plus! 🤔Libero’s at 99%, but we’ve found VLA drops points with even minor disturbances. 🤩Switch to Libero+ in just a few steps and unlock your VLA’s true generalization ability. #Embodied #VLA #Robotics

English

0

2

309

OpenMOSS

Descubrir