
Darin
139 posts

Darin
@darin_ver
DevRel @FeatherlessAI, AI Engineer, eu/acc


Cursor’s Composer 2 is likely built on Kimi K2.5. The model URL + tokenizer are strong signals. I love this direction: companies mid-train and post-train on top of OSS LLMs. Prediction: open-source model labs will monetize by taking a cut when others build on top of their models and scale to millions of real users. They will enforce this via licensing. That’s the flywheel. That’s how open-source AI thrives.

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.

🚀 Announcing Nanbeige4.1-3B – our latest open-source 3B model mastering reasoning, preference alignment, & agent capabilities! Try it now: huggingface.co/Nanbeige/Nanbe… reddit.com/r/LocalLLaMA/c…

Nationalistic ego is a strange thing. I see in some smaller countries where there are multiple famous researchers / startup founders, things can get complex and messy really quickly. Thankfully in Singapore there is one and only one legendary agi hippo 🦛

Qwen3-TTS is officially live. We’ve open-sourced the full family—VoiceDesign, CustomVoice, and Base—bringing high quality to the open community. - 5 models (0.6B & 1.8B) - Free-form voice design & cloning - Support for 10 languages - SOTA 12Hz tokenizer for high compression - Full fine-tuning support - SOTA performance We believe this is arguably the most disruptive release in open-source TTS yet. Go ahead, break it and build something cool. 🚀 Everything is out now—weights, code, and paper. Enjoy. 🧵 Github: github.com/QwenLM/Qwen3-T… Hugging Face: huggingface.co/collections/Qw… ModelScope: modelscope.cn/collections/Qw… Blog: qwen.ai/blog?id=qwen3t… Paper: github.com/QwenLM/Qwen3-T… Hugging Face Demo: huggingface.co/spaces/Qwen/Qw… ModelScope Demo: modelscope.cn/studios/Qwen/Q… API: alibabacloud.com/help/en/model-…


We’re releasing TranslateGemma, a new family of open translation models with support for 55 languages. 🌐 Available in 4B, 12B, and 27B parameter sizes – they’re designed for efficiency without sacrificing quality.





NeurIPS 2025 papers per 1 Million People 1. Singapore – 64.51 2. Switzerland – 22.13 3. Israel – 11.17 4. UAE – 9.47 5. UK – 7.50 6. US – 7.44 7. Denmark – 7.37 8. Australia – 7.31 9. Canada – 6.93 10. South Korea – 5.78

Introducing the Devstral 2 coding model family. Two sizes, both open source. Also, meet Mistral Vibe, a native CLI, enabling end-to-end automation. 🧵





