Yutong

42

Panda 🐼🐼@pax__in__bello·21 Nis

@gnotuy Is there a coding plan available?

English

0

57

Yutong@gnotuy·20 Nis

We open sourced Kimi K2.6. The next frontier in test-time compute isn't bigger models. It's better organizations of intelligence. The hardest things were never built by one person. They require coordination. Different skills, different contexts, different minds arguing until something better emerges.

Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000+ tool calls, over 12 hours of continuous execution, with generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization). 🔹Motion-rich frontend - Videos in hero sections, WebGL shaders, GSAP + Framer Motion, Three.js 3D. 🔹Agent Swarms, elevated - 300 parallel sub-agents × 4,000 steps per run (up from K2.5's 100 / 1,500). One prompt, 100+ files. 🔹Proactive Agents - K2.6 model powers OpenClaw, Hermes Agent, etc for 24/7 autonomous ops. 🔹Claw Groups (research preview) - bring your own agents, command your friends', bots & humans in the loop. - K2.6 is now live on kimi.com in chat mode and agent mode. For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blog/kimi-k2-6 🔗 Weights & code: huggingface.co/moonshotai/Kim…

English

29

45

937

43.7K

Yutong@gnotuy·21 Nis

@antonpme Personally I'd like to see that too.

English

48

Anton P. 👽@antonpme·21 Nis

@gnotuy Impressive work. One question: can we expect to have Kimi to have a 1M context window soon?

English

0

8

591

Yutong@gnotuy·21 Nis

@PromptInjection Yes. K3 is yet to come.

English

2

50

Prompt Injection@PromptInjection·21 Nis

@gnotuy Awesome. Is it all based on moonshotai/Kimi-K2-Base? What changed? Better post-training?

English

0

1

216

Yutong@gnotuy·20 Nis

@Mind_Of_Machine when it's ready 😄

English

0

21

1.2K

Yutong retweetledi

Kimi.ai@Kimi_Moonshot·20 Nis

Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000+ tool calls, over 12 hours of continuous execution, with generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization). 🔹Motion-rich frontend - Videos in hero sections, WebGL shaders, GSAP + Framer Motion, Three.js 3D. 🔹Agent Swarms, elevated - 300 parallel sub-agents × 4,000 steps per run (up from K2.5's 100 / 1,500). One prompt, 100+ files. 🔹Proactive Agents - K2.6 model powers OpenClaw, Hermes Agent, etc for 24/7 autonomous ops. 🔹Claw Groups (research preview) - bring your own agents, command your friends', bots & humans in the loop. - K2.6 is now live on kimi.com in chat mode and agent mode. For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blog/kimi-k2-6 🔗 Weights & code: huggingface.co/moonshotai/Kim…

English

938

2.4K

18.1K

7.5M

Yutong@gnotuy·20 Mar

Love seeing open source works. Impressed by what @cursor_ai built on top.

Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. Seeing our model integrated effectively through Cursor's continued pretraining & high-compute RL training is the open model ecosystem we love to support. Note: Cursor accesses Kimi-k2.5 via Fireworks' hosted RL and inference platform as part of an authorized commercial partnership.

English

1

14

1.2K

Yutong@gnotuy·27 Oca

Introducing Kimi K2.5: open-source visual agentic intelligence 🚀State-of-the-art benchmarks: Humanity's Last Exam full set (50.2%), BrowseComp (74.9%) Vision is humanity's native language. When Kimi understands what you see, creation becomes instinctive. No coding, No frontend jargon. Upload a mockup, Share a video, Describe your vision. Kimi turns it into code with taste. Enjoy creation 🌈

🥝Meet Kimi K2.5, Open-Source Visual Agentic Intelligence. 🔹Global SOTA on Agentic Benchmarks: HLE full set (50.2%), BrowseComp (74.9%) 🔹Open-source SOTA on Vision and Coding: MMMU Pro (78.5%), VideoMMMU (86.6%), SWE-bench Verified (76.8%) 🔹Code with Taste: turn chats, images & videos into aesthetic websites with expressive motion. 🔹Agent Swarm (Beta): self-directed agents working in parallel, at scale. Up to 100 sub-agents, 1,500 tool calls, 4.5× faster compared with single-agent setup. - 🥝K2.5 is now live on kimi.com in chat mode and agent mode. 🥝K2.5 Agent Swarm in beta for high-tier users. 🥝For production-grade coding, you can pair K2.5 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blogs/kimi-k2-… 🔗 Weights & code: huggingface.co/moonshotai/Kim…

English

2

5

60

12.5K

Yutong@gnotuy·9 Kas

Kimi thinking agent available on mobile!

Kimi K2 Thinking is here! Scale up reasoning with more thinking tokens and tool-call steps. Now live on kimi.com, the Kimi app, and API.

English

3

1

9

1.8K

Yutong@gnotuy·9 Kas

@noelhatem The “Vivace” plan is sufficient for most power users. Or you may consider using the API service.

English

0

142

⚡️ Noel Hatem@noelhatem·7 Kas

@gnotuy Well done, what's the best place to get an unlimited version of Kimi K2 Thinking such as in Pro plans?

English

0

1.1K

Yutong@gnotuy·6 Kas

Today, we're releasing Kimi K2 Thinking, our best open-source model. What makes it different isn't just the benchmarks, though it achieves SOTA results on Humanity's Last Exam, BrowseComp, and other challenging tests. What matters is how it thinks. It reminds me of the minds on our team: always asking the next question, refusing to settle for the first answer, following each thread until it leads somewhere true. This is test-time scaling in its full form, giving models the space to think longer and act more deliberately.

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built as a thinking agent, K2 Thinking marks our latest efforts in test-time scaling — scaling both thinking tokens and tool-calling turns. K2 Thinking is now live on kimi.com in chat mode, with full agentic mode coming soon. It is also accessible via API. 🔌 API is live: platform.moonshot.ai 🔗 Tech blog: moonshotai.github.io/Kimi-K2/thinki… 🔗 Weights & code: huggingface.co/moonshotai

English

73

197

3.3K

432.3K

Yutong@gnotuy·9 Kas

@martin_ict_algo We’re working on capacity expansion.

English

1

100

intelligence 🇦🇺@martin_ict_algo·7 Kas

@gnotuy Its very thorough with its replies and good but very slow it seems

English

0

528

Yutong@gnotuy·7 Kas

@imnotkeril LMarena

English

8

3.4K

Keril 🔮@imnotkeril·7 Kas

@gnotuy Where i can see any examples comparing these models in real-world tasks and case studies?

English

0

3.7K

Yutong@gnotuy·7 Kas

@tengyanAI you bet

English

0

16

3.9K

Teng Yan@tengyanAI·7 Kas

@gnotuy did kimi write this tweet? if so, v impressed

English

0

13

4.5K

Yutong@gnotuy·17 Tem

Kimi K2 is the best Open Source model in the LMArena!

Arena.ai@arena

🚨 BREAKING: @Kimi_Moonshot’s Kimi-K2 is now the #1 open model in the Arena! With over 3K community votes, it ranks #5 overall, overtaking DeepSeek as the top open model. Huge congrats to the Moonshot team on this impressive milestone! The leaderboard now features 7 different providers in the top 15 - the most competitive it’s ever been. More insights in the thread 🧵

English

0

30

7.8K

Yutong@gnotuy·17 Tem

@jasonzhou1993 We’ve heard the feedback, the API is just too slow! We're on it! 🔧 Scaling GPUs + optimizing inference. Speed boost coming within days.

English

160

Jason Zhou@jasonzhou1993·13 Tem

Kimi K2 - On-par with Claude 4, but 80% cheaper!! I connected Kimi K2 to Claude Code to get a sense of real performance (Kimi Code!) Overall findings: 1. Exceptional coding capability 2. Cost only 20% of Claude 4 (Huge!) 2. Only downside is API is a bit slow 🧵 Below is some experiments I did + how can you test yourself 👇

English

27

102

1.2K

171.6K

Yutong@gnotuy·17 Tem

@LEON_0xx0 @Kimi_Moonshot @YouWareAI Thank you for the support! We’re pushing hard to make it better. You should see a noticeable speed bump within the next few days!

English

0

4

235

Leon.M@leon2mcp·17 Tem

We put Kimi K2 @Kimi_Moonshot to the test on @YouWareAI using actual user queries.The performance is shockingly good, and the cost savings are amazing. Here're 8 test cases from our platform to show you the difference👇

English

13

7

51

170.3K

Yutong@gnotuy·17 Tem

Awesome thread! Thank you for the real world test. Now I really need to know those beautiful prompts! :)

Leon.M@leon2mcp

We put Kimi K2 @Kimi_Moonshot to the test on @YouWareAI using actual user queries.The performance is shockingly good, and the cost savings are amazing. Here're 8 test cases from our platform to show you the difference👇

English

10

3.7K

Yutong@gnotuy·17 Tem

ZXX

3

34

6.8K

Yutong@gnotuy·17 Tem

Open source wins!

OpenRouter@OpenRouter

Moonshot AI has surpassed xAI in token market share, just a few days after launching Kimi K2 🎁 We also just put up a free endpoint for Kimi - try it now! 👇

English