Yutong

44 posts

Yutong

Yutong

@gnotuy

All in @ Kimi

Palo Alto Katılım Ocak 2010
681 Takip Edilen3.6K Takipçiler
Yutong
Yutong@gnotuy·
@antonpme Personally I'd like to see that too.
English
0
0
0
48
Anton P. 👽
Anton P. 👽@antonpme·
@gnotuy Impressive work. One question: can we expect to have Kimi to have a 1M context window soon?
English
1
0
8
591
Prompt Injection
Prompt Injection@PromptInjection·
@gnotuy Awesome. Is it all based on moonshotai/Kimi-K2-Base? What changed? Better post-training?
English
1
0
1
216
Yutong retweetledi
Kimi.ai
Kimi.ai@Kimi_Moonshot·
Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000+ tool calls, over 12 hours of continuous execution, with generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization). 🔹Motion-rich frontend - Videos in hero sections, WebGL shaders, GSAP + Framer Motion, Three.js 3D. 🔹Agent Swarms, elevated - 300 parallel sub-agents × 4,000 steps per run (up from K2.5's 100 / 1,500). One prompt, 100+ files. 🔹Proactive Agents - K2.6 model powers OpenClaw, Hermes Agent, etc for 24/7 autonomous ops. 🔹Claw Groups (research preview) - bring your own agents, command your friends', bots & humans in the loop. - K2.6 is now live on kimi.com in chat mode and agent mode. For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blog/kimi-k2-6 🔗 Weights & code: huggingface.co/moonshotai/Kim…
Kimi.ai tweet media
English
938
2.4K
18.1K
7.5M
Yutong
Yutong@gnotuy·
Love seeing open source works. Impressed by what @cursor_ai built on top.
Kimi.ai@Kimi_Moonshot

Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. Seeing our model integrated effectively through Cursor's continued pretraining & high-compute RL training is the open model ecosystem we love to support. Note: Cursor accesses Kimi-k2.5 via Fireworks' hosted RL and inference platform as part of an authorized commercial partnership.

English
0
1
14
1.2K
Yutong
Yutong@gnotuy·
Introducing Kimi K2.5: open-source visual agentic intelligence 🚀State-of-the-art benchmarks: Humanity's Last Exam full set (50.2%), BrowseComp (74.9%) Vision is humanity's native language. When Kimi understands what you see, creation becomes instinctive. No coding, No frontend jargon. Upload a mockup, Share a video, Describe your vision. Kimi turns it into code with taste. Enjoy creation 🌈
Kimi.ai@Kimi_Moonshot

🥝Meet Kimi K2.5, Open-Source Visual Agentic Intelligence. 🔹Global SOTA on Agentic Benchmarks: HLE full set (50.2%), BrowseComp (74.9%) 🔹Open-source SOTA on Vision and Coding: MMMU Pro (78.5%), VideoMMMU (86.6%), SWE-bench Verified (76.8%) 🔹Code with Taste: turn chats, images & videos into aesthetic websites with expressive motion. 🔹Agent Swarm (Beta): self-directed agents working in parallel, at scale. Up to 100 sub-agents, 1,500 tool calls, 4.5× faster compared with single-agent setup. - 🥝K2.5 is now live on kimi.com in chat mode and agent mode. 🥝K2.5 Agent Swarm in beta for high-tier users. 🥝For production-grade coding, you can pair K2.5 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blogs/kimi-k2-… 🔗 Weights & code: huggingface.co/moonshotai/Kim…

English
2
5
60
12.5K
Yutong
Yutong@gnotuy·
@noelhatem The “Vivace” plan is sufficient for most power users. Or you may consider using the API service.
English
1
0
0
142
⚡️ Noel Hatem
⚡️ Noel Hatem@noelhatem·
@gnotuy Well done, what's the best place to get an unlimited version of Kimi K2 Thinking such as in Pro plans?
English
1
0
0
1.1K
Yutong
Yutong@gnotuy·
Today, we're releasing Kimi K2 Thinking, our best open-source model. What makes it different isn't just the benchmarks, though it achieves SOTA results on Humanity's Last Exam, BrowseComp, and other challenging tests. What matters is how it thinks. It reminds me of the minds on our team: always asking the next question, refusing to settle for the first answer, following each thread until it leads somewhere true. This is test-time scaling in its full form, giving models the space to think longer and act more deliberately.
Kimi.ai@Kimi_Moonshot

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built as a thinking agent, K2 Thinking marks our latest efforts in test-time scaling — scaling both thinking tokens and tool-calling turns. K2 Thinking is now live on kimi.com in chat mode, with full agentic mode coming soon. It is also accessible via API. 🔌 API is live: platform.moonshot.ai 🔗 Tech blog: moonshotai.github.io/Kimi-K2/thinki… 🔗 Weights & code: huggingface.co/moonshotai

English
73
197
3.3K
432.3K
Keril 🔮
Keril 🔮@imnotkeril·
@gnotuy Where i can see any examples comparing these models in real-world tasks and case studies?
English
1
0
0
3.7K
Teng Yan
Teng Yan@tengyanAI·
@gnotuy did kimi write this tweet? if so, v impressed
English
1
0
13
4.5K
Yutong
Yutong@gnotuy·
Kimi K2 is the best Open Source model in the LMArena!
Arena.ai@arena

🚨 BREAKING: @Kimi_Moonshot’s Kimi-K2 is now the #1 open model in the Arena! With over 3K community votes, it ranks #5 overall, overtaking DeepSeek as the top open model. Huge congrats to the Moonshot team on this impressive milestone! The leaderboard now features 7 different providers in the top 15 - the most competitive it’s ever been. More insights in the thread 🧵

English
1
0
30
7.8K
Yutong
Yutong@gnotuy·
@jasonzhou1993 We’ve heard the feedback, the API is just too slow! We're on it! 🔧 Scaling GPUs + optimizing inference. Speed boost coming within days.
English
0
0
0
160
Jason Zhou
Jason Zhou@jasonzhou1993·
Kimi K2 - On-par with Claude 4, but 80% cheaper!! I connected Kimi K2 to Claude Code to get a sense of real performance (Kimi Code!) Overall findings: 1. Exceptional coding capability 2. Cost only 20% of Claude 4 (Huge!) 2. Only downside is API is a bit slow 🧵 Below is some experiments I did + how can you test yourself 👇
Jason Zhou tweet mediaJason Zhou tweet media
English
27
102
1.2K
171.6K
Yutong
Yutong@gnotuy·
@LEON_0xx0 @Kimi_Moonshot @YouWareAI Thank you for the support! We’re pushing hard to make it better. You should see a noticeable speed bump within the next few days!
English
1
0
4
235
Leon.M
Leon.M@leon2mcp·
We put Kimi K2 @Kimi_Moonshot to the test on @YouWareAI using actual user queries.The performance is shockingly good, and the cost savings are amazing. Here're 8 test cases from our platform to show you the difference👇
Leon.M tweet media
English
13
7
51
170.3K
Yutong
Yutong@gnotuy·
Awesome thread! Thank you for the real world test. Now I really need to know those beautiful prompts! :)
Leon.M@leon2mcp

We put Kimi K2 @Kimi_Moonshot to the test on @YouWareAI using actual user queries.The performance is shockingly good, and the cost savings are amazing. Here're 8 test cases from our platform to show you the difference👇

English
0
0
10
3.7K
Yutong
Yutong@gnotuy·
Yutong tweet media
ZXX
1
3
34
6.8K