



Smerity
13.2K posts

@Smerity
ML x society. Founding Member of Technical Staff at Project Prometheus. Prev @midjourney, @SFResearch, @CommonCrawl. @Harvard '14, @Sydney_Uni '11. 🇦🇺 in SF.












Introducing Ricursive Intelligence, a frontier AI lab enabling a recursive self-improvement loop between AI and the chips that fuel it. Learn more at ricursive.com

Is the world ready for Metaballs?













Most important to remember is that this is the raw language model. This is literally the equivalent of you hitting <next> on your predictive keyboard. It hasn't been tuned. LMs serve as the base layer of knowledge in many NLP tasks and a better LM almost always helps downstream!

For those not in machine learning, these new results reinforce an underlying narrative of the language modeling community - that if the predictive text in your mobile had a supercomputer behind it, you could tab complete real work. You can see why that excites us in the field ^_^

🚀 Hello, Kimi K2! Open-Source Agentic Model! 🔹 1T total / 32B active MoE model 🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models 🔹Strong in coding and agentic tasks 🐤 Multimodal & thought-mode not supported for now With Kimi K2, advanced agentic intelligence is more open and accessible than ever. We can't wait to see what you build! 🔌 API is here: platform.moonshot.ai - $0.15 / million input tokens (cache hit) - $0.60 / million input tokens (cache miss) - $2.50 / million output tokens 🔗 Tech blog: moonshotai.github.io/Kimi-K2/ 🔗 Weights & code: huggingface.co/moonshotai 🔗 Github: github.com/MoonshotAI/Kim… Try it now at Kimi.ai or via API!


Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

