Sapient Intelligence

28 posts

Sapient Intelligence banner
Sapient Intelligence

Sapient Intelligence

@Sapient_Int

Building efficient & powerful general intelligence through brain-inspired architecture

Palo Alto, CA Katılım Temmuz 2024
2 Takip Edilen4.9K Takipçiler
Sabitlenmiş Tweet
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
Introducing HRM-Text. An ultra-lean 1B-parameter reasoning language model designed to deliver strong general performance with a fraction of the data, compute, and infrastructure. Trained on just 40B structured tokens, HRM-Text achieves competitive performance while using ~1/1000 of the training data of comparable models. The kicker? The full model trains in roughly one day on a $1,000 budget. This opens the door to a new generation of AI that is powerful, accessible, and radically easier to adapt. Theories and research concepts once deemed too expensive to test are officially back in the game. Sapient Intelligence invites you to help us shape a new paradigm for general intelligence.
English
144
243
2.2K
422.2K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
In this benchmark deep-dive, Sapient’s founders William and Guan are joined by research team members Changling and Yasin to unpack HRM-Text’s performance across MATH, DROP, ARC-Challenge, and MMLU. 📊 Beyond the scores, they discuss what each benchmark measures, how HRM-Text compares with larger models, and why efficiency matters. Watch the full discussion to learn more about HRM-Text and Sapient’s leaner path toward general intelligence.
English
35
8
114
72.4K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
HRM-Text 101 is here. This tutorial takes you from zero to one: from setup to fine-tuning to evaluation. Download the base checkpoint. Fine-tune it on a real task. Evaluate the results. End to end, on a single GPU. Watch the tutorial and start building with HRM-Text.
English
36
42
472
176.9K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
Introducing HRM-Text. An ultra-lean 1B-parameter reasoning language model designed to deliver strong general performance with a fraction of the data, compute, and infrastructure. Trained on just 40B structured tokens, HRM-Text achieves competitive performance while using ~1/1000 of the training data of comparable models. The kicker? The full model trains in roughly one day on a $1,000 budget. This opens the door to a new generation of AI that is powerful, accessible, and radically easier to adapt. Theories and research concepts once deemed too expensive to test are officially back in the game. Sapient Intelligence invites you to help us shape a new paradigm for general intelligence.
English
144
243
2.2K
422.2K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
Tomorrow, we will unveil a new path to general intelligence. Lean. Powerful. Efficient. The countdown is on⏳
Sapient Intelligence tweet media
English
35
29
607
44.6K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
It is time to liberate reasoning from language! HRM (Hierarchical Reasoning Model) takes a simple idea from the brain: separating reasoning (thinking) from language. When we think, our brains process information in high-dimensional, abstract streams--deep, instantaneous, and unbounded. Only after we formulate an idea do we compress it into concrete, low-dimensional language for communication. Current LLMs do most of their "thinking" in the token space via Chain-of-Thought. The results are fascinating, but structurally shallow and highly resource-intensive. HRM changes this. By reasoning natively in a dedicated latent space, it unlocks a massive internal "scratchpad." It thinks deeper and is unconstrained by tokens, only translating to language when the thought is fully formed. Deeper reasoning. Way fewer tokens.
Sapient Intelligence tweet media
English
1
2
29
2.4K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
Is bigger always better in AI? 🧠 We've reached incredible SOTAs by brute-forcing scale with astronomical token counts. But consider the human brain: running on just 20W of power and trained on ~1B language tokens, it's still making miracles happen. That efficiency is inspiring. There are smarter paths toward smarter models, and much smarter ways to scale, and this will be the next breakthrough.
English
1
0
15
1.5K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
At Sapient Intelligence, we enable deep, efficient reasoning with our Hierarchical Reasoning Model (HRM)—a brain-inspired, latent-space architecture that moves beyond traditional, data-heavy AI. By decoupling the cognitive load, HRM uses a Slower Controller to guide abstract, deliberate reasoning and a Faster Processor to handle detailed computations. This dual-stream design allows systems to reason, plan, and converge on solutions within latent space.
Sapient Intelligence tweet media
English
1
4
43
2.8K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
Behind the code, there is a specific kind of expertise. We are a team of researchers and engineers rooted in the labs of Tsinghua University, University of Cambridge, University of Alberta, Carnegie Mellon University, and Peking University—with experience at DeepMind, DeepSeek, xAI, and more. We've seen the limits of the current AI architectures firsthand from within the organizations that scaled them. Now, across three countries, we are building an alternative. We aren't just shipping another wrapper; we are shipping a new fundamental architecture.
Sapient Intelligence tweet media
English
11
34
365
47.7K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
We were honored to support the global AI community as a Gold Sponsor of the #AAAI26 Conference on Artificial Intelligence. It was truly inspiring to connect with so many brilliant minds across the industry. The future of AGI isn’t just being imagined, it is being built.
Sapient Intelligence tweet mediaSapient Intelligence tweet media
English
0
1
19
2.5K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
Proud to share that TRM, derived from our HRM model, is highlighted in Nature ! 🎉🎉🎉 This marks an important step forward for HRM-based reasoning systems, demonstrating the strength of small, structured models in complex reasoning tasks.💡
Sapient Intelligence tweet media
English
2
1
12
1.4K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
🔥It’s official-Sapient HRM Discord Community is now live! This is a place to discuss, connect, and collaborate as we shape HRM’s future together. We will be sharing our latest work, releases, and tips, as well as hosting Q&A sessions💬💬 Hop on this journey with us as we push the boundaries of what HRM and AGI at large can achieve!🙌 ➡️Join us on Discord here discord.gg/sapient
Sapient Intelligence tweet media
English
1
5
37
4.9K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
We are on the @arcprize leaderboard now - a good starting point! Meanwhile, we are accelerating the iteration and application of the HRM model; stay tuned!
Guan Wang@makingAGI

Thanks to @arcprize for reproducing and verifying the results! ARC-AGI-1: public 41% pass@2 - semi private 32% pass@2 ARC-AGI-2: public 4% pass@2 - semi private 2% pass@2 Due to differences in testing environments, a certain amount of variance in results is acceptable. According to tests run on our infrastructure, the open-source version of HRM on our GitHub can achieve a score of 5.4% pass@2 on the ARC-AGI-2. We welcome everyone to run it on your own infra and share your scores~ This is our first submission to the leaderboard, and it's a good starting point. We appreciate everyone for your support and feedback on HRM, both before and after our appearance on the ARC leaderboard. All of this encourages and motivates us to improve. The hierarchical architecture is designed to resolve premature convergence in long-horizon tasks, like master-level Sudoku that takes hours for humans to solve. See the comparison with a simple recurrent Transformer. Such a long chain might not be essential for ARC problems, and we only used a high-low ratio of 1/2. Larger ratios are often needed for optimal performance for Sudoku problems. In the case of ARC-AGI, the success of HRM is a testament to the model's ability to exhibit fluid intelligence - that is, its capability to infer and apply abstract rules from independent and flat examples. We are glad it was discovered in a recent blog post that the outer loop and data augmentation are essential for this ability, and we especially thank @fchollet @GregKamradt @k_schuerholt for pointing this out. Finally, we are accelerating the iteration of the HRM model and continuously pushing its limits, with good progress so far. At the same time, we believe the hierarchical architecture is highly effective in many scenarios. Moving forward, we will make further targeted updates to the architecture and validate it on more applications. We will also release an FAQ to address the key questions raised by the community. 🧠 Stay tuned!

English
1
1
18
2.1K
Sapient Intelligence
Sapient Intelligence@Sapient_Int·
Bigger ≠ Better. The GPT-5 rollout reminded everyone that raw scale isn’t a strategy. Real value now lives in agent reliability, not leaderboard one-shots. Our stance: optimize for closed-loop task success (plans → tools → checks → handoff), not just next-token accuracy. We benchmark Sapient HRM against process metrics: tool-call precision, recovery after tool error, and end-to-end SLA success.
English
4
2
23
1.8K