Dries Smit

153 posts

Dries Smit

Dries Smit

@DriesSmit1

Researcher at Tufa Labs, working on fundamental AI research. https://t.co/2HNB9DxLWj

Katılım Ekim 2018
184 Takip Edilen249 Takipçiler
Jack Cole
Jack Cole@MindsAI_Jack·
@DriesSmit1 @tufalabs Yeah. At least V2. If you look at the leaderboard now, you'll see me there at the top for now. 🚀
English
1
0
1
16
Dries Smit retweetledi
Jack Cole
Jack Cole@MindsAI_Jack·
Good luck everyone! Especially my friends at @tufalabs ! Looking forward to seeing the innovation and creativity that happens as a result of this contest.
Greg Kamradt@GregKamradt

Today we're launching ARC-AGI-3 135 Novel Environments (nearly 1K levels) we build by hand It is the only unsaturated agent benchmark in the world Each game is 100% human solvable, AI scores <1% This gap between human and AI performance proves we do not have AGI Agents today need human handholding. Agents that beat V3 will prove they don’t need that level of supervision. Agents that beat V3 will demonstrate: * Continual learning - Each level builds on top of each other. You can’t beat level 3 without carrying forward what you learned in levels 1 and 2. * World modeling - Many of the environments require planning actions many actions ahead. AI will have no choice but to build an internal world model for how the environment works, run simulations “in its head” and proceed with an action In our early testing, we’ve seen a few clear failure modes of AI: * Anticipation of future events - If an environment requires that AI set up a scene, and then carry out a scenario (like in sp80), it starts to break down. * Anchoring on early hypothesis - Early in a game it comes up with a hypothesis (even if wrong) and refuses to update its beliefs later. * Thinking it’s playing another game - AI thinks it’s playing chess, pacman. The training data holds hard! One major problem is there is too much data to carry forward in a single context. Models must learn what to remember and what to forget The agent that beats ARC-AGI-3 will have demonstrated the most authoritative evidence of progress towards general intelligence to date We're excited to get this out and excited to see what you think

English
1
2
11
671
Dries Smit
Dries Smit@DriesSmit1·
@vr4300 @symbolica Fantastic result! Is this a single run across all the games, and how long did it take to complete them? Also, was the same harness used for every game without any game-specific hints?
English
0
0
0
601
Dries Smit retweetledi
Noam Brown
Noam Brown@polynoamial·
1987: AI can't win at chess—planning is uniquely human 1997: AI can't win at Go—intuition is uniquely human 2016: AI can't win at poker—bluffing is uniquely human 2023: AI can't get IMO gold—reasoning is uniquely human 2026: AI can't make wise decisions—judgment is uniquely human
Noam Brown tweet mediaNoam Brown tweet media
English
233
412
3.5K
967.2K
Dries Smit retweetledi
Mark Barney
Mark Barney@82deutschmark·
ARC-AGI Discord weekly Sunday community discussion (1pm ET / 18:00 UTC): chill deep-dive on 2025 results! @arcprize Friendly vibes, all welcome 🤝 discord.gg/9b77dPAmcA #ARCAGI Congrats to all the 2025 winners! Check the Hall of Fame for more fan art! 😁🎨
Mark Barney tweet mediaMark Barney tweet mediaMark Barney tweet mediaMark Barney tweet media
English
0
1
4
300
Dries Smit
Dries Smit@DriesSmit1·
Congrats to Jack Cole and the team on achieving third place in ARC-AGI-2!🥳 Great work from @MindsAI_Jack for driving this effort. Thanks to @arcprize for organising another exciting competition this year. We're looking forward to the ARC-AGI-3 competition next year at @tufalabs.
ARC Prize@arcprize

@jm_alexia @PourcelJulien @cedcolas @pyoudeyer @LiaoIsaac91893 @JFPuget @dvhrtm @JDisselh ARC Prize 2025 / Top Score Winner Third Place / MindsAI @ Tufa Labs @MindsAI_Jack, @DriesSmit1, @MohamedOsmanML, @bayesilicon Interview: youtu.be/3lXXfNsWIgo

English
0
0
13
1.3K
Dries Smit retweetledi
Simo Ryu
Simo Ryu@cloneofsimo·
We get transfered SO MANY STUFF out of the box from our ancestors, its ridiculous. My favorite example is that empirically, we seem to have "snake detection module" hard-wired to our brain (that detects snake faster than other things) We are, in so many ways, literally pretrained models
Simo Ryu tweet media
roon@tszzl

people forget about the Baldwin effect when reasoning about the difference between evolution and learning when you recapitulate learning that your ancestors did, when you learn to see, walk, and speak in an unbroken line for thousands of generations, it gets easier each time

English
87
291
5.1K
529K
Dries Smit retweetledi
Mikhail Samin
Mikhail Samin@Mihonarium·
Eliezer Yudkowsky and Nate Soares wrote a book about superintelligence, called If Anyone Builds It, Everyone Dies. It was released last week. The book is now a New York Times bestseller. It is endorsed by leading scientists across many fields, including a "godfather of AI" and the most cited living scientist; by national security experts and former government officials; by journalists; by many other public figures. Max Tegmark, the MIT physics professor, calls it the most important book of the decade. Stephen Fry says it's the most important book he's read for years. No surprise. In the coming years, humanity could develop a mind smarter than all of us combined. The book makes a careful argument for why, with anything remotely like the current state of this field, no one would have any control over what this mind would be or what goals it would have, and humanity will end if it creates a superintelligence. Humanity faces a clear danger of literal extinction caused by AI. This is not an exaggeration; literal deaths of everyone and the end of basically everything of value to us in the universe. And AI companies are in a race to kill everyone. This book is so important because it could help stop humanity on this path to self-annihilation. The book of the day in The Guardian. As The Spectator puts it: "If more and more people understand the danger, wake up and decide to end the “suicide race,” our fate is still in our own hands". Now, a New York Times bestseller. We need to raise the alarm; make the careful case; make it easier for people to be brave to speak up about the danger. While humanity lives, it is still possible for us to stop. To convince the governments to make sure that no one, anywhere, can create a superintelligence until humanity knows how to do that safely. This book can serve as a wake up call for our species. I really hope this is just the beginning.
English
18
20
186
22.2K
Dries Smit retweetledi
Haider.
Haider.@slow_developer·
"1m context window" models after 200k tokens
Haider. tweet media
English
180
408
9.9K
648.1K
Jack Cole
Jack Cole@MindsAI_Jack·
Great writeup by Dr. Dries Smit about his winning approach to the ARC 3 Preview competition. @DriesSmit1 @arcprize @dries.epos/1st-place-in-the-arc-agi-3-agent-preview-competition-49263f6287db" target="_blank" rel="nofollow noopener">medium.com/@dries.epos/1s…
English
1
3
27
2.9K
Dries Smit
Dries Smit@DriesSmit1·
I’m thrilled to share that we won 1st place in the ARC-AGI 3 Agent Preview competition! 🏆🎉 Huge thanks to @ARCPrize and @gregkamradt for organising such an exciting and tense contest. Looking forward to seeing everyone on the Kaggle leaderboard in 2026🚀 @MindsAI_Jack @tufalabs
ARC Prize@arcprize

Learning #3: The collective intelligence of the community is great. We hosted a Agent Preview Competition and awarded prizes to @DriesSmit1 @MindsAI_Jack and Will Dick github.com/wd13ca

English
7
7
84
6.7K
Dries Smit retweetledi
Justin Drake
Justin Drake@drakefjustin·
Yesterday Ethereum turned 10. Today, lean Ethereum is unveiled as a vision—and personal mission—for the next 10 years. We stand at the dawn of a new era. Millions of TPS. Quantum adversaries. How does Ethereum marry extreme performance with uncompromising security and decentralization? TLDR: next-generation cryptography is central to winning both offense and defense. Disclaimer: This is a Drake take™ aimed at a broad audience. A technical deep dive into hash-based post-quantum signatures and SNARKs will follow. A healthy diversity of views across Protocol, the EF, and the broader Ethereum community is expected and welcome. It strengthens us. defense—fort mode Ethereum is special. 100% uptime since genesis. Unrivaled client diversity. $130B in economic security (35.7M ETH staked × $3.7K)—maybe soon $1T. Ethereum is poised to become the bedrock of the internet of value, securing hundreds of trillions over decades, even centuries. Ethereum must survive anything: nation states, quantum computers. Whatever comes. Call it fort mode. If the internet is up, Ethereum is up. If the world is online, the world is onchain. offense—beast mode Ethereum is hungry. “Scale L1, scale blobs” is a strategic urgency inside the EF’s Protocol cluster. Expect low-hanging performance gains over the next 6–12 months. Longer term? Think gigagas L1, teragas L2. Call it beast mode. → 1 gigagas/sec on L1: 10K TPS, ambitious vertical scale → 1 teragas/sec on L2: 1M TPS, sprawling horizontal scale Scale vs decentralization? Why not both. The moon math we need is now tamed: → real-time zkVMs for lean execution → data availability sampling (DAS) for lean data A delicious cherry on top: full chain verification across every browser, wallet, phone. lean upgrades Lean Ethereum proposes bold upgrades across all three L1 sublayers: → lean consensus is beacon chain 2.0: hardened for ultimate security and decentralization, plus finality in seconds; formerly branded as “beam chain” → lean data is blobs 2.0: post-quantum blobs, plus granular blob sizing for a calldata-like developer experience → lean execution is EVM 2.0: a minimal, SNARK-friendly instruction set (possibly RISC-V; pronounced “risk five”), boosting performance while preserving EVM compatibility and its network effects The consensus layer (CL), data layer (DL), execution layer (EL) have each been reimagined from first principles. Together, they unlock fort mode and beast mode. The goal: performance abundance under the constraint of non-negotiable continuity, maximum hardness, and refreshing simplicity. lean cryptography Hash-based cryptography is emerging as the ideal foundation for lean Ethereum. It offers a compelling, unified answer to two megatrends reshaping the ecosystem: → the explosive rise of SNARKs → the looming quantum threat Imagine the leanest cryptographic brick—the hash function—singlehandedly powering L1: → CL: hash-based aggregate signatures upgrade BLS signatures → DL: hash-based DAS commitments upgrade KZG commitments → EL: hash-based real-time zkVMs upgrade EVM re-execution A cryptographic jewel in each of lean CL, lean DL, lean EL. lean craft Lean Ethereum is more than a blueprint for hardening and scaling Ethereum. More than just doubling down on security, decentralisation, and cutting-edge cryptography. It is an aesthetic. An art form. A craft. Think Jiro in Dreams of Sushi. When we can go the extra mile, we do. Minimalism. Modularity. Encapsulated complexity. Formal verification. Provable security. Provable optimality. These are subtle yet important technical considerations. Stay tuned for the post on post-quantum cryptography that will make them explicit. lean legacy After 10 fantastic years, lean Ethereum is a generational oath. To keep Ethereum online no matter what. To scale it without compromise. To make it worthy of those who come next. This is about legacy. We are builders, we are missionaries. We are Ethereum. I hope you join us.
English
348
488
2.1K
311K
Dries Smit retweetledi
Elon Musk
Elon Musk@elonmusk·
The path to solving hunger, disease and poverty is AI and robotics
English
33.2K
13.2K
141.9K
34.7M
Dries Smit retweetledi
Joshua Kushner
Joshua Kushner@JoshuaKushner·
the goal of life is to be excited to go to work and excited to go home
English
292
2K
16.8K
1.9M
Dries Smit retweetledi
vitrupo
vitrupo@vitrupo·
Jeff Clune says early OpenAI felt like spotting alien intelligence on approach. “It's like you're an astronomer.. and a handful of people have the expertise to say, aliens are on the way.” He knew they were on their way. But everything around him still looked normal. What happens when the Trisolarans arrive?
English
39
71
459
93.1K