Arun Bahl (e/reason)

216 posts

Arun Bahl (e/reason)

@arunbahl

Partner, friend, son, brother, dog dad. CEO at @AloeInc. Cognitive science + AI. Advocate for reason in both humans and machines. Specialization is for insects.

San Francisco, CA Katılım Şubat 2009

452 Takip Edilen220 Takipçiler

Arun Bahl (e/reason) retweetledi

Richard Sutton@RichardSSutton·2d

The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.

English

133

964

7.3K

533.8K

Arun Bahl (e/reason) retweetledi

J. M.@jmjjohnson·13 May

Yesterday's panel by @Vancity/@FrontierBC on "The AI Question: What do we actually want from it and who gets to decide?" -- moderated by Gurpreet Jhaj, with @GaryMarcus, @MadisonMills22 (@axios), and @arunbahl (@AloeInc) -- cut right to the heart of the question of how humans will exert or hand over our agency in this technological transition. Closing remark couldn't have put a finer point on it: Gurpreet: "What is one thing you want the founders and investors in this room to do differently when it comes to AI starting today?" Arun: "You still have a lot of choice. The other machine that directs our behavior is the economy. Yet humans can exercise their decision making, and build things in line with their own values. So do that."

English

394

Arun Bahl (e/reason)@arunbahl·27 Eyl

Gift link: puck.news/is-aloe-the-fi…

English

Arun Bahl (e/reason)@arunbahl·27 Eyl

It was a pleasure speaking with @PuckNews' A.I. correspondent @IKrietzberg about how @AloeInc's AI leapt ahead to state-of-the-art – and in doing so ushered in the “Dawn of the Self-Building A.I.” We’ve taken a fundamentally different approach to AI because we are a fundamentally different kind of company: designed from the ground up to support human minds, not exploit them for their attention. We believe this is a prerequisite for AI we can trust.

English

343

Arun Bahl (e/reason) retweetledi

Infinite Books@infinitebooks·22 Eyl

Nietzsche, what a line

English

1.6K

16K

555.9K

Arun Bahl (e/reason) retweetledi

Arthur Schopenhauer@SchopenhauerNow·11 Eyl

“Every man takes the limits of his own field of vision for the limits of the world.”

English

104

7.9K

Arun Bahl (e/reason) retweetledi

François Chollet@fchollet·27 Ağu

Saying that deep learning is "just a bunch of matrix multiplications" is about as informative as saying that computers are "just a bunch of transistors" or that a library is "just a lot of paper and ink." It's true, but the encoding substrate is the least important part here. It's the programs being encoded that are interesting and useful: what they can do, what they can't do, how well they generalize, how efficiently they can be learned, etc.

English

125

241

2.9K

207.4K

Arun Bahl (e/reason) retweetledi

anton 🇺🇸@atroyn·12 Tem

technologists often do themselves a disservice by dismissing philosophy as 'unscientific', pointless, meaningless etc. you are immersed in ideas, and without the tools to apprehend them, you're at their mercy. philosophy is a system of inquiry, not a set of conclusions.

English

159

17.5K

Arun Bahl (e/reason)@arunbahl·10 Ağu

@nainia_ayoub That’s exactly what we’ve been building at @AloeInc - and we just got to sota on GAIA by creating an agent with the ability to write its own composable tools to reason through data as it works. Would love to chat - we’re on the same team.

English

Ayoub Nainia@nainia_ayoub·9 Ağu

This matches what we've been seeing in other domains: LLMs act like accelerators for pattern recognition and idea generation, but don't necessarily strengthen slower, step-by-step logic. The challenge now is designing pedagogy that boosts both inductive and deductive reasoning when AI is in the loop.

Rohan Paul@rohanpaul_ai

A research finds in a standardized critical thinking test, that LLM‑integrated group improved more overall, with a notable gain in inductive reasoning. That adding AI to an established pedagogy did not erode critical thinking Researchers ran a randomized controlled trial with 100 first-year nursing students, splitting them 50 and 50 into traditional problem-based learning and an LLM-assisted version. Neither group had prior exposure to problem-based learning or LLMs. Everyone took the California Critical Thinking Skills Test before and after an 8-week, 16-hour course. After adjusting for starting scores, total critical thinking rose in both groups, but the LLM group improved a bit more, roughly 0.60 points versus 0.50 points, with a p value under 0.01. The clear standout was inductive reasoning. The LLM group showed a marked jump on questions that ask students to generalize from cases, while other subskills like analysis, inference, evaluation, and deduction were similar between groups. Course grades did not differ meaningfully, about 77.6 versus 74.3, which suggests the benefit targeted thinking skills rather than test performance. Why this likely happened is straightforward. The assistant can summarize readings quickly, break a messy case into smaller questions, surface overlooked details, and propose alternative solutions that students can compare to their own, which trains pattern recognition. There is a tradeoff. When the assistant helps structure problems, students may do less slow, step-by-step analysis, which fits the flat results on deductive and evaluative subscales. Overall, pairing an LLM with problem-based learning nudged critical thinking up, and the biggest lift was in pattern-building skills. --- journals. lww. com/nurseeducatoronline/fulltext/2025/07000/randomized_controlled_study_on_the_impact_of.15.aspx

English

3.2K

Arun Bahl (e/reason) retweetledi

Jeremy Howard@jeremyphoward·7 Ağu

The GPT 5 launch included a chart showing 52.8 as a bigger number than 69.1, which in turn is shown as the same magnitude as 30.8. Not quite ASI…

English

891

95.1K

Arun Bahl (e/reason)@arunbahl·7 Ağu

@jburnmurdoch It's unsurprising this coincides with the liftoff of the distraction economy – human thinking breaks in predictable ways when our attention is consistently overburdened. We found something similar. aloe.inc/blog/know-thys…

English

125

John Burn-Murdoch@jburnmurdoch·14 Mar

NEW 🧵: Is human intelligence starting to decline? Recent results from major international tests show that the average person’s capacity to process information, use reasoning and solve novel problems has been falling since around the mid 2010s. What should we make of this?

English

1.7K

4.6K

17K

Arun Bahl (e/reason)@arunbahl·7 Ağu

Today I get to share the AI we affectionately nicknamed Aloe habilis: the AI that builds itself. It creates its own tools, shares them with other Aloes, and can use them to build still-better tools. So excited to show you where we go next.

Aloe@AloeInc

Aloe builds itself. We are now state-of-the-art on the GAIA benchmark of generalist AIs, beating OpenAI, Manus, and Genspark by a wide margin. How? Like other AIs, Aloe uses tools. Unlike other AIs, if Aloe doesn't have the right tool for the job, it creates a new one first. This is composable program synthesis, and it's a new species of AI (we nicknamed it 𝘈𝘭𝘰𝘦 𝘩𝘢𝘣𝘪𝘭𝘪𝘴): when one Aloe increases its capability, they all get more capable together as they share tools and use them to create even better tools. Aloe’s lead over other systems is highest on the most difficult scenarios - there’s plenty of headroom to expand. We are just getting started – this is the floor of what we can do.

English

119

Arun Bahl (e/reason) retweetledi

J. M.@jmjjohnson·1 Ağu

Absolutely nailed it @om: "However, in this new AI-first internet era, AI is your attention manager. So how does Meta translate its past business model of “capture and monetize attention” to “optimize and enhance attention?” The attention economy business model of “endless scroll for ad revenue” fundamentally breaks in the new AI reality." This is why @arunbahl and I started @AloeInc - we fundamentally want to help people reclaim their agency - by equipping them with superhuman attention - and we are certain that the companies that built the Distraction Economy have 0% credibility toward that goal.

Hiten Shah@hnshah

This is the post you want to read about Zuck’s Superintelligence Memo. @om breaks it down and teaches us a thing or two about corporate communications.

English

3.1K

Arun Bahl (e/reason)@arunbahl·6 Haz

@JiahaoQiu99 Fantastic paper! We built @AloeInc to use program synthesis and web tools natively too. Here's me using it on mobile. We're growing our team and would love to chat. youtube.com/watch?v=y6M5wS…

YouTube

English

244

Jiahao Qiu@JiahaoQiu99·27 May

The GAIA game is over, and Alita is the final answer. Alita takes the top spot in GAIA, outperforming OpenAI Deep Research and Manus. Many general-purpose agents rely heavily on large-scale, manually predefined tools and workflows. However, we believe that for general AI assistants: "Simplicity is the ultimate sophistication." 🔗Full paper: arxiv.org/abs/2505.20286 🔗More Details will be updated here: github.com/CharlesQ9/Alita #AI #Agent #LLM

English

26.3K

Arun Bahl (e/reason) retweetledi

Aloe@AloeInc·2 Haz

The distraction economy has been a Bad Thing for our species, full stop. We’re building a better way at Aloe. Tech that helps, not hijacks your attention. Appreciating @parmy’s perspective in @business bloomberg.com/opinion/articl…

English

203

Arun Bahl (e/reason)@arunbahl·13 May

Aloe is a personal generalist AI that clears space for you. Humans didn’t evolve to handle today’s information overload: Aloe knows your context across work and life, brings you the information you need, and handles tasks on your behalf - freeing up your time and mind for the things that actually matter to you. youtu.be/y6M5wSv6pyU?fe…

YouTube

English

Saharsh Agrawal@saharsh·12 May

yc deadline is coming up! if there is nothing else you would rather do than build something of your own, then YC is the best place to get started @arjsahai and I built a demo in 3 days, applied, and got in 3 days later after being interviewed by @garrytan drop your 1-2 sentence pitch and I'll share my thoughts or DM me for application reviews!

English

309

76.5K

Arun Bahl (e/reason)@arunbahl·3 Mar

Scaling test-time thinking is the next locus for advanced capabilities. Still multiplicative to pair the base model with the right strategies – program synthesis, OOM symbolic tools – but it's no longer large/expensive. Training on how to get the right answer, rather than what the right answer is, requires dramatically different data.

English

sarah guo@saranormous·2 Mar

New dominant question is not “is this the end of scaling” but how does the base model quality interact with test-time scaling / (o1/o3 style RL)? Is it multiplicative (a little bit moar in the base model will you get advanced new behaviors in the reasoning version still)?

English

9.2K

Arun Bahl (e/reason)@arunbahl·10 Şub

@ashugarg Agreed on the missing ingredient for the next step change. Human critical thinking is a composite process – five different kinds of reasoning – each requiring a different mechanism and data to build, not gated by raw compute. Would love to chat more, sent you an email.

English

ashu garg@ashugarg·5 Şub

Our industry has a scaling addiction. We're moving faster but innovating less. In 2015 a small AI research community created the core breakthroughs we still use today—transformers, GANs, and reinforcement learning. Each opened new paths to machine intelligence. Now we pour billions of dollars into marginal improvements to a single architecture. The founders I back pursue deeper questions: → How do we build systems that understand causality? → What architectures can move beyond pattern matching? → How do we create genuine reasoning capabilities? Current LLMs have reached what I call "minimum viable intelligence." They'll generate billions in enterprise value by augmenting knowledge work. But building trillion-dollar companies requires moving past current designs and exploring completely new approaches. Two years ago ChatGPT emerged seemingly from nowhere, transforming our understanding of what AI can do. The next step change may be equally unexpected, coming not from raw computing power but truly understanding the intelligence we aim to build.

English

645

Arun Bahl (e/reason)@arunbahl·28 Ara

@Scobleizer Still following! Let’s catch up at CES. Hope all is well in your world.

English

Robert Scoble@Scobleizer·27 Ara

I have muted everyone in tech. I am unmuting now that I know that it hurts your reach. But only if you are following me and you leave a comment here. That way I know you haven't muted me. I will leave the rest muted because Elon said that if I bug those who have muted me, I'll be marked as a spammer and penalized. Everyone is on my lists: x.com/scobleizer/lis…

English

1.5K

1.2K

125.4K

Arun Bahl (e/reason) retweetledi

Dylan O'Sullivan@DylanoA4·4 Ara

ZXX

2.4K

13.6K

542.1K

Keşfet

@Vancity @FrontierBC @GaryMarcus @MadisonMills22 @axios @AloeInc @PuckNews @IKrietzberg