Eduardo Reis

7.7K posts

Eduardo Reis

@edreisMD

Vice President Strategy @ Hippocratic AI, Founding Radiologist @ Cognita Imaging, AI @StanfordAIMI, Neuroradiologist.

Katılım Eylül 2015

2.7K Takip Edilen1.3K Takipçiler

Eduardo Reis retweetledi

jack@jack·2d

people are sleeping on how excellent goose has become under the hood (interface needs some work but team is pushing). it's a superpower. github.com/block/goose

English

208

424

4.9K

449.1K

Eduardo Reis retweetledi

0xMarioNawfal@RoundtableSpace·1d

Send this article to your agent and thank me later x.com/kevingu/status…

Kevin Gu@kevingu

x.com/i/article/2039…

English

252

3.1K

804.7K

Eduardo Reis retweetledi

Garry Tan@garrytan·1d

I started using the ideas in this and it is so powerful Thank you Ryan (GStack for Openclaw coming soon)

Ryan Carson@ryancarson

x.com/i/article/2039…

English

119

1.7K

384.7K

Eduardo Reis retweetledi

Pedro Franceschi@pedroh96·2d

How I'm using OpenClaw today. I'll open-source my autopilot system soon.

Ashlee Vance@ashleevance

Don't think I've heard any other CEO describe agent use in such detail before. @pedroh96 out here running a $5bn company on OpenClaw Full episode with lots more detail in the replies

English

1.1K

158.5K

Eduardo Reis retweetledi

Gauri Gupta@gauri__gupta·4d

3/ We start with a baseline agent on Tau3 bench and run our system directly on top of it where it: - observes and mines failures from production traces - automatically clusters them into underlying failure modes - converts failure clusters into reusable living eval cases - proposes and experiments multiple harness changes and validates them - accepts only changes that both improve performance and don’t regress on previously fixed failures.

English

8.6K

Eduardo Reis retweetledi

shyamal@shyamalanadkat·4d

clearly the future of self-improving agents and AI FDEs that live within your stack

Gauri Gupta@gauri__gupta

English

4.6K

Eduardo Reis retweetledi

shyamal@shyamalanadkat·4d

evals grounded in real usage are the foundation of systems that compound in quality over time. companies that close the loop between production signals and evaluation will win. @NeoSigmaAI is building the infra to make that possible. very excited for their launch!

Gauri Gupta@gauri__gupta

We @neosigmaai @RitvikKapila are building the future of self-improving AI systems! By closing the feedback loop between production data and system improvements, we help teams capture failures, convert them into structured evaluation signals, and use them to drive continuous improvements in agent behavior. We show how our system works on Tau3 bench across retail, telecom, and airline domains. Agent performance on the validation set (with a fixed underlying model, GPT5.4) improves from 0.56 → 0.78 (~40% jump in accuracy).

English

4.3K

Eduardo Reis retweetledi

Sebastian Caliri@SebastianCaliri·5d

American physicians are remarkably pro-AI, and getting more supportive each year. 76% of doctors believe AI can help their ability to care for patients. And 70% believe that patients’ use of general-purpose AI chatbots for health information is positive / or has no impact. AI will not become a part of American healthcare without buy in from physicians (i.e. individual physicians, not the AMA or other societies, that don't necessarily speak for docs). Misguided ideas like the NY state bill would limit patient access to these tools. But most American doctors understand that banning data centers is bad for patients.

English

120

15.2K

Eduardo Reis retweetledi

Chris Orlob@Chris_Orlob·5d

Gong grew from $200k ARR to $200M ARR and $7.2B valuation in a 5 year span. Buyers told us our demos were 2nd to none. 9 lessons I learned about SaaS demos I'll never forget:

English

996

259.4K

Eduardo Reis retweetledi

Yoonho Lee@yoonholeee·5d

How can we autonomously improve LLM harnesses on problems humans are actively working on? Doing so requires solving a hard, long-horizon credit-assignment problem over all prior code, traces, and scores. Announcing Meta-Harness: a method for optimizing harnesses end-to-end

English

263

1.6K

425.6K

Eduardo Reis retweetledi

Ankit Gupta@agupta·5d

Fun update: I got tired of disliking every email client I’ve ever used and built my own. It’s called Exo (for exoskeleton). It’s Claude Code for my inbox. It manages my inbox for me, and it’s open source. Link to repo + some notable features in thread!

English

100

984

166.8K

Eduardo Reis retweetledi

Salma@Salmaaboukarr·29 Mar

it's time @claudeai

English

215

686

776.4K

Eduardo Reis retweetledi

vitrupo@vitrupo·6d

Jeff Dean says we’re going to have to re-engineer our tools because they were designed for human speed. An AI agent can run 50x faster, but the tools it relies on don’t. So even if the model gets infinitely fast, you only get 2-3x improvement overall. Amdahl’s law still applies.

English

126

169

1.3K

339.2K

Eduardo Reis retweetledi

Olafur Pall Olafsson@olafurpall80·28 Mar

@JoinLifespan Just because epigenetic drift follows a predictable pattern doesn't necessarily mean it's causal in aging. Predictable patterns can form from stochastic reactions. Also you cannot fix all extracellular damages with cellular rejuvenation. More here: olafurpall.substack.com/p/why-aging-is…

English

188.9K

Eduardo Reis retweetledi

David Sinclair@davidasinclair·29 Mar

This paper took us 13 years and is one of the longest papers ever in Cell. Check it out & judge for yourself cell.com/cell/fulltext/…

Olafur Pall Olafsson@olafurpall80

English

271

1.5K

221.1K

Eduardo Reis retweetledi

Bo Wang@BoWang87·6d

Imagine an AI paper after 13 years

David Sinclair@davidasinclair

This paper took us 13 years and is one of the longest papers ever in Cell. Check it out & judge for yourself cell.com/cell/fulltext/…

English

195

59K

Eduardo Reis retweetledi

Dalton (Analyze & Optimize)@Outdoctrination·29 Mar

A groundbreaking new study shows that a mineral deficiency can drive Alzheimer's, and consuming it can reverse it. (🧵1/10)

English

212

1.2K

95.3K

Eduardo Reis retweetledi

shyamal@shyamalanadkat·26 Mar

context is the new code - and right now it’s not easily version-controlled, portable, or auditable

English

Eduardo Reis retweetledi

Patrick OShaughnessy@patrick_oshag·25 Mar

William on how an early stage employee takes way more risk than a founder: "If I'm making $400-500K at Google or Meta and go to an early stage company to get 1% of this company and make $90,000. I've now changed the trajectory of my life, that's a lot of risk. But as a founder, you're not. It's a much higher likelihood that of the next round, regardless of your company, you'll be able to sell some secondary. If it shuts down, you can get employed at a great company, and you have a CEO on your resume. That first employee, they have first employee at a failed company. That's actually not a great resume line item. So we've de-risked the founder, but we haven't de-risked the early stage employee."

Patrick OShaughnessy@patrick_oshag

.@williamhockey is one of the least visible founders in tech relative to what he has created. He co-founded Plaid and is now building Column, a software company that owns a bank, and powers Ramp, Wise, Bilt, Mercury, and others. He funded it himself by borrowing against nearly everything he had in Plaid shares, and has never raised any outside capital. His story matters because so much of the value in our industry gets created through exactly this kind of extreme personal risk. He is maniacal about being the best in the world at his thing, and has spent his entire career betting on himself and doing whatever it takes to win. He also spends a lot of time outside the US (in places like Kinshasa) which has given him a rare perch on the power of the US dollar. We discuss: - Why emerging markets are often the most financially innovative - What owning 100% of his company allows him to do that VC-backed founders cannot - Getting margin called and nearly going bankrupt - Why the best founders are specialists - What it takes to be the best in the world at your thing - How Silicon Valley's consensus culture produces consensus founders - How the US dollar functions as an instrument of national security Enjoy! Timestamps: 0:00 Intro 9:19 Emerging Markets 14:03 Silicon Valley's Elite Consensus Problem 16:03 Rejecting the VC Hamster Wheel 21:45 Equity and Liquidity 26:03 Funding a Bank 29:45 The Necessity of Extreme Founder Risk 37:18 Finding Leverage 45:20 Longevity and Profitability in Banking 48:46 Matching Your Capital Structure to Your Business 51:44 The Unseen Power of the US Dollar 1:02:30 How AI Will Transform Legacy Banks 1:09:23 The Kindest Thing

English

109

104

2.5K

900K

Eduardo Reis retweetledi

Vincent Koc@vincent_koc·25 Mar

More RL goodness!

General Reasoning@GenReasoning

Introducing OpenReward. 🌍 330+ RL environments through one API ⚡ Autoscaled sandbox compute 🍒 4.5M+ unique RL tasks 🚂 Works like magic with Tinker, Miles, Slime Link and thread below.

English

3.5K

Keşfet

@NeoSigmaAI @claudeai @JoinLifespan @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates