fabio casati

459 posts

fabio casati

@sphoebs

father of two

Palo Alto, CA เข้าร่วม Mart 2009

140 กำลังติดตาม249 ผู้ติดตาม

ทวีตที่ปักหมุด

fabio casati@sphoebs·31 Oca

Enterprises are doing AI eval wrong - and it's causing wasted iteration cycles and wrong decision making. I’m preparing my spring semester course on designing large-scale AI systems and I need feedback: What’s the one change you’d make to eval practice + reporting to make it reliable? What's the error you see made often?

English

82.2K

fabio casati@sphoebs·1d

@nic_amadio This is very true. Italian engineers are massively underpaid. Companies that hire and pay well do wonders in Italy - and everywhere.

English

282

Nicola Amadio@nic_amadio·1d

Italy, with €53k average yearly comp, is the 2nd lowest paid country in Europe. Data from Euro Top Tech salary sharing doc: 1k+ contributions, 90 from Italy. Sad. I got a 80k CHF salary in 2018 at 24, in Zurich. Then, a few 100-150k salaries, and, at 28, a $200k offer. Most of my engineer friends in Italy, meanwhile, are making ~2k euros a month. And, while I can't be 100% sure, I don't think they're worse engineers. They went to the best engineering school in Italy, top grades, hard-working, bright. The fact is: there's a HUGE pool of GROSSLY UNDERPAID talent in Italy right now. I had a few clients from Italy, in my coaching program. A few examples: • One went from 40k Onsite Italy to 100k Euro-remote • Another from 35k Onsite Italy to 160k Worldwide-remote • Another from 70k Remote Italy to 200k Onsite Zurich, 300k Onsite London You see how underpaid they were. Not every engineer in Italy invests in a coaching program, or spends the time to find the best tech jobs remotely and abroad. But many of them follow my profile on LinkedIn and are subscribed to The European Engineer newsletter. My latest product - Forfettariato.com - allows you to put your jobs in front of them. A $50-100k remote job will get you an engineer that you'll pay 200k in London or Zurich. I care about the cause, and I made the entry price super low: just 50 euro per month per job ad, for the first 20 customers. If you're hiring tech talent, this is your chance.

English

217

39.1K

fabio casati@sphoebs·16 Mar

@ylecun @haider1 Why S?

English

Yann LeCun@ylecun·16 Mar

@haider1 Note to Elon fanbois: I own a 2023 Model S with FSD. I use it and find it useful. But it's far from Level 5. It's rated Level 2. Also, this: motherfrunker.ca/fsd/

English

1.2K

142.3K

Haider.@haider1·15 Mar

Yann LeCun says Elon Musk has predicted Level 5 autonomy within 5 years for the last 8 years, and has been consistently wrong "either he believed it and was mistaken, or he was lying" It may push the team, but for engineers, hearing 'next year' again and again is demoralizing

English

187

183

230.3K

fabio casati@sphoebs·12 Mar

@karpathy We als need new programming abstractions. A powerful one is the notion of statistical assertions. Properties that you’ve weakly expect to be true over your flow but not all the times.

English

Andrej Karpathy@karpathy·11 Mar

Expectation: the age of the IDE is over Reality: we’re going to need a bigger IDE (imo). It just looks very different because humans now move upwards and program at a higher level - the basic unit of interest is not one file but one agent. It’s still programming.

Andrej Karpathy@karpathy

@nummanali tmux grids are awesome, but i feel a need to have a proper "agent command center" IDE for teams of them, which I could maximize per monitor. E.g. I want to see/hide toggle them, see if any are idle, pop open related tools (e.g. terminal), stats (usage), etc.

English

826

833

10.5K

2.4M

fabio casati@sphoebs·24 Şub

@DanielMiessler “The best influencers”? What’s that category. The most competent or the most controversial or, like, Mr. Beast?

English

164

ᴅᴀɴɪᴇʟ ᴍɪᴇssʟᴇʀ 🛡️@DanielMiessler·24 Şub

x.com/i/article/2026…

ZXX

122

588

144.4K

fabio casati@sphoebs·23 Şub

I keep seeing posts on AI and SaaS. My (biased) take is that AI agents need SaaS way more than humans do. And this is from experience. Agents can consume knowledge like no human can. They can learn and iterate faster than humans. They can also go off track faster than humans—and at scale. They need a platform that maximizes their potential while keeping controls.

English

fabio casati@sphoebs·23 Şub

@Lol19559014 @JenniferSey That is not what he said. That is a redacted selection. It’s ok if you are trying to help him get more supporters, because this is what’s posts like this do.

English

Commonsensical Human@Lol19559014·23 Şub

@sphoebs @JenniferSey What do you mean that it's a "misrepresentation". It's a video clip of him showing him saying that. How is that "mispresentation"?

English

107

Jennifer Sey@JenniferSey·23 Şub

Gavin Newsom: “I’m just like you. I got a shitty SAT score and can’t read too good.” He said this to appear down with black folks. Could he be more racist? Like actually really racist?

English

324

892

6.8K

114.8K

fabio casati@sphoebs·23 Şub

@petergyang Air way better than pro. Minimum 15. 14 is frustrating.

English

Peter Yang@petergyang·22 Şub

I want to buy a new personal MacBook Pro - I’m still on M1 😅 Apparently Apple has both M5 and M6 coming out? Do you all prefer the 14 or 16 inch? Use case is vibe coding and video editing

English

340

623

181.2K

fabio casati@sphoebs·23 Şub

@_BMSimpson @curiosityonx It’s always 5pm where I am.

English

168

Brian Simpson-Author@_BMSimpson·23 Şub

@curiosityonx It's five o'clock somewhere.

English

219

42.4K

Curiosity@CuriosityonX·23 Şub

What would you do in this situation?

English

10.6K

1.2K

15.4K

14.9M

fabio casati@sphoebs·23 Şub

@Angaisb_ I don’t get all the people with “my agent takes 10 minutes”. Try the same simple ask to both opus and gpt. Difference is huge.

English

Angel 🌼@Angaisb_·22 Şub

How do Claude models get such good results while barely thinking at all? I've never seen Opus 4.6 or Sonnet 4.6 think for more than a few seconds, while every other model takes way longer

English

137

1.3K

152.7K

fabio casati@sphoebs·18 Şub

@IntCyberDigest Please do!! This idiotic healthcare privacy forced on people who don’t want it is preventing progress in medicine.

English

International Cyber Digest@IntCyberDigest·18 Şub

‼️ This is insane. Please don't listen to this. Don't upload your medical data into public LLMs, folks.

English

295

1.4K

5.8K

121.4K

fabio casati@sphoebs·18 Şub

@lugaricano @mustafasuleyman @Microsoft You are asking for the moon. Let’s set a lower goal, like automating many routine elderly care tasks with robots.

English

2.4K

Luis Garicano 🇪🇺🇺🇦@lugaricano·17 Şub

Rather than aiming to automate all white collar jobs I would be impressed if Mr @mustafasuleyman and @Microsoft would eliminate the bugs and annoyances of PPT, Word, Excel and Outlook. Seriously, anyone here thinks there is any chance this happens in 12-18 months?

English

7.1K

fabio casati@sphoebs·18 Şub

@travisakers Why. Do you need to hide your cholesterol from the CIA?

English

Travis Akers 🇺🇸@travisakers·18 Şub

Nobody, and I repeat, absolutely nobody should ever upload their medical information into an AI platform. I am telling you this as a former intelligence officer.

English

3.2K

25.9K

109.5K

3.7M

fabio casati@sphoebs·18 Şub

@operationdanish Why not?

English

Dr Danish@operationdanish·18 Şub

Nobody, and I repeat, absolutely nobody should ever upload their medical information into an AI platform. I am telling you this as a doctor.

Travis Akers 🇺🇸@travisakers

Nobody, and I repeat, absolutely nobody should ever upload their medical information into an AI platform. I am telling you this as a former intelligence officer.

English

1.9K

3.2K

12.2K

461.8K

fabio casati@sphoebs·18 Şub

Anybody remembers the 2025 NY humanoid robots show ? Aside from dancing, I hope companies work on robotic care for older adults. I’d trust it more than nursing homes

ChineseEmbassyManila@Chinaembmanila

At the 2025 Spring Festival Gala, 16 humanoid robots joined a traditional folk dance known for its sweeping steps and vibrant handkerchiefs. During the grand opening performance, humanoid robots demonstrated the "Thomas 360" stunt move. In a subsequent stage comedy, a group of AI-powered humanoid robots, along with a bionic robot, took on comedic roles. And in a separate martial arts performance, robots executed a series of high-difficulty movements.

English

fabio casati@sphoebs·16 Şub

@kmcannon ^^ this. This never happened before. At scale. Think about it.

English

270

Kevin Cannon@kmcannon·16 Şub

There are PhDs being handed out each day to people living in the past: the students, their advisors, their universities. Dissertations that took 5 years of work, and which 4.6 Opus could re-produce then improve on in an afternoon.

English

267

933

1.1M

fabio casati@sphoebs·12 Şub

@JoshDaws Very true. Nothing compares to it. Not even iPhone. Maybe the internet

English

Josh Daws@JoshDaws·11 Şub

I'm 48 and have worked as a software engineer for nearly 30 years. I've grown numb to the Silicon Valley hype machine. My default posture is "meh, we'll see." What I've seen and experienced firsthand in the past two months is not hype. Ignore it at your peril.

English

227

289

5.1K

1.3M

fabio casati@sphoebs·9 Şub

Imagine how fun for people preparing presentations on red teaming AI ))

English

fabio casati@sphoebs·9 Şub

@tomfgoodwin When people write “it’s so over” it’s a sign of a user to be blocked

English

1.3K

Tom Goodwin@tomfgoodwin·8 Şub

The work was never the slides. The work was literally everything else.

0xMarioNawfal@RoundtableSpace

Claude makes a powerpoint presentation in less than 5 seconds It’s so over

English

2.5K

147.5K

fabio casati@sphoebs·5 Şub

@carlodreyer @ylecun @SebastienBubeck Not really.

English

Carlìto@carlodreyer·5 Şub

@ylecun @sphoebs @SebastienBubeck The bazaar model enables faster iteration and community validation. Open collaboration consistently outpaces closed development.

English

Sebastien Bubeck@SebastienBubeck·4 Şub

I've been in lots of places in my career. OAI is simply the best research environment I have ever seen. It's a combination of the field itself being a research gold mine + having access to the right mining tools + (most importantly) the freedom to explore. It's special.

Mark Chen@markchen90

How does OpenAI balance long-term research bets with product-forward research fundamentals? I’ve been getting this question a lot lately, usually framed as a suggestion that Jakub (@merettm) and I are pushing an increasingly product-focused agenda. That characterization is simply wrong. Foundational research has been core to OpenAI from the start, and today we run a research program with hundreds of exploratory projects - much like the ones that led to our reasoning-model breakthrough. The majority of our compute is allocated to foundational research and exploration - and not product milestones. Anyone who has spent time with me or Jakub knows we are the last people in the world who would push for the advancement of products over the advancement of research. We’re in the business of creating an automated scientist, and capabilities that were considered grand challenges just a few years ago (like IMO-level mathematical reasoning) now emerge as normal parts of the research process. We’re also seeing our models accelerate researchers worldwide, helping advance work across biology, mathematics, physics, and even our own research. Jakub and I put a lot of effort into ensuring that research stays focused on uncovering algorithms that will scale to the compute we’ll have a year from now. We protect mindshare and amplify discourse on exploratory work. We do this while recognizing that we’re also a deployment company - and that deployment gives us access to even larger-scale compute, richer feedback, and more room for exploration. Our researchers are passionate about having their work out in the world, and a special slice of our org is dedicated to making sure our deployments are delightful for end users. Our goal isn’t to turn research into a quarterly race. It’s to build a durable research engine - one that compounds learning over time and consistently turns long-horizon exploration into real, measurable advances, while ensuring those advances become valuable in the real world. That’s the roadmap we’re executing on. And while there have been ups and downs over the last decade (as you expect with any research program), I think most of our researchers would share my strong optimism today.

English

566

157.4K

ค้นพบ

@nic_amadio @ylecun @haider1 @karpathy @DanielMiessler @Lol19559014 @JenniferSey @petergyang