fabio casati

459 posts

fabio casati

fabio casati

@sphoebs

father of two

Palo Alto, CA เข้าร่วม Mart 2009
140 กำลังติดตาม249 ผู้ติดตาม
ทวีตที่ปักหมุด
fabio casati
fabio casati@sphoebs·
Enterprises are doing AI eval wrong - and it's causing wasted iteration cycles and wrong decision making. I’m preparing my spring semester course on designing large-scale AI systems and I need feedback: What’s the one change you’d make to eval practice + reporting to make it reliable? What's the error you see made often?
fabio casati tweet media
English
1
5
14
82.2K
fabio casati
fabio casati@sphoebs·
@nic_amadio This is very true. Italian engineers are massively underpaid. Companies that hire and pay well do wonders in Italy - and everywhere.
English
1
0
2
282
Nicola Amadio
Nicola Amadio@nic_amadio·
Italy, with €53k average yearly comp, is the 2nd lowest paid country in Europe. Data from Euro Top Tech salary sharing doc: 1k+ contributions, 90 from Italy. Sad. I got a 80k CHF salary in 2018 at 24, in Zurich. Then, a few 100-150k salaries, and, at 28, a $200k offer. Most of my engineer friends in Italy, meanwhile, are making ~2k euros a month. And, while I can't be 100% sure, I don't think they're worse engineers. They went to the best engineering school in Italy, top grades, hard-working, bright. The fact is: there's a HUGE pool of GROSSLY UNDERPAID talent in Italy right now. I had a few clients from Italy, in my coaching program. A few examples: • One went from 40k Onsite Italy to 100k Euro-remote • Another from 35k Onsite Italy to 160k Worldwide-remote • Another from 70k Remote Italy to 200k Onsite Zurich, 300k Onsite London You see how underpaid they were. Not every engineer in Italy invests in a coaching program, or spends the time to find the best tech jobs remotely and abroad. But many of them follow my profile on LinkedIn and are subscribed to The European Engineer newsletter. My latest product - Forfettariato.com - allows you to put your jobs in front of them. A $50-100k remote job will get you an engineer that you'll pay 200k in London or Zurich. I care about the cause, and I made the entry price super low: just 50 euro per month per job ad, for the first 20 customers. If you're hiring tech talent, this is your chance.
Nicola Amadio tweet media
English
21
22
217
39.1K
Yann LeCun
Yann LeCun@ylecun·
@haider1 Note to Elon fanbois: I own a 2023 Model S with FSD. I use it and find it useful. But it's far from Level 5. It's rated Level 2. Also, this: motherfrunker.ca/fsd/
English
59
27
1.2K
142.3K
Haider.
Haider.@haider1·
Yann LeCun says Elon Musk has predicted Level 5 autonomy within 5 years for the last 8 years, and has been consistently wrong "either he believed it and was mistaken, or he was lying" It may push the team, but for engineers, hearing 'next year' again and again is demoralizing
English
187
183
2K
230.3K
fabio casati
fabio casati@sphoebs·
@karpathy We als need new programming abstractions. A powerful one is the notion of statistical assertions. Properties that you’ve weakly expect to be true over your flow but not all the times.
English
0
0
0
42
Andrej Karpathy
Andrej Karpathy@karpathy·
Expectation: the age of the IDE is over Reality: we’re going to need a bigger IDE (imo). It just looks very different because humans now move upwards and program at a higher level - the basic unit of interest is not one file but one agent. It’s still programming.
Andrej Karpathy@karpathy

@nummanali tmux grids are awesome, but i feel a need to have a proper "agent command center" IDE for teams of them, which I could maximize per monitor. E.g. I want to see/hide toggle them, see if any are idle, pop open related tools (e.g. terminal), stats (usage), etc.

English
826
833
10.5K
2.4M
fabio casati
fabio casati@sphoebs·
@DanielMiessler “The best influencers”? What’s that category. The most competent or the most controversial or, like, Mr. Beast?
English
0
0
2
164
fabio casati
fabio casati@sphoebs·
I keep seeing posts on AI and SaaS. My (biased) take is that AI agents need SaaS way more than humans do. And this is from experience. Agents can consume knowledge like no human can. They can learn and iterate faster than humans. They can also go off track faster than humans—and at scale. They need a platform that maximizes their potential while keeping controls.
English
0
0
0
37
fabio casati
fabio casati@sphoebs·
@Lol19559014 @JenniferSey That is not what he said. That is a redacted selection. It’s ok if you are trying to help him get more supporters, because this is what’s posts like this do.
English
1
0
0
66
Commonsensical Human
Commonsensical Human@Lol19559014·
@sphoebs @JenniferSey What do you mean that it's a "misrepresentation". It's a video clip of him showing him saying that. How is that "mispresentation"?
English
1
0
5
107
Jennifer Sey
Jennifer Sey@JenniferSey·
Gavin Newsom: “I’m just like you. I got a shitty SAT score and can’t read too good.” He said this to appear down with black folks. Could he be more racist? Like actually really racist?
English
324
892
6.8K
114.8K
Peter Yang
Peter Yang@petergyang·
I want to buy a new personal MacBook Pro - I’m still on M1 😅 Apparently Apple has both M5 and M6 coming out? Do you all prefer the 14 or 16 inch? Use case is vibe coding and video editing
English
340
9
623
181.2K
Curiosity
Curiosity@CuriosityonX·
What would you do in this situation?
English
10.6K
1.2K
15.4K
14.9M
fabio casati
fabio casati@sphoebs·
@Angaisb_ I don’t get all the people with “my agent takes 10 minutes”. Try the same simple ask to both opus and gpt. Difference is huge.
English
0
0
0
20
Angel 🌼
Angel 🌼@Angaisb_·
How do Claude models get such good results while barely thinking at all? I've never seen Opus 4.6 or Sonnet 4.6 think for more than a few seconds, while every other model takes way longer
English
137
13
1.3K
152.7K
fabio casati
fabio casati@sphoebs·
@IntCyberDigest Please do!! This idiotic healthcare privacy forced on people who don’t want it is preventing progress in medicine.
English
0
0
0
10
International Cyber Digest
International Cyber Digest@IntCyberDigest·
‼️ This is insane. Please don't listen to this. Don't upload your medical data into public LLMs, folks.
International Cyber Digest tweet media
English
295
1.4K
5.8K
121.4K
Luis Garicano 🇪🇺🇺🇦
Luis Garicano 🇪🇺🇺🇦@lugaricano·
Rather than aiming to automate all white collar jobs I would be impressed if Mr @mustafasuleyman and @Microsoft would eliminate the bugs and annoyances of PPT, Word, Excel and Outlook. Seriously, anyone here thinks there is any chance this happens in 12-18 months?
Luis Garicano 🇪🇺🇺🇦 tweet media
English
10
5
80
7.1K
Travis Akers 🇺🇸
Travis Akers 🇺🇸@travisakers·
Nobody, and I repeat, absolutely nobody should ever upload their medical information into an AI platform. I am telling you this as a former intelligence officer.
Travis Akers 🇺🇸 tweet media
English
3.2K
25.9K
109.5K
3.7M
fabio casati
fabio casati@sphoebs·
@kmcannon ^^ this. This never happened before. At scale. Think about it.
English
0
0
0
270
Kevin Cannon
Kevin Cannon@kmcannon·
There are PhDs being handed out each day to people living in the past: the students, their advisors, their universities. Dissertations that took 5 years of work, and which 4.6 Opus could re-produce then improve on in an afternoon.
English
267
63
933
1.1M
fabio casati
fabio casati@sphoebs·
@JoshDaws Very true. Nothing compares to it. Not even iPhone. Maybe the internet
English
0
0
0
34
Josh Daws
Josh Daws@JoshDaws·
I'm 48 and have worked as a software engineer for nearly 30 years. I've grown numb to the Silicon Valley hype machine. My default posture is "meh, we'll see." What I've seen and experienced firsthand in the past two months is not hype. Ignore it at your peril.
English
227
289
5.1K
1.3M
fabio casati
fabio casati@sphoebs·
Imagine how fun for people preparing presentations on red teaming AI ))
fabio casati tweet media
English
0
0
0
39
fabio casati
fabio casati@sphoebs·
@tomfgoodwin When people write “it’s so over” it’s a sign of a user to be blocked
English
0
0
4
1.3K
Carlìto
Carlìto@carlodreyer·
@ylecun @sphoebs @SebastienBubeck The bazaar model enables faster iteration and community validation. Open collaboration consistently outpaces closed development.
English
1
0
0
25
Sebastien Bubeck
Sebastien Bubeck@SebastienBubeck·
I've been in lots of places in my career. OAI is simply the best research environment I have ever seen. It's a combination of the field itself being a research gold mine + having access to the right mining tools + (most importantly) the freedom to explore. It's special.
Mark Chen@markchen90

How does OpenAI balance long-term research bets with product-forward research fundamentals? I’ve been getting this question a lot lately, usually framed as a suggestion that Jakub (@merettm) and I are pushing an increasingly product-focused agenda. That characterization is simply wrong. Foundational research has been core to OpenAI from the start, and today we run a research program with hundreds of exploratory projects - much like the ones that led to our reasoning-model breakthrough. The majority of our compute is allocated to foundational research and exploration - and not product milestones. Anyone who has spent time with me or Jakub knows we are the last people in the world who would push for the advancement of products over the advancement of research. We’re in the business of creating an automated scientist, and capabilities that were considered grand challenges just a few years ago (like IMO-level mathematical reasoning) now emerge as normal parts of the research process. We’re also seeing our models accelerate researchers worldwide, helping advance work across biology, mathematics, physics, and even our own research. Jakub and I put a lot of effort into ensuring that research stays focused on uncovering algorithms that will scale to the compute we’ll have a year from now. We protect mindshare and amplify discourse on exploratory work. We do this while recognizing that we’re also a deployment company - and that deployment gives us access to even larger-scale compute, richer feedback, and more room for exploration. Our researchers are passionate about having their work out in the world, and a special slice of our org is dedicated to making sure our deployments are delightful for end users. Our goal isn’t to turn research into a quarterly race. It’s to build a durable research engine - one that compounds learning over time and consistently turns long-horizon exploration into real, measurable advances, while ensuring those advances become valuable in the real world. That’s the roadmap we’re executing on. And while there have been ups and downs over the last decade (as you expect with any research program), I think most of our researchers would share my strong optimism today.

English
34
21
566
157.4K