Patrick Samy

3.6K posts

Patrick Samy banner
Patrick Samy

Patrick Samy

@patricksamy

Co-founder & CEO @joinhale (backed by @menloventures). Prev CEO @span_health (acquired by @eightsleep), @microsoft, @stanford. Tennis player. Zenonian 🇫🇷

states Katılım Ocak 2012
3.7K Takip Edilen3.6K Takipçiler
Patrick Samy
Patrick Samy@patricksamy·
Dad Mode shares a lot of similar skills with Founder Mode.
English
0
0
1
60
TJ Parker⚡️
TJ Parker⚡️@tjparker·
I think my many-thousand-year-old tree, which was transplanted into the courtyard at my house 18 months ago, is finally starting to push out new growth. And boy, does that make me happy.
TJ Parker⚡️ tweet media
English
46
2
361
26.1K
Patrick Samy
Patrick Samy@patricksamy·
There's only a handful of products you can design that will change millions of lives for the better. This is one of them. Whether you work on a consumer product in big tech or startups, if you have exceptional attention to detail and live to design unique experiences and category-defining products, we want to have a chat! Location: Europe or US East Coast
Patrick Samy@patricksamy

I’m hiring a Founding Designer to build the future of preventive health at Hale. We're a small, low ego, high ambition team, building a company where smart generalists do the most important work of their careers and have fun doing it.

English
4
0
12
2.2K
Hale Prevention
Hale Prevention@HalePrevention·
Get your Calcium Score today. Best predictor of heart disease. - $249. No referral needed. - 10 minutes per scan. - Same day appointments. - Live across all 50 states.
English
10
6
83
331.8K
Patrick Samy
Patrick Samy@patricksamy·
@sama I do that a lot. Also to chat with past context and remember very specific details and decisions.
English
0
0
0
67
Sam Altman
Sam Altman@sama·
people are really starting to use voice to interact with AI, especially when they have a lot of context to dump. GPT-Realtime-2 comes to the API today; it is a pretty big step forward. (we are working on improvements to voice in chat.)
English
875
289
7.1K
484.3K
Patrick Samy
Patrick Samy@patricksamy·
I’m hiring a Founding Designer to build the future of preventive health at Hale. We're a small, low ego, high ambition team, building a company where smart generalists do the most important work of their careers and have fun doing it.
English
40
3
192
19.4K
Maddy
Maddy@ammaddyaseen·
@patricksamy Hey! I’m a designer and really like what you’re building around preventive health at Hale. I’d be interested to learn more about the Founding Designer role. Is the role remote onsite or hybrid? Your DMs seem closed on my side can you open them?
English
1
0
0
432
Patrick Samy
Patrick Samy@patricksamy·
If this sounds like you, DM me or email patrick@joinhale.com
English
5
0
13
1.1K
Patrick Samy
Patrick Samy@patricksamy·
🤖 You love contributing code directly with AI when needed 😍 You pay attention to details and care about delivering world-class UX 🦄 Your past teammates and managers say great things about you 🚀 Location: ideally UK, open to EU and US East Coast.
English
2
0
13
1.3K
Patrick Samy
Patrick Samy@patricksamy·
Vibeware is the new software.
English
2
0
1
293
Hale Prevention
Hale Prevention@HalePrevention·
Get your Calcium Score today. Best predictor of heart disease. - $249. No referral needed. - 15 minutes per scan. - Same day appointments. - Live across all 50 states.
English
2
21
113
384.9K
zehra ✨
zehra ✨@zehranaqvi·
The internet curated for the obsessed. Welcome to Lore.
English
384
151
1.5K
251.4K
Patrick Samy
Patrick Samy@patricksamy·
A subscription product with no tangible recurring value is destined to fail. Consumers are also pretty smart in their perception of tangibility.
English
1
0
1
172
Patrick Samy retweetledi
Aakash Gupta
Aakash Gupta@aakashgupta·
Ilya said the quiet part out loud on Dwarkesh's pod, but most people still aren't processing what it means. Here's what's actually happening inside AI labs. Research teams have entire divisions that do nothing but create new RL training environments specifically designed to boost benchmark scores. They treat AIME, SWE-bench, and MMLU like standardized tests. The model practices 10,000 hours on competitive programming problems until every proof technique is at its fingertips. Then it fails to fix a simple bug in production without introducing two new ones. Sutskever used the perfect analogy. Student A grinds 10,000 hours of competitive programming. Memorizes every algorithm, every edge case, every proof technique. Becomes the #1 ranked competitive coder in the world. Student B practices 100 hours but has "it." Intuition. Taste. The ability to learn new things quickly. Who has the better career? Student B. Current AI models are all Student A. The benchmark gaming runs deeper than most realize. Studies have shown data contamination inflates model scores by 20-80% on popular benchmarks. The training-test boundary is porous. Models memorize answers rather than learn concepts. And when you control for contamination, much of what looks like intelligence is pattern-matching on seen data. This explains the economic puzzle Ilya pointed to. Models score 100% on AIME 2025. They hit 70%+ on GDPval beating human professionals. Yet businesses still struggle to extract value. The benchmark performance says genius. The P&L says otherwise. The sample efficiency gap tells you everything. A human teenager learns to drive any car after 10 hours. An AI model might need millions of examples and still fail on slight variations. A human learns a concept once and applies it everywhere. Models need to see the exact pattern thousands of times and still choke when the formatting changes slightly. Sutskever's diagnosis: we're moving from the "age of scaling" (2020-2025) back to the "age of research." The belief that 100x more compute would transform everything is dying. His $3B company SSI is betting that the next breakthrough comes from solving generalization, not stacking more GPUs. The labs know this. That's why the benchmark arms race is accelerating. It's easier to show impressive numbers than admit the fundamental approach might be plateauing.
Nek@Enscion25

Ilya is 100% correct .it's a pattern that keeps repeating It's very clear with GPT5.2 Overfit the model to produce impressive looking benchmarks, have it excels in a few domains, but fall flat in many others. There's not enough generalization, and even if there is, the model has been so heavily reinforced that it becomes buried .

English
93
297
2.1K
322.7K