Yuting D.

119 posts

Yuting D. banner
Yuting D.

Yuting D.

@maybeTuring

Building @SantoriLabs. Ex-GM@scale_AI, ex-@Google Sustainable founding, Human-ai-interaction

San Francisco, CA Inscrit le Nisan 2013
148 Abonnements101 Abonnés
Yuting D.
Yuting D.@maybeTuring·
@chongz I really do think it's because it's trained on real coding , not just stupid benchmarks that none of us do day to day. 🤷‍♀️ Too many PhD experts for the other models
English
0
0
2
110
Timothy Luong (Chongz)
Timothy Luong (Chongz)@chongz·
I’ll save you guys some time, none of these models are better at coding than Claude. Not a fucking clue why, Dario must have traded his ability to make eye contact to the coding devil.
English
7
0
49
3.9K
Yuting D.
Yuting D.@maybeTuring·
👇 just as good a day as any
Yuting D. tweet media
English
0
0
0
52
Yuting D.
Yuting D.@maybeTuring·
@AnthropicAI Known bug? Are we paying for Claude 4.0 but getting Claude 3.5 instead?
Yuting D. tweet media
English
0
0
0
87
Yuting D.
Yuting D.@maybeTuring·
I meant, why not just build it directly? or better , skip all the middle-layer and give me the money?
Yuting D. tweet media
English
0
0
1
98
Yuting D.
Yuting D.@maybeTuring·
Wow I'm surprised but I understand. Surprised that they didn't get acquired by Openai or any of those AI players. Atlassian seems to be in their last round. I understand because paradigm shifts tend to send everyone back to square one, which I wrote about this back in May. There will be others. More later stage ones in particular. I'm glad to hear that they can really focus on just building out @diabrowser
Yuting D. tweet media
Josh Miller@joshm

The @browsercompany just signed a merger agreement to be acquired. We will remain independent. Our focus is Dia. I’ve written and rewritten this post more times than I’d like to admit, but what I keep coming back to is simple: the work continues, and we’re grateful for this moment. The work continues because when I stop by the coffee shop near our office, nobody is using Dia yet. Our “internet computer” vision hasn’t been realized. Dia hasn’t yet changed how you work on a Tuesday morning. This deal is about giving us the resources, distribution, and monetization muscle to get there. At the same time, it feels disingenuous not to pause and briefly celebrate this milestone. It reflects our team’s craftsmanship and relentlessness, the support of our coaches, board members, and advisors, and the incredible effort from our deal team: Ryan Purcell from Gunderson, Nancy Peretsman and Leah Schwartz from Allen & Co., and Clare, Abby, Eissra, Rebecca, Cory, Nash, and Hursh from The Browser Company. Most of all, we’re grateful for what this means for Dia. It means we can hire faster, ship faster, and bring Dia to more people. We can now invest in cross-platform support and secure syncing, train custom AI models designed specifically for Dia, and turn ambitious ideas about “computer use” and “memory” into reality. To everyone who’s filed a bug, sent feedback, or shared a kind word: thank you. We haven’t always gotten it right, but we’ve always cared deeply. That will never change. Dia isn’t going anywhere. We’ll be here for the long haul, with the same team just a new partner helping us push further. We’ll take a breath this weekend, and then get back to work. Big launch next month. In the meantime...

English
1
0
2
346
Greg Kamradt
Greg Kamradt@GregKamradt·
Just tried ChatGPT agent on @arcprize ARC-AGI-3 > Told it to play a game > Couldn't figure out what to do > Agent did a web search, "how to beat X arc-agi-3 game" > It didn't find answers > I told it to try clicking red/blue blocks > It clicked them, noticed something happened, kept clicking > Nudged more > Couldn't figure it out > Searched again Then I cut it off btw agent is a very cool tool
English
14
13
154
15.4K
Yuting D.
Yuting D.@maybeTuring·
@levelsio It is not a tech problem. It is an org problem. Transformers were born out of that group and Google is really bad at pushing research efforts into real products without an army of directors and VPs getting in the way
English
0
0
0
11
@levelsio
@levelsio@levelsio·
I didn't know how bad Google Translate was until I started learning Portuguese It consistently makes really bad mistakes and doesn't consider context Which is crazy cause if you just ask any LLM to translate it, it's flawless Why doesn't Google Translate use AI to translate? In this case it translates "depois" to "then" when it should be "later"
@levelsio tweet media@levelsio tweet media@levelsio tweet media
English
159
4
423
77.1K
Yuting D.
Yuting D.@maybeTuring·
RL is just getting started, but higher order thinking—the why behind human actions—are the biggest data gap towards more autonomous agents. Before Google Search: you dig through categories (think Yahoo directories, library catalogs), matching your need to rigid buckets—not your real intent. After Google Search: you could ask in freeform, but systems still just see your keywords, not the real goal (“pottery artist near me” hides “birthday gift for mom”). The core reasoning stays invisible. What we need now is human reasoning data: not just actions, but the why behind them. People aren’t trained to make this explicit, so AI keeps learning from the surface layer. This is why we started @santorilabs but we are not interested in selling these data to labs. Gonna start to write abt our thesis on this.
Brendan (can/do)@BrendanFoody

Mercor (@mercor_ai) is now working with 6 out of the Magnificent 7, all of the top 5 AI labs, and most of the top application layer companies. One trend is common across every customer: we are entering The Era of Evals. RL is becoming so effective that models will be able to saturate any evaluation. This means that the primary barrier to applying agents to the entire economy is building evals for everything. This will be one of the largest buildouts we have ever seen with enterprises pouring hundreds of billions of dollars into evals for every workflow we want agents to automate. We're quickly defining a new class of work and hiring across nearly every domain: software engineers, consultants, bankers, lawyer, doctors, gamers, and many more.

English
0
0
1
215
Yuting D.
Yuting D.@maybeTuring·
Let's not forget that @cursor_ai didn't spend a single dime on marketing until sometime this year. Good product still works
English
0
0
1
88
Yuting D.
Yuting D.@maybeTuring·
@joshm @zoink Kinda The ones who can design + build (used to be a hard requirement for design at Quora). Roles are blurring imo, and one needs to be better than pure vibe coding with AI.
English
1
0
0
15
Dylan Field
Dylan Field@zoink·
Companies are starting to fully understand that design and craft is the differentiator. Designer talent war is just the start.
English
77
164
1.8K
191.7K
Yuting D.
Yuting D.@maybeTuring·
It's def coding then and probably still is (this is not scale specific just general landscape). Competitive programming was the main one for a while. Now there is a lot more focus on real engineering use cases. Multimodal data is another one and still exploding: audio is still huge demand
English
0
0
0
87
andre --dangerously-skip-permissions
@maybeTuring what kind of data was most in demand when you were running that unit? was it mostly just coding or is there expansion into other domains as well? what was the source of that data? super curious to learn more! 😀
English
1
0
0
190
Yuting D.
Yuting D.@maybeTuring·
Scale ai 101 (my wall is full of wrong info): 1. Not Philippines. The game has already changed to PhDs, competitive programmers, a k.a. extremely expensive domain experts. 2. Ask your VC friends which company they think has the best founders. Scale is likely in the top 3 3. The era of experience still requires humans. As we stand today, paying people to give you feedback is 1000% more effective than your vanilla users Ama for the next 4hrs, I used to run that biz unit. And ofc I'm biased, but it doesn't make them not true.
English
3
1
16
68.7K
Guillermo Rauch
Guillermo Rauch@rauchg·
This is v cool. Scouts by @yutori_ai is like "AI-native Google Alerts". It can monitor the web and process it with a prompt. The as-you-type form UX is 🔥 too.
English
13
36
771
74K
Yuting D.
Yuting D.@maybeTuring·
1. You still need to run data collection/eval as a huge ops - the hottest verticals are not open domains or at least you need closed efforts to get ahead. 2. Every ai startup and their neighbor is essential building a labeling/feedback platform. With agents the feedback platform is still human in the loop (human puts signals back into the env) 3. aw is not your average founder and the talent density is crazy there still. (Disclaimer: I used to run that biz unit at scale)
English
0
1
3
1.2K
Greg Kamradt
Greg Kamradt@GregKamradt·
The bet on human data is bold, if @RichardSSutton's era of experience comes true, then agents/AI will go get their own data You don't need a Scale AI's worth of data to train a human - this bet from Meta looks like a bet *against* self-discovering agents Unless Scale is cooking up something new What am I not seeing here?
will brown@willccbb

spending everything on Alexandr Wang

English
6
2
27
7.4K
dalibali
dalibali@dalibali2·
ScaleAI is the highest price/filipino ever paid in history
English
21
30
1.2K
136K
Yuting D.
Yuting D.@maybeTuring·
I'm already speculating about the next pair. Time to get on polymarket
zain@zainbacchus

@pitdesi a new challenger has appeared

English
0
0
1
581
Yuting D.
Yuting D.@maybeTuring·
Stuck on a long-haul flight for today and the wifi is too bad for real work but not for X. Lucky girl indeed
English
0
0
2
114