Yuting D.

119 posts

Yuting D.

@maybeTuring

Building @SantoriLabs. Ex-GM@scale_AI, ex-@Google Sustainable founding, Human-ai-interaction

San Francisco, CA Inscrit le Nisan 2013

148 Abonnements101 Abonnés

Yuting D.@maybeTuring·20 Kas

@chongz I really do think it's because it's trained on real coding , not just stupid benchmarks that none of us do day to day. 🤷‍♀️ Too many PhD experts for the other models

English

110

Timothy Luong (Chongz)@chongz·19 Kas

I’ll save you guys some time, none of these models are better at coding than Claude. Not a fucking clue why, Dario must have traded his ability to make eye contact to the coding devil.

English

3.9K

Yuting D.@maybeTuring·1 Eki

👇 just as good a day as any

English

Yuting D.@maybeTuring·25 Eyl

@AnthropicAI Known bug? Are we paying for Claude 4.0 but getting Claude 3.5 instead?

English

Yuting D.@maybeTuring·21 Eyl

I meant, why not just build it directly? or better , skip all the middle-layer and give me the money?

English

Yuting D.@maybeTuring·4 Eyl

Wrote this reflection back in May in building products during paradigm shifts I continue to root for good products with amazing tastes open.substack.com/pub/thoughtsin…

English

Yuting D.@maybeTuring·4 Eyl

Wow I'm surprised but I understand. Surprised that they didn't get acquired by Openai or any of those AI players. Atlassian seems to be in their last round. I understand because paradigm shifts tend to send everyone back to square one, which I wrote about this back in May. There will be others. More later stage ones in particular. I'm glad to hear that they can really focus on just building out @diabrowser

Josh Miller@joshm

The @browsercompany just signed a merger agreement to be acquired. We will remain independent. Our focus is Dia. I’ve written and rewritten this post more times than I’d like to admit, but what I keep coming back to is simple: the work continues, and we’re grateful for this moment. The work continues because when I stop by the coffee shop near our office, nobody is using Dia yet. Our “internet computer” vision hasn’t been realized. Dia hasn’t yet changed how you work on a Tuesday morning. This deal is about giving us the resources, distribution, and monetization muscle to get there. At the same time, it feels disingenuous not to pause and briefly celebrate this milestone. It reflects our team’s craftsmanship and relentlessness, the support of our coaches, board members, and advisors, and the incredible effort from our deal team: Ryan Purcell from Gunderson, Nancy Peretsman and Leah Schwartz from Allen & Co., and Clare, Abby, Eissra, Rebecca, Cory, Nash, and Hursh from The Browser Company. Most of all, we’re grateful for what this means for Dia. It means we can hire faster, ship faster, and bring Dia to more people. We can now invest in cross-platform support and secure syncing, train custom AI models designed specifically for Dia, and turn ambitious ideas about “computer use” and “memory” into reality. To everyone who’s filed a bug, sent feedback, or shared a kind word: thank you. We haven’t always gotten it right, but we’ve always cared deeply. That will never change. Dia isn’t going anywhere. We’ll be here for the long haul, with the same team just a new partner helping us push further. We’ll take a breath this weekend, and then get back to work. Big launch next month. In the meantime...

English

346

Yuting D.@maybeTuring·19 Tem

@GregKamradt @arcprize "how to beat X arc-agi-3 game" 😂 Gotta say, very human like.

English

533

Greg Kamradt@GregKamradt·18 Tem

Just tried ChatGPT agent on @arcprize ARC-AGI-3 > Told it to play a game > Couldn't figure out what to do > Agent did a web search, "how to beat X arc-agi-3 game" > It didn't find answers > I told it to try clicking red/blue blocks > It clicked them, noticed something happened, kept clicking > Nudged more > Couldn't figure it out > Searched again Then I cut it off btw agent is a very cool tool

English

154

15.4K

Yuting D.@maybeTuring·16 Tem

@levelsio It is not a tech problem. It is an org problem. Transformers were born out of that group and Google is really bad at pushing research efforts into real products without an army of directors and VPs getting in the way

English

@levelsio@levelsio·16 Tem

I didn't know how bad Google Translate was until I started learning Portuguese It consistently makes really bad mistakes and doesn't consider context Which is crazy cause if you just ask any LLM to translate it, it's flawless Why doesn't Google Translate use AI to translate? In this case it translates "depois" to "then" when it should be "later"

English

159

423

77.1K

Yuting D.@maybeTuring·15 Tem

"We’re doing so in a way that treats the team with the value and respect that they deserve." cognition's employer brand just 10x'ed. Love this for the @cognition_labs team and @windsurf_ai team.

Russell Kaplan@russelljkaplan

Seeing lots of questions like: wait, I thought Windsurf was already acquired? What is Cognition buying? Let me explain. Windsurf the company is an *extraordinary* asset. It was missing its founders and research team, but it has a beloved product, valuable IP, an incredible business ($82M ARR with enterprise growth doubling quarter-over-quarter), known brand, and most importantly: a world-class team in every function—GTM, enterprise engineering, and much more. With today’s news, we’re adding all that firepower to Cognition to deliver the most complete AI coding solution in the market. And we’re doing so in a way that treats the team with the value and respect that they deserve. And here’s what’s also ours: - all improvements we build on top of Windsurf’s IP from here - all Windsurf training data - all Windsurf trademark and brand assets The meme over the weekend was “Is Windsurf now an empty shell?” The opposite is true, and we’re going to be even stronger together. Today is a huge win for Windsurf and Devin customers everywhere.

English

198

Yuting D.@maybeTuring·1 Tem

RL is just getting started, but higher order thinking—the why behind human actions—are the biggest data gap towards more autonomous agents. Before Google Search: you dig through categories (think Yahoo directories, library catalogs), matching your need to rigid buckets—not your real intent. After Google Search: you could ask in freeform, but systems still just see your keywords, not the real goal (“pottery artist near me” hides “birthday gift for mom”). The core reasoning stays invisible. What we need now is human reasoning data: not just actions, but the why behind them. People aren’t trained to make this explicit, so AI keeps learning from the surface layer. This is why we started @santorilabs but we are not interested in selling these data to labs. Gonna start to write abt our thesis on this.

Brendan (can/do)@BrendanFoody

Mercor (@mercor_ai) is now working with 6 out of the Magnificent 7, all of the top 5 AI labs, and most of the top application layer companies. One trend is common across every customer: we are entering The Era of Evals. RL is becoming so effective that models will be able to saturate any evaluation. This means that the primary barrier to applying agents to the entire economy is building evals for everything. This will be one of the largest buildouts we have ever seen with enterprises pouring hundreds of billions of dollars into evals for every workflow we want agents to automate. We're quickly defining a new class of work and hiring across nearly every domain: software engineers, consultants, bankers, lawyer, doctors, gamers, and many more.

English

215

Yuting D.@maybeTuring·27 Haz

Let's not forget that @cursor_ai didn't spend a single dime on marketing until sometime this year. Good product still works

English

Yuting D.@maybeTuring·27 Haz

@joshm @zoink Kinda The ones who can design + build (used to be a hard requirement for design at Quora). Roles are blurring imo, and one needs to be better than pure vibe coding with AI.

English

Josh Miller@joshm·26 Haz

@zoink 💯

QME

1.9K

Dylan Field@zoink·26 Haz

Companies are starting to fully understand that design and craft is the differentiator. Designer talent war is just the start.

English

164

1.8K

191.7K

Yuting D.@maybeTuring·13 Haz

Truly the end of an era. Congrats @alexandr_wang on your next chapter! And thank you for bringing together such an amazing group of people to run through walls together.

Alexandr Wang@alexandr_wang

My note to Scale employees today—

English

351

Yuting D.@maybeTuring·13 Haz

It's def coding then and probably still is (this is not scale specific just general landscape). Competitive programming was the main one for a while. Now there is a lot more focus on real engineering use cases. Multimodal data is another one and still exploding: audio is still huge demand

English

andre --dangerously-skip-permissions@andrezfu·12 Haz

@maybeTuring what kind of data was most in demand when you were running that unit? was it mostly just coding or is there expansion into other domains as well? what was the source of that data? super curious to learn more! 😀

English

190

Yuting D.@maybeTuring·11 Haz

Scale ai 101 (my wall is full of wrong info): 1. Not Philippines. The game has already changed to PhDs, competitive programmers, a k.a. extremely expensive domain experts. 2. Ask your VC friends which company they think has the best founders. Scale is likely in the top 3 3. The era of experience still requires humans. As we stand today, paying people to give you feedback is 1000% more effective than your vanilla users Ama for the next 4hrs, I used to run that biz unit. And ofc I'm biased, but it doesn't make them not true.

English

68.7K

Yuting D.@maybeTuring·11 Haz

@rauchg @yutori_ai Insanely fast, is the speed real?

English

352

Guillermo Rauch@rauchg·11 Haz

This is v cool. Scouts by @yutori_ai is like "AI-native Google Alerts". It can monitor the web and process it with a prompt. The as-you-type form UX is 🔥 too.

English

771

74K

Yuting D.@maybeTuring·11 Haz

1. You still need to run data collection/eval as a huge ops - the hottest verticals are not open domains or at least you need closed efforts to get ahead. 2. Every ai startup and their neighbor is essential building a labeling/feedback platform. With agents the feedback platform is still human in the loop (human puts signals back into the env) 3. aw is not your average founder and the talent density is crazy there still. (Disclaimer: I used to run that biz unit at scale)

English

1.2K

Greg Kamradt@GregKamradt·10 Haz

The bet on human data is bold, if @RichardSSutton's era of experience comes true, then agents/AI will go get their own data You don't need a Scale AI's worth of data to train a human - this bet from Meta looks like a bet *against* self-discovering agents Unless Scale is cooking up something new What am I not seeing here?

will brown@willccbb

spending everything on Alexandr Wang

English

7.4K

Yuting D.@maybeTuring·11 Haz

@dalibali2 s/Filipino/Phds

1.5K

dalibali@dalibali2·11 Haz

ScaleAI is the highest price/filipino ever paid in history

English

1.2K

136K

Yuting D.@maybeTuring·11 Haz

I'm already speculating about the next pair. Time to get on polymarket

zain@zainbacchus

@pitdesi a new challenger has appeared

English

581

Yuting D.@maybeTuring·6 Haz

Stuck on a long-haul flight for today and the wifi is too bad for real work but not for X. Lucky girl indeed

English

114

Découvrir

@chongz @AnthropicAI @diabrowser @GregKamradt @arcprize @levelsio @cognition_labs @windsurf_ai