Michael Liao

258 posts

Michael Liao

Michael Liao

@michaelcfix

@mercor_ai, ex-scale ai, founder, @uoft

San Francisco Sumali Mart 2022
183 Sinusundan92 Mga Tagasunod
Michael Liao nag-retweet
adarsh
adarsh@adarsh_exe·
Traditional coding benchmarks do not reflect how software is actually built and maintained. That's why we built a new benchmark, APEX-SWE, in partnership with @cognition. It measures whether AI models can perform complex, real-world software engineering work to ship systems that work and debug them when they don't. @OpenAI GPT 5.3 Codex (High) tops the leaderboard at 41.5% on Pass@1.
English
126
128
826
195.1K
Rich
Rich@richzou·
I just left @xai It was not an easy decision. The past three months were an absolute blast - I've been in many trenches in my life and can say this was by far one of the most intense warzones. I love fighting. Especially being in the trenches with my friends, working on problems that will actually advance humanity. But the current environment wasn't serving my growth. And that's a really hard thing to admit - I've always looked up to Elon, and I genuinely believe xAI will win. I still do. One thing I'll say: don't stay somewhere just because of the name. If you're unhappy, and you know you can't grow 100x where you are - it's the right call to leave. What's next? Get some sleep back. Then find the next trench worth fighting in. I'll always be meeting exceptional people - that was never because of a recruiting title. I just love finding smart people and helping however I can. Many more side quests to come!!!
English
354
47
2.2K
3M
Michael Liao
Michael Liao@michaelcfix·
i always thought that uoft was one of the best sources in terms of talent per $ since there are many very smart people that don’t get paid very well but it seems like uoft students are starting to do quite well nowadays
English
0
0
2
107
Michael Liao
Michael Liao@michaelcfix·
@dopabees yes but grade inflation in canada is also horrendous
English
0
0
0
254
Emily Han
Emily Han@emilyhanyf·
sf needs more board game nights
Emily Han tweet mediaEmily Han tweet media
English
17
2
164
12.6K
Michael Liao nag-retweet
Mercor
Mercor@mercor_ai·
Today, we're releasing our first version of the AI Consumer Index (ACE). ACE tests what people actually ask, and expect, AI to do for them in their personal life. From shopping for a gift to tackling home projects, people are turning to AI for recommendations and step-by-step guidance. ACE contains realistic and challenging evals, split across shopping, food, gaming, and DIY. The results show that models routinely fail on consumer tasks: - @OpenAI GPT 5 is the top model but scores only 56.1% overall. - No model scores over 50% on Shopping tasks, an opportunity worth $5+ trillion globally. - Frontier models frequently hallucinate web content they were supposed to retrieve, getting numbers or a link wrong between 29% to 62% of the time.
Mercor tweet media
English
1
10
35
4.4K
Michael Liao
Michael Liao@michaelcfix·
maybe sf isn’t that bad
Michael Liao tweet media
English
1
0
4
187
Michael Liao nag-retweet
Brendan (can/do)
Brendan (can/do)@BrendanFoody·
We’ve raised our $350M Series C at a $10B valuation from @felicis, @benchmark, and @generalcatalyst. Just 2 years after starting, Mercor is paying $1.5 million per day to experts in our marketplace. We’re creating a new category of work in the AI economy, where software engineers, bankers, lawyers, and other professionals earn based on their experience while advancing the frontier of AI. While most new categories take time to build momentum, we’ve broken every growth record. For comparison, in their first 2 years: - Uber paid out just over a $1 million to drivers - Airbnb paid out $10 million to hosts We are unlocking human potential in the AI economy.
Brendan (can/do) tweet media
English
179
161
1.8K
1.9M
Michael Liao
Michael Liao@michaelcfix·
@cynthwangg depends on what you want it’s still good for an above average corporate career for starting or running a company, probably not so much
English
1
0
1
74
Cynthia Wang
Cynthia Wang@cynthwangg·
just curious after seeing posts about the ex-McKinsey Starbucks CEO – is there still value in the 2-3 year MBB consulting path for new grads?
English
4
0
8
1.9K
melody kim
melody kim@melodyskim·
i figured it out
melody kim tweet media
English
2
1
66
6.8K
Michael Liao
Michael Liao@michaelcfix·
CANT STOP BUMPING KNOCK2
English
0
0
1
129
Michael Liao
Michael Liao@michaelcfix·
one of the biggest red flags i see among startup founders is making a fundraise the biggest talking point vs. product/users/revenue raising a big seed round should never be the end goal
English
0
0
0
134
lei
lei@ujustgotleid·
YC should make a Canadian branch
English
62
7
322
19.4K
Ryan Haraki
Ryan Haraki@ryanharaki_·
life update: I’ve joined @andocorporation as a founding engineer! after spending the summer working on aisdk at Vercel, I was looking for the next ambitious problem to tackle. then I met @saradu and the team, and it just clicked
Ryan Haraki tweet media
English
37
2
199
110.4K
Michael Liao
Michael Liao@michaelcfix·
ichiko aoba u are the best
English
0
0
1
146
ella schlaghecke
ella schlaghecke@ella_schlags·
Nothing gives me the ick faster than performative entrepreneurship
English
2
0
25
1.4K