Chris Huskey

1.9K posts

Chris Huskey banner
Chris Huskey

Chris Huskey

@chrishuskey

Public markets L/S + late stage VC @OctahedronCap. Prev. same @ Tencent 腾讯投资. 🇺🇸🇨🇳🇮🇳 since birth. NerdNation 🌲 @Stanford is home.

San Francisco, CA Katılım Nisan 2009
4.7K Takip Edilen2K Takipçiler
Chris Huskey retweetledi
Arjun Narayan
Arjun Narayan@narayanarjun·
SpaceX drops the hypiest IPO filing and says "we believe we have identified the largest TAM in human history", and you're like, my interplanetary east India company, what delectable spice have you decided to ship across the stars, and it's like... 22 trillion dollars of b2b saas
Arjun Narayan tweet media
English
42
119
1.5K
413.6K
Chris Huskey
Chris Huskey@chrishuskey·
One of the better descriptions of models’ extreme jaggedness: “You're either in the data distribution (on the rails of the RL circuits) and flying, or you're off-roading in the jungle with a machete (in relative terms).”
Andrej Karpathy@karpathy

Fireside chat at Sequoia Ascent 2026 from a ~week ago. Some highlights: The first theme I tried to push on is that LLMs are about a lot more than just speeding up what existed before (e.g. coding). Three examples of new horizons: 1. menugen: an app that can be fully engulfed by LLMs, with no classical code needed: input an image, output an image and an LLM can natively do the thing. 2. install .md skills instead of install .sh scripts. Why create a complex Software 1.0 bash script for e.g. installing a piece of software if you can write the installation out in words and say "just show this to your LLM". The LLM is an advanced interpreter of English and can intelligently target installation to your setup, debug everything inline, etc. 3. LLM knowledge bases as an example of something that was *impossible* with classical code because it's computation over unstructured data (knowledge) from arbitrary sources and in arbitrary formats, including simply text articles etc. I pushed on these because in every new paradigm change, the obvious things are always in the realm of speeding up or somehow improving what existed, but here we have examples of functionality that either suddenly perhaps shouldn't even exist (1,2), or was fundamentally not possible before (3). The second (ongoing) theme is trying to explain the pattern of jaggedness in LLMs. How it can be true that a single artifact will simultaneously 1) coherently refactor a 100,000-line code base *and* 2) tell you to walk to the car wash to wash your car. I previously wrote about the source of this as having to do with verifiability of a domain, here I expand on this as having to also do with economics because revenue/TAM dictates what the frontier labs choose to package into training data distributions during RL. You're either in the data distribution (on the rails of the RL circuits) and flying or you're off-roading in the jungle with a machete, in relative terms. Still not 100% satisfied with this, but it's an ongoing struggle to build an accurate model of LLM capabilities if you wish to practically take advantage of their power while avoiding their pitfalls, which brings me to... Last theme is the agent-native economy. The decomposition of products and services into sensors, actuators and logic (split up across all of 1.0/2.0/3.0 computing paradigms), how we can make information maximally legible to LLMs, some words on the quickly emerging agentic engineering and its skill set, related hiring practices, etc., possibly even hints/dreams of fully neural computing handling the vast majority of computation with some help from (classical) CPU coprocessors.

English
0
0
1
254
Chris Huskey retweetledi
Shaurya Jain
Shaurya Jain@jain_shaurya_·
Therapist: linear neolabs are not real, they cannot hurt you. me:
Shaurya Jain tweet media
English
56
129
2.1K
137K
Chris Huskey retweetledi
dalibali
dalibali@dalibali2·
Bare metal infra double backlog and growing 50%+ on multibillion scale. Meanwhile, SaaS CFOs be like "well technically, our growth did not decelerate since if you round up the decimal of 9.55%, its basically 10% once you adjust for FX tailwinds"
English
6
4
139
12.3K
Jason ✨👾SaaStr.Ai✨ Lemkin
Ok we're shipping something this weekend that we've REALLY needed 💯An Agentic API Grader. An objective view of APIs from an agent perspective. For builders of B2B apps especially. Which APIs are easiest to use for AI Agents? The hardest? We've got 100 APIs already loaded in, especially ones forces on B2B apps, and will keep working on it: @stripe: A (95) @Adyen: A- (83) @RevenueCat: A- (82) @linear: A- (80) @clay: B (73) @brexHQ: B (72) @HubSpot : B (70) @tryramp: B (67) @Gong_io: B (60) @ZoomInfo: C+ (58) @NotionHQ : C (52) @Marketo: C (50) @Workday: D (38) And we've got full details on why. We're entering the age where you should pick your vendor based not on what your humans need. But what your agents need. For Team SaaStr AI, we're already there. Much more at link in 🧵
Jason ✨👾SaaStr.Ai✨ Lemkin tweet media
English
24
6
55
9.5K
Jason ✨👾SaaStr.Ai✨ Lemkin
It took us 47 iterations to stop one of our AI SDRs from being too aggressive on pricing. Not 3. Not 10. Forty-seven. Each iteration required reviewing outputs, adjusting prompt instructions, testing again. That's not a bug in AI GTM. That's the job. The companies failing with AI agents are the ones who expected 2 iterations. The ones winning expected 50.
English
13
6
97
11.1K
Nathan Baschez
Nathan Baschez@nbaschez·
@paulg Pretty brutal time spent in car unless you can lie-flat sleep the trip
Nathan Baschez tweet media
English
4
0
227
7K
Paul Graham
Paul Graham@paulg·
Whoah, self-driving cars compete with airlines. I never considered that till now.
Nahuel Hilal - TattooGuy@nahuelhilal

Yesterday I drove my @tesla 900 miles on FSD from Miami to Nashville and I realized it’s genuinely the better option. I fly that route 2 to 3 times a month. Flights are never under $400. Most times $600. Sometimes $800. Add Uber to and from both airports, or parking garage fees. Then factor in the delays, the cancellations, the security theater, the chaos, the guy next to you who hasn’t met deodorant yet. On the other hand: I pack healthy snacks, press one button, and the car just goes. I took calls. Replied to emails. FaceTimed my family. Ate without pulling over. Did everything I normally do on a travel day, except none of the stuff that makes travel days miserable. My biggest concern going in was range and charging. Here’s what actually happened: My bladder needed one extra stop the car didn’t even suggest. Most charging stops were under five minutes. Total cost for the whole trip was less than just the uber to the airport. And this was the base model Y. Now I’m thinking I should get something comfier and just make this the default.

English
1K
1.4K
10.7K
3.5M
Chris Huskey
Chris Huskey@chrishuskey·
@paulg SF-LA overnight trips in AVs (“moving hotel rooms”) will be here years before CA completes even the first 1/10 of the promised high-speed rail line…
Chris Huskey tweet media
English
4
0
19
1.8K
Chris Huskey
Chris Huskey@chrishuskey·
@mckaywrigley Same here, but I mean, definitely wouldn’t mind if they also improved Gemini CLI and fixed Google Workspace CLI while the stock price kept going up…
English
0
0
2
160
Mckay Wrigley
Mckay Wrigley@mckaywrigley·
@chrishuskey my stock purchases very much indicate that i’m bullish google. tpu is somehow still underrated and there’s definitely something to be said for serving ai summaries, which are quite good now, to billions of people. but yeah the product stuff is just weird weird weird
English
2
0
5
2.2K
Mckay Wrigley
Mckay Wrigley@mckaywrigley·
some random ai thoughts: - for code, i went from 80/20 claude/gpt to 80/20 gpt/claude in <3 months. surprised by this tbh, and interested to see where the split is at in another 3mo. - claude still mogs gpt for non-coding agent stuff. codex feels like an engineer (which is great for coding!), whereas claude still feels like a general purpose coworker. gpt still lacks that coworker magic - i’m pretty meh on opus 4.7. my experience hasn’t been *bad*, but it certainly hasn’t been good. sideways if anything. - anthropic has got to figure out the compute thing. you can feel it as a user. vibes are all out of whack bc of it. my opinions above are all likely downstream of this. it’s an issue. - anthropic labs continues to be the goat of ai product. claude design is another hit. it’s fantastic. idk why it’s not talked about more? a+ - updated claude code app is great. i finally switched out of the terminal for it. very well done. - how are people STILL sleeping on the claude agent sdk? i feel like i’m going insane. - gpt 5.5 is incredible. the level to which i trust it for engineering is amazing. if i could only have one model rn, it would be this one just bc of strong need for the coding use case. - codex team is killing it. app has been the gold standard since 5.3 release (buuut i credit conductor team for the ui innovation that everyone is using now). though i could do with a little less passive aggressive shots at ant from the codex team. TARS, dial up class by 30%. it’s a long race guys haha - i uninstalled cursor this month and am now back to vs code for my ide. composer just can’t hang with claude/gpt, and the product feels a bit all over the place. pretty stoked about the xai thing though, because their team is absolutely stacked and i’m excited to see what they might be able to do with that compute. codex and claude code are t1, cursor is t2. i would love if this deal got xai/cursor to t1 for a real trio there. - gemini…? seems like this is 2-3 models now where the model seems like a great release and then nobody ever uses it? i’m bullish google/deepmind but weird it hasn’t translated to product use in any form. kinda disappointed still - no open source models have hit the opus 4.5 level. was hopeful the new deepseek would get there, but nope. good oss agents will have to wait a few more months it would seem…
English
151
88
1.6K
218.6K
Chris Huskey
Chris Huskey@chrishuskey·
- New version of Claude Code desktop app is the first really good version -> finally starting to bring me over from terminal to desktop app. (Good UX for running multiple sessions in parallel with easy preview, terminal options for each is the key.)
English
0
0
1
99
Chris Huskey
Chris Huskey@chrishuskey·
- Gemini/Google take nails the conflicted feeling: "…? seems like this is 2-3 models now where the model seems like a great release and then nobody ever uses it? i’m bullish google/deepmind but weird it hasn’t translated to product use in any form. kinda disappointed still"
English
1
0
1
89
Chris Huskey
Chris Huskey@chrishuskey·
Every single line of this is spot on, h/t @mckaywrigley. Could not have said it better point-by-point.
Mckay Wrigley@mckaywrigley

some random ai thoughts: - for code, i went from 80/20 claude/gpt to 80/20 gpt/claude in <3 months. surprised by this tbh, and interested to see where the split is at in another 3mo. - claude still mogs gpt for non-coding agent stuff. codex feels like an engineer (which is great for coding!), whereas claude still feels like a general purpose coworker. gpt still lacks that coworker magic - i’m pretty meh on opus 4.7. my experience hasn’t been *bad*, but it certainly hasn’t been good. sideways if anything. - anthropic has got to figure out the compute thing. you can feel it as a user. vibes are all out of whack bc of it. my opinions above are all likely downstream of this. it’s an issue. - anthropic labs continues to be the goat of ai product. claude design is another hit. it’s fantastic. idk why it’s not talked about more? a+ - updated claude code app is great. i finally switched out of the terminal for it. very well done. - how are people STILL sleeping on the claude agent sdk? i feel like i’m going insane. - gpt 5.5 is incredible. the level to which i trust it for engineering is amazing. if i could only have one model rn, it would be this one just bc of strong need for the coding use case. - codex team is killing it. app has been the gold standard since 5.3 release (buuut i credit conductor team for the ui innovation that everyone is using now). though i could do with a little less passive aggressive shots at ant from the codex team. TARS, dial up class by 30%. it’s a long race guys haha - i uninstalled cursor this month and am now back to vs code for my ide. composer just can’t hang with claude/gpt, and the product feels a bit all over the place. pretty stoked about the xai thing though, because their team is absolutely stacked and i’m excited to see what they might be able to do with that compute. codex and claude code are t1, cursor is t2. i would love if this deal got xai/cursor to t1 for a real trio there. - gemini…? seems like this is 2-3 models now where the model seems like a great release and then nobody ever uses it? i’m bullish google/deepmind but weird it hasn’t translated to product use in any form. kinda disappointed still - no open source models have hit the opus 4.5 level. was hopeful the new deepseek would get there, but nope. good oss agents will have to wait a few more months it would seem…

English
1
0
2
433
Ethan Mollick
Ethan Mollick@emollick·
I have been using GPT ImageGen-2 for the past weeks I didn't think that better image-generators would be a big deal but it turns out that there is a quality threshold I didn't expect, where you can now get text, slides, academic papers Look at what it does with my "otter test"!
Ethan Mollick tweet mediaEthan Mollick tweet mediaEthan Mollick tweet mediaEthan Mollick tweet media
English
73
125
2K
198.4K
Chris Huskey
Chris Huskey@chrishuskey·
@patrick_oshag Agreed except moving this to #1: 1) We need more compute (GPUs, memory, CPUs…just more everything pls)
English
0
0
0
82
Patrick OShaughnessy
Patrick OShaughnessy@patrick_oshag·
Some early thoughts after building real apps by myself for the first time… We built an internal tool called Conveyor It’s an app builder, and internal App Store It is connected to all of our data, context, and external data APIs I’m completely and utterly useless as an engineer, but I’m good at knowing what I want a tool to do. I’d previously struggled to make useful programs with pure CLIs. Our wrapper made it easy for me. In the first 3 days of having this tool, I’ve built several fairly complicated applications, two of which I’ve used a ton for real work. I’ve only used a couple hundred million tokens so far. Some early feelings: 1) It’s obvious to my that my companies Positive Sum and Colossus will have fully bespoke operating systems, built in house. They will manage as much of our work as possible. This is already exploding for things like research and reporting. Every business will want this for themselves. Sure we won’t built our own slack, but we will built everything that pertains specifically to our shape as a firm, which is a lot. 2) x402 protocol (which enables AI agents and users to pay for API access and digital services instantly, without accounts or subscriptions) is immediately interesting to me. Many times I’ve wished I could just stream payments for individual data points. 3) right now each loop of prompt to output takes 5 to 15 minutes. As models and ASICs (@Etched !) make this faster, it’s going to be so much more fun. Even 5 minutes makes it hard to get in the flow. Can’t wait for seconds instead of minutes. 4) it’s so much easier to design things by starting with a shitty first draft of an app and seeing what’s wrong and iterating than nailing a full design ahead of time. When I had directed the design of software before this was always maddening and slow. 5) this has made me realize that my imagination had atrophied. Use it or lose it is real. Very quickly I’m finding it easier to have good ideas by building more stuff. I encourage everyone to do the same. So fun and rewarding. 6) We need more compute
English
21
13
256
31.6K
Aaron Levie
Aaron Levie@levie·
The more I meet enterprise CIOs and AI leaders outside of tech, the more it’s obvious that if you’re building software that doesn’t have a great headless mode, you’re going to be at risk in the coming years. Asked a group of 20 IT leaders across banking, media, finance, and healthcare if they will have any vendors left in 3-5 years that don’t have a good API option for their service and it was a unanimous “no”. This is clearly going to change the nature of software going forward. You have to be completely comfortable serving up your value proposition as much through agent on or off your platform, as you are your own interface. I suspect most platforms will make it to the other side because of how forceful the trend will be, but of course some won’t if their heads in the sand. But on the other end, the upside is that in a world of 100X+ more agents doing work with with software than people ever did, there are far more use-cases to drive and be a part of. In many ways it’s a renaissance if you’re tied to critical data or workflows because of what customers can now use you for. It will certainly force an evolution of business models over time - whether you embed all of this agentic usage in a seat license or make it all consumption based - but dollars will always flow to where value is created. Going to be fun!
English
45
43
468
79.1K
Chris Huskey retweetledi
Peter Gostev
Peter Gostev@petergostev·
I wonder if Anthropic would manage to push past IBM if they release Mythos
Peter Gostev tweet media
English
90
68
1.8K
173.1K