andrew chen

31.2K posts

andrew chen banner
andrew chen

andrew chen

@andrewchen

🇺🇸 a16z speedrun

Katılım Nisan 2007
14.9K Takip Edilen364.4K Takipçiler
andrew chen
andrew chen@andrewchen·
for founders in Boston later this month - we’re hosting a few events for new startups + lots of stuff on the calendar too 👇🏼
Tech Week@Techweek_

if you're raising a seed round this year, 4 boston tech week events that will actually help: → The Investment Game with @emilybenn12 (@a16z). watch three founders pitch live and see exactly what an a16z partner cares about. you'll spot patterns in your own pitch in 30 minutes. may 28. partiful.com/e/ytxnudn6nO1v… → Founders & Funders VC Reverse Pitch. investors pitch you their thesis, check size, what they look for. previous editions produced term sheets. partiful.com/e/GCNJtSkNtbbN… → Two Lanterns 2LVC x OSS Investor Dinner. open source founders + investors in a dinner format. lower pressure than a pitch event, higher signal than a happy hour. may 26. partiful.com/e/N0GDRp8irDf7… → Finta. How Founders Actually Raise in 2026. The real playbook on what's working right now. less aspirational, more operational. may 27. partiful.com/e/zdC0kojFy1dJ…

English
8
4
50
22.6K
Dennis Motta
Dennis Motta@desno365·
@andrewchen I am. the main issue is that only verified apps can access gmail and drive, so it will be much harder to set up openclaw/hermes to access your gmail
English
1
0
2
121
andrew chen
andrew chen@andrewchen·
dear lazy web who’s on the Google “Advanced Protection” program and whatcha think?
English
6
0
21
7.6K
andrew chen
andrew chen@andrewchen·
(This was my reaction after watching the Google keynote)
English
0
0
8
3.4K
andrew chen
andrew chen@andrewchen·
is agentic coding in terminal vs in the desktop IDE like the new “emacs vs vim” debate?
English
52
2
78
10K
Founder Engineer
Founder Engineer@founderengineer·
@andrewchen I am and iCloud advanced protection as well but as usual you are sacrificing convenience
English
1
0
1
205
andrew chen
andrew chen@andrewchen·
who’s in boston/nyc later over the next few weeks? The team is hosting a bunch of stuff alongside portcos, partners, etc — May 26-June 7 Boston Tech Week, followed by NY Tech Week right after. This is going to our biggest Tech Week ever:

- We have over  2000+ events - 15+ tracks - infra, founders, engineers, hackathons etc - 50+ portcos participating: OpenAI, Elevenlabs, Deel, Gamma, xAI, Stripe and speedrun companies too
andrew chen tweet media
English
61
12
298
865.7K
Tery Emilson
Tery Emilson@emilson_tery·
@andrewchen For me, ~80% of daily LLM queries are already local — summarize/extract/search/classify on Qwen 3 4B Q4, Windows CPU. The remaining 20% (hard reasoning, code gen) goes cloud. The shift isn't "when" anymore; it's distribution. Most users don't know the stack exists.
English
1
0
1
209
andrew chen
andrew chen@andrewchen·
How soon before a real % of LLM queries are done via local AI models running webGPU in-browser, and are never sent to the SOTA model in the cloud? Couple things that might drive this: - you don’t need a frontier model for everything. A very large % of LLM queries are simple, google like queries. Easily handled - local models are getting really good, and getting better - a lot of consumer hardware (particular Apple!) can already run good models pretty well. Newish mac laptop running qwen 3.6 35b MoE LLM at great speeds. Hardware is going to get even better in this direction - there’s def some use cases where people will care about privacy. Health, financial, adult stuff etc. - nice part about browser/webGPU is that there’s no install. It’ll just work. And alleviate compute costs Of course the tension for this is that we’re just going to build a shitload of compute in the world, and tokens will get cheaper over time. Yet it seems like the demand is also so crazy that unlocking a bunch of local supply will be worth it too
English
41
7
97
11.3K
andrew chen
andrew chen@andrewchen·
@wkoszek Yeah seems like the best you can do rn with coding harnesses is to point it at a liteLLM instance, and then configure the routing there - since liteLLM can also send to Claude and GPT, in addition to local
English
2
0
1
189
Adam Koszek
Adam Koszek@wkoszek·
@andrewchen Claude will become super tool if it can do some minimal amount of local dispatching. Perhaps even just ! command improved would be a good start.
English
1
0
1
144
andrew chen
andrew chen@andrewchen·
@PaulGugAI We’ve been producing energy in our homes for a long time. Stoves, car, heater, etc! So yeah the analogy makes sense
English
0
0
1
184
GooGZ AI
GooGZ AI@PaulGugAI·
@andrewchen The analogy of solar panels come to mind.. ie energy demand is going up but home owners putting up panels to save on costs also helps to alleviate demand over time.. win win.
English
1
0
3
220
andrew chen
andrew chen@andrewchen·
@hanzi_li Just need to have really smart low latency routing to figure out where to send it to. I run liteLLM pointing at a fast vs big model but the logic for when to send isn’t great
English
1
0
1
127
hanzi
hanzi@hanzi_li·
@andrewchen yeah the annoying sweet spot is boring local stuff. classify this page, summarize this inbox, draft the 80 percent answer. cloud only when it gets weird
English
1
0
1
102
andrew chen
andrew chen@andrewchen·
Yeah the counterpoint might be: - TTFT is often faster on frontier than local simply bc it’s on specialized hardware - consumers say they care about privacy but then they just give data away as long as they can use a product for free But yeah I still mostly agree w you on the outcome
English
1
0
1
143
Roy
Roy@usr_bin_roygbiv·
@andrewchen I think due to latency and expense as well as privacy it makes sense for the market to bifurcate into local and big datacenter stuff and local models will be more specialized/fine tuned for specific tasks.
English
1
0
1
163
andrew chen
andrew chen@andrewchen·
haha - Shoutout to the “AI is moving too slowly” group!! My people
andrew chen tweet media
English
35
10
264
24.4K
shashank
shashank@aloobhujiyan·
I already have one. Yeah, markets are tough right now for Mac studio. eGPUs are great, just hard to connect to a mac, which is my personal device. I did build my own 2x3090 GPU rig as well, and its just keeping it powered on and stable while running linux, is a full time hobby that I chose not to continue and switched to modal.
English
1
0
2
200
andrew chen
andrew chen@andrewchen·
finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then another, then another… But I’m running qwen3.6 27b dense at 100 tok/s now on a 5090 eGPU! It feels like sonnet 4.6? Fast and highly usable I figure the GPUs I have will now increase in value over the next few years so it’ll all be worth it
English
33
2
129
12.1K
andrew chen
andrew chen@andrewchen·
@kidxbt Aorus 5090 AI box My main complaint is that it only runs via windows and sucks at Linux and couldn’t get it work on Mac via tinygrad/tinygpu. But at least now it’s working. I’m sure an actual GPU is much faster but this has its own charm
English
1
0
0
207
DeKid
DeKid@kidxbt·
@andrewchen what enclosure are you using and how’s the bandwidth/latency holding up in practice?
English
1
0
0
212
andrew chen
andrew chen@andrewchen·
@SamJWasserman I do have two sparks but no I haven’t tried it even though I got the QSFP+ cable, just bc the big models are all super slow tokens/s… but I should try it eventually anyway just for fun
English
0
0
1
295
Sam Wasserman🦞
Sam Wasserman🦞@SamJWasserman·
@andrewchen have you tethered more than one together, for example two sparks to then have 40 cores and 256ram? With the high speed cable nvidia provides? thats what I'm trying to optimize for right now ideally.
English
1
0
1
424
andrew chen
andrew chen@andrewchen·
@KashPrime I just have one eGPU and yes I have a PC for it running windows (Linux+eGPU is unfort a mess rn)
English
0
0
1
154
andrew chen
andrew chen@andrewchen·
@cchevyyc The application process is the actual process, but yeah, lots of adjacent events also for folks who want to network and meet investors too
English
1
0
2
325
Chevy
Chevy@cchevyyc·
@andrewchen If we’re going through the application process for speedrun, should we also sign up to pitch for the event on June 1st?
English
1
0
1
387