Haven Vu

410 posts

Haven Vu

@havenvu

building something new, 2x founder, ex gen ai lead ai/ml @ucberkeley project 0: https://t.co/2Nb60qCFSn sleep tracker for Night Shift Nurses

Katılım Ağustos 2024

338 Takip Edilen113 Takipçiler

Sabitlenmiş Tweet

Haven Vu@havenvu·7 May

I’m a top 1% codex and claude code power user. 8-10 terminal tabs always running simultaneously. Hit $200 weekly limit in 1 day. Here are some of my biggest tips. Seriously plug this into your codex or Claude code and ask how you can begin doing this. 1. Stop caring so much about managing context windows. It’s always better to have large agents.md or Claude.md files and burn tokens than have your agent forget details and implement incorrectly. You’ll end up burning way more tokens and wasting way more time if you try to token optimize. Models typically get better and their context windows go up over time anyway. Don’t worry so much about having the perfect context length. That is very short sighted. Instead, you should have a memory log, decision log and a large instruction file so that literally every session has full context on what you’re trying to accomplish. 2. To make faster decisions, tell your agent to ask YOU questions on what you believe the ideal user experience is and work backwards. This will help you understand the tradeoffs of complex architecture without having to understand all of the nuances in architectural decisions. It’s always better to work backward from the user experience because you’ll likely end up refactoring any architecture to cater to a user experience anyway. I’ve refactored my architecture so many times because of some UX issue as opposed to some security issue/ logic issue. 3. To work in parallel, spawn multiple work trees & use docker. However, shipping and integrating with main should always be done sequentially. Shipping and integrating into main will take likely the same amount of time as building all the features out in parallel. But if you try to parallelize many agents writing to main in parallel your code will break. 4. Build harnesses and headless testing for EVERYTHING. The faster your AGENT is able to test its work, the faster you can ship, so spend time building tools for your agent to close its own loops. Without you needing to verify manually. 5. Start barebones with vanilla agents — I’ve uninstalled almost all MCP connections. Almost all of my skills and tools were just coming from workflows I found myself using repeatedly out of vanilla use. Just give your agent knowledge that certain tools exist and they can call it on demand. Otherwise just build your own skills. 6. To prevent your agent from lying about being “done” with a task: Always pair program with another model. The way you do that is to give your agent access to Claude code CLI and Codex CLI and Cursor CLI and Devin CLI as tools/ skills. These CLIs have the best unit economics for calling coding agents. You may end up burning 2x the tokens but you’ll save a ton of time and that will let you ship so much faster (for me 5x faster) because I’m able to have my agents run longer loops when it works with a pair programming agent. While it burns tokens, I can go ship another feature or work on something else. 7. Build your own tutor and spin up small internal tools and web apps to help you read through your codebase simply. Use excalidraw for diagrams and just have your agent teach you the codebase and update its own documentation as the codebase grows. When I was building out my mascots I literally had my agent build out a webpage for me to see all 150 iterations of my mascot. Why would I click through a complex file system when I can literally one shot internal tools for myself? Make yourself work more efficiently with the agent.

English

190

Haven Vu@havenvu·10h

@SIGKITTEN Reg capture and marketing at its peak

English

3.9K

SIGKITTEN@SIGKITTEN·13h

oh no what happened to all the principals and morals from last month

Watcher.Guru@WatcherGuru

JUST IN: 🇺🇸 Trump administration and Anthropic finalizing deal to let US spy agencies use its AI tools.

English

688

12.8K

656.2K

Haven Vu@havenvu·10h

The entire fallout with Department of War last month was marketing 101

Watcher.Guru@WatcherGuru

JUST IN: 🇺🇸 Trump administration and Anthropic finalizing deal to let US spy agencies use its AI tools.

English

Haven Vu@havenvu·11h

It will take you way longer than you think to accomplish what you want

Bambulu@Bqmbulu

What’s the harshest truth every young man must eventually learn?

English

219

Haven Vu@havenvu·11h

@PopcornPost_

GIF

QME

136

Popcorn Post@PopcornPost_·1d

Name a movie you had no idea was gonna be THAT good.

English

741

436

60.5K

Haven Vu@havenvu·16h

@robinebers Bro just get an independent model to run as an adversarial judge

English

Robin Ebers · AI for Non-Coders@robinebers·1d

Composer 2.5 in a nutshell: it's fantastic, until it isn't you can cruise smoothly for an hour, and then a silly thing trips it up (like some nested CSS that doesn't render correctly) it's when a lot of dots connect that these cheaper models still struggle the good news is that this is exactly where Cursor shines - literally switch a model mid-session, fix it, and move back to Composer 2.5

English

178

11.9K

Haven Vu@havenvu·23h

Founder of notion just echoed everything I said weeks ago with 1000x the reach. Stop babysitting your agents. You will get so much further if you let agents run longer loops and challenge one another as adversarials for task completion. Focus on UX, architecture and product.

Simon Last@simonlast

1/ Some things I've learned recently running coding agents on large-scale projects. Most of this contradicts advice from 6 months ago!

English

202

Haven Vu@havenvu·1d

People look at this and think "what a failure" Only to realize we had no choice but to plunge every single rocket into the ocean with 0 recoverable parts up to 10 years ago. This is what real progress looks like.

Elon Musk@elonmusk

English

Haven Vu@havenvu·1d

I have: 2x Codex $200 subscriptions 1x Claude $200 subscription 1x Cursor $20 subscription 1x Devin $20 subscription Anyone else on a similar boat?

English

Haven Vu@havenvu·1d

Literally called it

DiscussingFilm@DiscussingFilm

‘OBSESSION’ may become one of the only films in history to earn $100M+ on a budget of less than $1M. The film is expected to earn more this weekend than it did in its opening weekend, a rare feat.

English

Haven Vu@havenvu·1d

1. Switching costs: I want to be able to take convos over from Codex CLI to my codex app. Right now I’m so used to the CLI and when I open the codex app, I see nothing. 2. Software shape and work tree shape: as someone who runs multiple agents in parallel, understanding where they are all at and how merge/ integrate safe they are to main and having clear documentation/indexing would be really helpful

English

jason@jxnlco·1d

If you're using codex desktop app today, what features do you feel like are still missing? Let me know and I’ll summarize all the feedback and share internally.

English

933

572

71.8K

Haven Vu@havenvu·2d

@jaegermedia1 This has got to be rage bait

English

Jaeger Media@jaegermedia1·3d

Christopher Nolan has mastered the art of making his films appear to be deep at first glance, but on the second watch revealing just how superficial and pretentious they really are. Has anyone actually been able to enjoy a movie of his on the second viewing?

English

666

699

448.5K

Haven Vu retweetledi

arvo färt@arvofart·2d

It’s curious how often Hereditary gets talked about as the film that started the current trend in horror cinema when that trend had already been in full swing for years by that point. The real trendsetter was arguably The Babadook, but the trend started even earlier than that

English

122

3.3K

81.2K

Haven Vu@havenvu·2d

@kunchenguid @NedNguyen Not true, they also provide CUDA. If openAI had never touched harnesses, there would’ve never been a ChatGPT moment.

English

Kun Chen@kunchenguid·2d

@NedNguyen nvidia is at that size nvidia doesn’t try to own everything - they partner with the ecosystem “do as much as needed. as little as possible”

English

1.3K

Kun Chen@kunchenguid·2d

i'm strongly against model companies focusing too much on harness, but i would love to hear if anyone has a strong argument for it my reason against it: if openai didn't build GPT 5.5, no one else can. this is their core competence if openai didn't build codex cli and app, we have opencode and t3code. building harness is NOT their core competence this is not saying products like claude code, codex aren't good - i genuinely think these are top tier products built by really talented people my point is - the world might be a better place if model companies focus more on their core capability and give us better, faster, safer and cheaper models, rather than competing with the ecosystem in the application layer what do you think?

Greg Brockman@gdb

the model alone is no longer the product

English

250

521

119.8K

Haven Vu@havenvu·2d

@mark_k I basically never clear context anymore cause compaction is so good

English

Mark Kretschmann@mark_k·2d

Codex really needs a simple context meter. Just show the currently used context window percentage somewhere in the UI. When you’re deep into a long coding session, it would be extremely useful to know whether you’re at 30%, 70%, or about to hit the wall.

English

118

8.8K

Haven Vu@havenvu·2d

@yulo_tech Unit economics won’t allow them to win the market unless they own their own foundational model.

English

121

Yulo@yulo_tech·3d

PostHog will destroy Claude Code and Codex The moat they'll have from user behavior data and error logs will for the first time give AI tasks that are actually useful and not slop features or things that don't matter Can't wait to try it

PostHog@posthog

Introducing PostHog Code, the product editor that: - Understands your product - Identifies usage patterns - Triages bugs and errors for you - Creates PRs to fix them - Continuously monitors and improves your product Join the waitlist: posthog.com/code

English

716

272.2K

Haven Vu@havenvu·2d

What’s harder? Bankruptcy working 100 hrs/week or getting paid $500k/year working 60hrs/ week

signüll@signulll

which job is more difficult? vp/svp at a big co. or startup founder & why?

English

Haven Vu@havenvu·2d

It's okay because we save water from not showering

Acyn@Acyn

AOC: This is what drinking water in Georgia looks like after Meta began data center construction in the community.

English

Haven Vu@havenvu·3d

Is there any benefit to using the Codex/ Claude app compared to their CLIs?

English

Haven Vu@havenvu·3d

@_robyn_smith Why is this a horrible take?

English

822

robyn@_robyn_smith·3d

I think Jeff is one of the smarter entrepreneurs of our time and I think he should stay in that lane because this is an incredibly horrible take

Jeff Bezos@JeffBezos

Thank you. The important part is zeroing out taxes on the bottom half. Best way to put money in someone’s pocket is to not take it out in the first place. Bottom half is only 3% of total tax revenue. But it’s very meaningful to that person. Zero it out.

English

66.5K

Keşfet

@SIGKITTEN @PopcornPost_ @robinebers @jaegermedia1 @kunchenguid @NedNguyen @elonmusk @BarackObama