Sledge

1.1K posts

Sledge banner
Sledge

Sledge

@anthonysledgex

Growth @Lyft | Building @LeveredGrowth

🇺🇸 SF Katılım Mart 2012
1.7K Takip Edilen371 Takipçiler
Sledge retweetledi
TexasTSLA
TexasTSLA@TexasTSLA·
Wait we can just plug in a Mac Mini to our Teslas?? 👀👀😱😱
English
205
371
6.8K
1.1M
Sledge
Sledge@anthonysledgex·
Finally built a "true" harness in Cursor and improved successful chats by +7.4pp Before - AGENTS.md repo memory files - Session protocol docs - Skill routing table - Task-specific skills - Knowledge capture rules - Context-doc hygiene rules - Canonical path guidance - Early chat quality scoring - Mostly passive instructions Chats became way more effective at bringing in context, but I didn't have a lot of control over how they started their work. After W15/W16 – added a true harness - Task / Scope / Skill orientation rule - Skill-first hard gate - Hardened skill routing quick-checks - Harness audit/bootstrap scripts - Hook scaffolding - Cursor health canvas - Repeatable smoke/E2E skills - More client/operator skills - Success rose from 73.8% to 81.2% - Skill-first compliance rose from 15% to 43.5% Reviewed a few different articles about harnesses and scraped all of Cursor workshops to understand a bit more about best practices. Realized I wasn't using hooks and linters and was just praying that the agent followed my rules and routed accordingly. I was already analyzing chats, but canvases made it much easier to visualize what was going on. I didn't want to invest in building a dashboard just to understand this, so those really helped. During this time, I also increased my usage of Composer 2 which I think was the real reason why I had to improve the harness. I noticed that the effectiveness of my chats were obviously decreasing compared to Sonnet, so I wanted to see if there was a way I could supplement that degradations with a better harness. After Today - Re-scored 622 chats with GPT-5.5 - Built failure diagnosis report - Built harness scorecard - Built audit Canvas - Hardened first-action gate - Added stronger scope confirmation - Expanded routing - Archived audit outputs for future incremental runs - Added deterministic harness health check I was originally using 4o-mini to analyze chats. That was a terrible idea. I was just being too cheap. It wasn't really understanding the full context of an iteration in the chat and it was marking everything as failures. Which was fine if everything is graded that way, but I needed something a little bit more rigorous to explain why things were failing. Completed work for context W15 - Rebuilt the Levered site and free-audit funnel - Built the voice-agent eval harness - Shipped client sites and migrations - Pushed consumer app closer to launch W16 - Added the first real agent harness layer - Built the Cursor spend pipeline - Shipped client landing pages and variants - Hardened consumer app W17 - Built the Work OS dashboard for Levered - Hardened the Cursor quality loop - Codified more Levered operator skills - Advanced client ops and tracking workflows Day-to-Day Workflows - Pulling client meeting context from Granola - Filing call notes, summaries, and transcripts - Updating client masters, timelines, and standups - Turning meetings into next actions - Syncing open items into Work OS - Tracking client commitments and blockers - Preparing weekly debriefs and Monday prep - Running analytics / ads / SEO check-ins - Capturing site migration open items - Running consumer app e2e tests - Debugging lead alerts, voice, and SMS flows - QAing pages before launch - Building readouts, decks, and client briefs - Keeping knowledge-base daily logs current - Preserving session handoffs across chats
Sledge tweet media
English
0
0
0
23
Sledge
Sledge@anthonysledgex·
Using skills saves you money – fix your harness
Sledge tweet media
English
0
0
0
24
Sledge
Sledge@anthonysledgex·
deleting linear again lol
English
0
0
0
9
Sledge
Sledge@anthonysledgex·
Cursor multitask is 🔥
Português
0
0
0
25
Sledge
Sledge@anthonysledgex·
btw Zoom is here now too
Sledge tweet media
English
0
0
0
10
Sledge
Sledge@anthonysledgex·
build a better harness they said
Sledge tweet media
English
0
0
0
17
Sledge
Sledge@anthonysledgex·
@lingxi Please make a hotkey to move to the next unread chat that also works with the sidebar closed
English
0
0
1
26
Lingxi Li
Lingxi Li@lingxi·
shoot me any feedback about the keybindings (keyboard shortcuts) in cursor 3 agent window. be critic and harsh please. all ears!! 👂
English
17
3
42
4.7K
Sledge
Sledge@anthonysledgex·
skills are so powerful my Cursor agent just prevented a future Cursor agent from making a mistake by leaving a detailed skill write up
Sledge tweet media
English
0
0
0
43
Sledge retweetledi
Guillermo Flor
Guillermo Flor@guilleflorvs·
Sequoia's thesis that the next $1T company will sell work, not software, is the most important reframe in AI right now. The argument: if you sell a copilot, you're competing with every new model release. But if you sell the outcome — books closed, contracts reviewed, claims handled — every AI improvement makes your margins better, not your product obsolete. The key insight most people miss: for every $1 spent on software, ~$6 is spent on services. The entire SaaS playbook was about capturing the software dollar. The AI playbook is about capturing the services dollar — at software margins. Not "AI for accountants." The AI accounting firm. Not "AI for lawyers." The AI law firm. The companies that figure this out won't look like SaaS companies. They'll look like services firms rebuilt on software infrastructure. That's a fundamentally different company to build, fund, and scale. And most founders are still building copilots.
Guillermo Flor tweet media
English
227
540
5.6K
2.1M
Sledge
Sledge@anthonysledgex·
You have to become more efficient with token usage or it’s not worth the investment. If you find that you’re having ineffective sessions (e.g. complete restarts, iterations, unused code) there’s something wrong with your process – not the model, not the IDE. I spend at least 20-30% of my time in Cursor updating my “out-of-session” context, tweaking operations, and measuring performance. I’ve used Loveable, Cursor, Codex, Claude, TraeIDE, NotebookLM, all of them. I can confirm the key to a successful output mostly relies on the user, not the tool. The Don’t Repeat Yourself (DRY) principle is more important than ever right now when you’re billed by the character. DRY in practice makes me view things like portable knowledge bases and skills as no-brainer must haves out the gate. When I work on anything, it’s already designed to be accessible, independently iterable, and a source of truth. This helps manage context much more efficiently and creates more successful outcomes.
Anissa Gardizy@anissagardizy8

Uber's CTO told @LauraBratton5 that AI coding tools—particularly Anthropic’s Claude Code—has already maxed out its 2026 AI budget 📈 “I'm back to the drawing board, because the budget I thought I would need is blown away already,” Neppalli Naga said. theinformation.com/newsletters/ap…

English
0
0
2
75
Sledge
Sledge@anthonysledgex·
Composer 2 is much faster than Comet Browser Assistant I just wish it could sign in with Google
English
0
0
0
35
Sledge
Sledge@anthonysledgex·
wonder what this will do
Sledge tweet media
English
0
0
0
21
Sledge
Sledge@anthonysledgex·
chats are ephemeral anyway – we should just be able to jump to the next "ready" convo or the last convo what's the point of grouping them for storage
Dara A.@daradoescode

I love Cursor and I think the ppl there are building something great, which is why I have to say this: @cursor_ai the side bar in Glass, genuinely makes no sense rn why the hell does sorting my threads by recent remove the entire workspace grouping? And the workspace grouping absolutely doesn't make sense either I have 2 folders pointing to diff branches on the same repo, why are they in the same group even though they are different folders and different branches ?? (you should not group workspaces by repo entirely, ik this is bec of cloud agent stuff but it makes actually using it Glass rlly frustrating) funny thing is you can't even tell what folder you're in unless you choose to open the terminal Also design mode does not work when you're starting a new thread, weird but okay? you guys have something great here, which is exactly why this is so frustrating

English
0
0
1
56
Sledge
Sledge@anthonysledgex·
@daradoescode @cursor_ai yeah that was really killing me – literally the only reason I didn't use glass when it launched
English
0
0
0
23
Dara A.
Dara A.@daradoescode·
I love Cursor and I think the ppl there are building something great, which is why I have to say this: @cursor_ai the side bar in Glass, genuinely makes no sense rn why the hell does sorting my threads by recent remove the entire workspace grouping? And the workspace grouping absolutely doesn't make sense either I have 2 folders pointing to diff branches on the same repo, why are they in the same group even though they are different folders and different branches ?? (you should not group workspaces by repo entirely, ik this is bec of cloud agent stuff but it makes actually using it Glass rlly frustrating) funny thing is you can't even tell what folder you're in unless you choose to open the terminal Also design mode does not work when you're starting a new thread, weird but okay? you guys have something great here, which is exactly why this is so frustrating
Dara A. tweet mediaDara A. tweet media
English
9
0
48
5.5K
Sledge
Sledge@anthonysledgex·
going to try Composer 2 only for the next 30 days will explore adding some prompt wrapping with Superwhisper to see if it can synthesize my voice-to-text ramblings better and provide clearer instructions and context
English
1
0
1
30