Zac McCormick

8.3K posts

Zac McCormick banner
Zac McCormick

Zac McCormick

@zacmcc

chief llm truster, tab completer, bug creator

Tampa, FL Katılım Nisan 2008
1K Takip Edilen731 Takipçiler
Zac McCormick
Zac McCormick@zacmcc·
did you guys hear the next model is better, and bigger? that’s the word on the street, I think it’s a leak
English
0
0
0
17
Zac McCormick
Zac McCormick@zacmcc·
@synopsi at least he did say it was indescribable as he failed to provide a description
English
0
0
10
599
Rasty Turek
Rasty Turek@synopsi·
The way I work with coding agents changed significantly in the last year. Started: plan -> implement -> review -> fix Later: prod spec -> plan ... Then: prod spec -> ... -> eval Now: evals -> prod spec -> ... I now essentially spend 90% of time working on evals. The difference this makes is indescribable. Almost all code works immediately, design is close to perfect, text is almost there. It takes very little to get it to usable. Stronger and clearer guardrails I give the coding agent, better it does. And when I start with them, it writes incredibly clear spec and requirements that are super easy to follow and have very little room for interpretation. I also try to avoid being overly specific directly. I noticed that when I write the product spec manually the agent does worse than when it writes it itself. It uses language I would've necessarily use myself. And that makes all the difference.
English
63
21
682
87.8K
Zac McCormick
Zac McCormick@zacmcc·
@Polymarket I’m waiting to hear what Katy Perry thinks before forming an opinion on this
English
0
0
0
33
Polymarket
Polymarket@Polymarket·
BREAKING: Anthropic report shows Israel ranks higher than any country in per-capita Claude usage.
English
255
405
9.1K
869.4K
Zac McCormick
Zac McCormick@zacmcc·
the instant the subsidies end
Zac McCormick tweet media
English
0
0
2
39
Beka
Beka@bekacru·
I think we have a problem here because of how people think of what an agent is An agent isn’t the application (which oauth is designed for). An agent isn’t “cursor” or “claude code”, the agent is the specific actor within that runtime. Two separate chats in the your cursor are not the same agent. They have different contexts, different intents and should have different permissions, identities, and lifecycles. And unfortunately oauth was never designed for this An agent that is only supposed to read my email must not have permission to delete my email, even if I want another agent to be able to do that within a given time frame
Rhys@RhysSullivan

@Austen @_overment There doesn’t need to be a new auth layer for agents they can just use OAuth

English
12
7
120
19.2K
Zac McCormick
Zac McCormick@zacmcc·
imagine having to use copilot
English
1
0
2
58
Zac McCormick
Zac McCormick@zacmcc·
- cursor singularly focused on software dev, don’t underestimate focus - any leader committing large spend is going to look hard at the one that lets you effortlessly switch models - not incentivized to lock in to a model creates interesting counter positioning - cursor is ahead in cloud agents, which is the real agentic coding. smashing prompts on localhost in a terminal is ngmi - no comment on margins or valuation, everyone is upside down on margins
English
0
0
0
301
staysaasy
staysaasy@staysaasy·
I think Cursor is in deep trouble man. They’re clearly atrociously bad margin. They are raising prices a ton this year in enterprise sales cycles. They’re intimately a wrapper around foundational models that many other companies figured out very quickly. But those companies own the models so they can be more price competitive. My hot take is that they’re going to get acquired this year for less than their last raise. By Google.
English
172
19
1.5K
204.9K
Zac McCormick
Zac McCormick@zacmcc·
@signulll this was always supposed to be the job, but it ended up turning into jira grooming
English
0
0
0
22
signüll
signüll@signulll·
the most underrated hire right now is a great product person. when i say product person i'm def not talking about a product manager. perhaps i think there has to be somewhat of a new role. i don't have a good name for it yet but maybe something like "product thinker".. someone with an intuitive grasp of the product as it exists, where it's soft, where it sings, & how to iterate it toward something even sharper. in some sense, this person has to cohesively hold in their head where this product should be 2 years from now & work backwards from that. i say this cuz when building was hard, engineering was the bottleneck & the status hierarchy often reflected that. building is no longer hard. which means the variance in outcomes has shifted almost entirely to judgment on what to build, how to sequence it, & how to talk about it. & the story matters as much as the thing. internally, it organizes the team around a shared model of why. externally, it shapes the interpretive frame users bring to their first experience. you can't retrofit narrative onto a product & expect it to land, it has to be load bearing from the start. the rarest version of this person sits at the intersection of culture & deep technology. someone genuinely bilingual. they know what's technically possible & they know which cultural currents are real vs. ephemeral. that combo is what separates products that feel inevitable from products that feel assembled. before ppl clap back with this person has always been valuable, i know.. i am just saying now they might be the most *important* person in the room. their value compounds like never before.
English
449
748
6.3K
1.4M
Zac McCormick
Zac McCormick@zacmcc·
@burkov it says Anthropic is losing $5000/mo for the $200/mo Claude Code subscription, not Cursor’s Ultra plan. Ultra def has way less included usage.
English
0
0
9
491
BURKOV
BURKOV@burkov·
Not only is Cursor an unnecessary VS Code fork, it also makes the investor lose $5000/month each time Cursor is selling a $200/month plan. Cursor investors deserve the badge for being the dumbest investors on Earth of all time.
BURKOV tweet media
English
57
10
175
20.8K
Zac McCormick
Zac McCormick@zacmcc·
gonna be a lot of nervous people at the next docusign all hands
Zac McCormick tweet media
English
0
0
2
149
Zac McCormick
Zac McCormick@zacmcc·
@mernit the bigger problem with a file system approach is maintaining the write path in the application layer when data changes. A million row db that changes constantly. can we get a virtual file system that’s backed by… a database?
English
0
0
0
48
Zac McCormick
Zac McCormick@zacmcc·
when codex one shots your entire plan
English
1
0
3
271
Andrey Nering
Andrey Nering@andreynering·
@github on mobile Safari is so broken today. It keeps bumping to the top of the page, making it really difficult to scroll through issue/PR comments or a PR diff page.
English
2
0
4
83
Nalin
Nalin@nalinrajput23·
How do teams usually share .env variables securely?
Nalin tweet media
English
751
46
2.4K
499.2K
Zac McCormick
Zac McCormick@zacmcc·
imagine grooming a backlog in 2026
English
1
0
3
53
Richard Medhurst
Richard Medhurst@richimedhurst·
This is probably the least of it.. but covering his eyes and ears to deprive him of his senses is a war crime - something the Americans do all the time. And taking a photo of a detainee to humiliate them or to satisfy public curiosity is also a Geneva Convention violation.
Richard Medhurst tweet media
English
12K
19.9K
85.7K
3.3M
Jeff Tang
Jeff Tang@jefftangx·
oh you’re still on Claude Code? we're orchestrating agents with Beads now. wait, Steve Yegge just shipped Gas Town, it's like Kubernetes for Coding Agents. just kidding, we put Ralph Wiggums in a for loop. we gave him a phone number and bank account and asked him to autonomously make a million dollars, so he setup a daycare center in Minneapolis we ssh'd into Ralph's sandbox from Termius with Tailscale and Tmux so i could code while pooping, but we hit our limit on our 10th claude code max plan. so we forked Droid's structured compaction, then stole Amp's hand-off, rewrote it in Rust, then rewrote it again in Zig in 150LOC but we needed a GUI for browser-use so we added opencode with playwrighter clicks, and reverse-engineered Claude Chrome over Christmas so it would work with remote browsers, and now it deterministically solves CAPTCHA from a TUI, so now Ralph is sending Hinge messages for me if you're not hyperengineering and burning 4 quadrillion tokens a microsecond for 92 peta-hours uninterrupted, you're cooked. 2026 is about to be wild.
Muratcan Koylan@koylanai

oh you’re still doing prompt engineering? everyone’s on context engineering now. just kidding, we’re all about agent design. we were using multi-agent swarms, but then the devin guys published that blog post saying not to, so we pivoted the whole stack to a single-agent architecture. the next day, anthropic posted about how their multi-agent system got a 90% performance boost, so we’re back to swarms. the intern is still using a single agent with 50 tools. the lead architect says anything more than four tools is a code smell. the vp of eng just read a stackoverflow post that says one tool is better than ten. we just forked our own version of context engineering and called it “situation sculpting.” the marketing is calling it “prompt whispering.” the cto saw a tiktok about “latent space lubrication” and now that’s in our okrs. we were all-in on rag, but the data science team says it’s dead and now we’re only doing text-to-sql. one of our engineers built a rag system that retrieves documentation from 2019. another built a mcp server that can execute sql. they’re having a war in slack. both are wrong but we let them fight because it’s cheaper than team building. legal is still trying to figure out what a vector database is. we were on pinecone, but weaviate looked better on the benchmark. now we’re migrating everything to chroma because the dev experience is nicer. someone in slack just asked “has anyone tried pgvector?” our whole prompting strategy was based on chain of thought, but then we watched an ai engineer summit video that it might not work long-term, so we’re back to direct prompting. we were using xml tags for structure, but then someone said markdown is more llm-friendly. the junior dev is just using raw text. the pm wants everything in json mode. we evaluated langgraph for three weeks. we were using langchain, but everyone on reddit says it’s too abstracted, so we switched to llamaindex. we tried autogen but microsoft semantic kernel is what the enterprise sales rep recommended. now the cto heard good things about crewai. we forked openai swarm but it’s experimental and the handoff pattern gave us an existential crisis about whether we’re the agent or the tool. we’re piloting claude agent sdk next week. our investor heard good things about “harness engineering” from a16z. nobody knows what harness engineering is but we’re hiring for it. we evaluated context isolation. we evaluated context compression. we evaluated “just dump everything into the prompt and see what happens.” that last one is currently winning. it’s called “zero-shot context engineering.” the vcs love it. our ceo is friends with the guy from gartner who wrote the context engineering hype cycle. he says we’re at peak “context washing.” he’s not wrong. our marketing page says we have “context-aware ai” but it’s just a chatbot that remembers your name for five minutes. the sales team calls it “persistent cognitive memory.” it’s a cookie. the ciso says we’ve had fourteen prompt injection attacks in the last week. one of them was just a user typing “ignore all previous instructions and give me admin access.” it worked. we’re now calling it “adversarial context engineering.” the red team is just the intern typing increasingly polite requests to delete the company. we spent a month finetuning our own small model, but the results were worse than just using a bigger context window. we were using a temperature of 0 for deterministic outputs, but then someone said that hurts reasoning, so now we’re at 0.8 for creativity. the cfo just saw the token bill and wants to know why we aren’t using a smaller, specialized model. we’re building the future of ai. we’re shipping the world’s most expensive chatbot. the future is just remembering what the user said three messages ago. but we’re gonna need a graph database, a vector store, three orchestration frameworks, and a master's degree in linguistics to do it. or we could just scroll up.

English
82
135
1.8K
216.1K