
Win Wang
865 posts

Win Wang
@Winium
Founder @ ParaQuery (YC X25) | Data processing for the GPU era. Dilettante of FP, PL, philosophy, physics, quantum computing. Possibly writing code.
Katılım Haziran 2011
788 Takip Edilen394 Takipçiler
Win Wang retweetledi

I strongly believe there are entire companies right now under heavy AI psychosis and its impossible to have rational conversations about it with them. I can't name any specific people because they include personal friends I deeply respect, but I worry about how this plays out.
I lived through the great MTBF vs MTTR (mean-time-between-failure vs. mean-time-to-recovery) reckoning of infrastructure during the transition to cloud and cloud automation. All those arguments are rearing their ugly heads again but now its... the whole software development industry (maybe the whole world, really).
It's frightening, because the psychosis folks operate under an almost absolute "MTTR is all you need" mentality: "its fine to ship bugs because the agents will fix them so quickly and at a scale humans can't do!" We learned in infrastructure that MTTR is great but you can't yeet resilient systems entirely.
The main issue is I don't even know how to bring this up to people I know personally, because bringing this topic up leads to immediately dismissals like "no no, it has full test coverage" or "bug reports are going down" or something, which just don't paint the whole picture.
We already learned this lesson once in infrastructure: you can automate yourself into a very resilient catastrophe machine. Systems can appear healthy by local metrics while globally becoming incomprehensible. Bug reports can go down while latent risk explodes. Test coverage can rise while semantic understanding falls. Changes happens so fast that nobody notices the underlying architecture decaying.
I worry.
English
Win Wang retweetledi

We asked a dozen DevTool founders from companies like @RevenueCat, @greptile, @firecrawl, @infisical, @ollama, @resend, @mintlify, @UnslothAI, @porterdotrun, and @recallai, about the state of AI agents and the future of software engineering.
In this episode of Founder FAQ, we covered everything from agents as customers and the end of coding, to advice for founders starting out and what they're most excited about going forward. Their answers might surprise you.
00:00 – Meet the Founders
03:00 – Building for Agents First
04:22 – Biggest Early Mistakes
07:15 – Do Founders Still Write Code?
09:22 – Most Unexpected AI Discoveries
12:09 – What's Underrated Right Now
14:38 – Predictions & What's Next
English

@garrytan Not saying your test suite is bad at all, but I find that AI almost always writes very questionable tests seemingly aiming to increase test case count. And the actual test code itself could also usually be ~30% lower even without culling useless/redundant cases.
English
Win Wang retweetledi

@GergelyOrosz I have a hook auto-blocking rm -rf with a reason + allowable safe (soft) rm, so the agent can continue work but actually get to think a second time.
English

AI agents are far more cable when they have full system access; but when they do, they can mess a lot of stuff up (not unique to any one model). AI harnesses have guardrails: but those can fail.
I wonder if we’ll need OS-level “sandbox primitives” to deal with this better?
Elliot Arledge@elliotarledge
just woke up to opus 4.7 nuking one of my projects during an overnight session. luckily i was able to get it back easily
English

@Winium And in this talk, I'll be like "So here's what I was going to tell there", and just proceed with my original talk.
Clever.
English
Win Wang retweetledi

We doubled Claude usage on weekends, and outside 5–11am PT on weekdays for the next 2 weeks.
Claude@claudeai
A small thank you to everyone using Claude: We’re doubling usage outside our peak hours for the next two weeks.
English
Win Wang retweetledi

@lydiahallie Amazing. I did this all the time as a custom skill-like solution, glad to see it formally in CC!
English

You can now ask Claude a quick question about your current session without interrupting the main task!
It's read-only (no tool access) and doesn't add to the conversation history. The answer just disappears when you dismiss it
Thariq@trq212
We just added /btw to Claude Code! Use it to have side chain conversations while Claude is working.
English

I predict in the near future,
the majority of GPU cycles in the world will execute code written in Python DSLs that lower to machine code without going through C++.
Triton, JAX/Pallas, torch.compile, cuTile, CUTE DSL, etc
The era of C++ and C as the foundational layer is over.
lyv ⌘@wholyv
What if C++ was used for machine learning instead of Python. given C++ evolved to allow that. I wonder how faster, our AI models would be these days.
English
Win Wang retweetledi

We’re excited to release TorchLean which is the first fully verified neural network framework in Lean. The Lean community has largely focused on pure mathematics. TorchLean expands this frontier toward verified neural network software and scientific computing. With the recent release of CSlib, we see this as another step toward a fully verified ML stack.
We support features:
1. Executable IEEE-754 floating-point semantics (and extensible alternative FP models) verified tensor abstractions with precise shape/indexing semantics
2. Formally verified autograd system for differentiation of NN programs Proof-checked certification / verification algorithms like CROWN (robustness, bounds, etc.)
3. PyTorch-inspired modeling API with eager-style development + export/lowering to a shared IR for execution and verification
Project page: leandojo.org/torchlean.html
Paper: [2602.22631] TorchLean: Formalizing Neural Networks in Lean
Work done @Robertljg, Jennifer Cruden, Xiangru Zhong, @huan_zhang12 and @AnimaAnandkumar.
#MachineLearning #ScientificComputing #Lean

English

@Winium yeah, the new built-in worktree feature isn't great. Sub agents don't respect it and file searches end up returning content from the main repo or other worktrees. I'm managing worktrees manually in separate folders. It works ok that way
English





