
Aksh Garg
234 posts

Aksh Garg
@AkshGarg03
@mercor_ai, CS @stanford | ex @point72, @tesla, @spacex, @deshaw


Traditional coding benchmarks do not reflect how software is actually built and maintained. That's why we built a new benchmark, APEX-SWE, in partnership with @cognition. It measures whether AI models can perform complex, real-world software engineering work to ship systems that work and debug them when they don't. @OpenAI GPT 5.3 Codex (High) tops the leaderboard at 41.5% on Pass@1.


We're joining @ycombinator this summer! We built @UseLitmus because technical hiring is broken—slow, expensive, & low-signal. Building a company is a privilege. Doing it with @elenaxzhao makes it really fun :) Thanks @snowmaker @gustaf for betting on us early. See you in SF!








Engineers spend 70% of their time understanding code, not writing it. That’s why we built Asimov at @reflection_ai. The best-in-class code research agent, built for teams and organizations.

How can we close the generation-verification gap when LLMs produce correct answers but fail to select them? 🧵 Introducing Weaver: a framework that combines multiple weak verifiers (reward models + LM judges) to achieve o3-mini-level accuracy with much cheaper non-reasoning models like Llama 3.3 70B Instruct! 🧵(1 / N)

Today we’re introducing Crosby, a hybrid AI law firm that helps rapidly growing businesses execute faster. Contracts are connection points. They allow companies to transact with one another and create economic growth. But while every aspect of business has sped up, the way we negotiate contracts hasn’t changed in 50 years. Crosby is building the API for human agreement. We combine the speed and intelligence of AI with the safety of lawyers-in-the-loop to review contracts in under an hour. Since quietly launching in January, we’ve reviewed over 1,000 MSAs, DPAs and NDAs for some of the fastest growing companies in history, including Cursor, Clay and UnifyGTM. Speed to execution is our north star, and today our median review time is 58 minutes. GTM teams call Crosby a secret weapon to close deals 80% faster. We’re just getting started. Today, we’re also excited to share that we’ve raised $5.8m from Sequoia Capital and Bain Capital Ventures, as well as the founders of Ramp, Instacart, Flatiron Health, and others. Crosby is a small, talent-dense team, combining lawyers from Harvard, Stanford, and Columbia Law with engineers from Ramp, Vanta, Meta, and Google. Every engineer on our team today is a former founder. We work in person in New York City. If our mission resonates with you, we are looking for technologists, legal experts, and former founders to join us. For high-growth companies looking to execute faster, we’ve opened up Early Access. Sign up on our website and we’ll be in touch.





