Lan Jiang
306 posts


"Our approach is to build products and conduct research that are in service of accelerated AI deployments. Our platform team builds tools and context primitives that enable faster deployment. Our research team builds frontier systems, including a state-of-the-art RL stack. We then take that research and product and forward-deploy with our customers to help deliver real value." Thanks @foundersysk for having us. Open roles at: jobs.ashbyhq.com/Applied%20Comp…

Introducing Devin 2.2 – the autonomous agent that can test with computer use, self-verify, and auto-fix its work. Try it for free! We’ve also overhauled Devin from the ground up: - 3x faster startup - fully redesigned interface - computer use + virtual desktop ...and hundreds more UX and functionality improvements.

We partnered with @mercor_ai to post-train custom models on high-quality expert data from fields like law, investment banking, and consulting. Our latest model ranks #1 on the APEX-Agents leaderboard in corporate law and #4 overall. Domain-specific post-training on high-quality, organization-specific data can systematically close the gap between general AI competence and expert-level reliability, making capable enterprise agents practical and affordable for knowledge-intensive industries. appliedcompute.com/case-studies/m…

Scaling Data leads to SOTA Legal Performance on APEX-Agents @appliedcompute built a custom model (Applied Compute: Small) by post-training GLM 4.7 on nearly 2,000 samples provided by Mercor. It is now top of the APEX-Agents leaderboard in corporate law, with a Pass@1 score of 26.6% and a mean score of 54.8%. Here’s what we learnt 👇


Excited to finally share this! It was an amazing collaboration with @andyfang and the @DoorDash team! We’re thrilled to continue partnering with one of the most innovative and execution-focused AI teams in the world.

We partnered with @DoorDash to train a proprietary RL-powered agent that encodes internal QA standards into an automated grader, turning expert judgment into a scalable training signal. The result: a 30% relative reduction in critical menu errors and a production system now live across all US menu traffic. appliedcompute.com/case-studies/d…

We partnered with @DoorDash to train a proprietary RL-powered agent that encodes internal QA standards into an automated grader, turning expert judgment into a scalable training signal. The result: a 30% relative reduction in critical menu errors and a production system now live across all US menu traffic. appliedcompute.com/case-studies/d…


We partnered with @DoorDash to train a proprietary RL-powered agent that encodes internal QA standards into an automated grader, turning expert judgment into a scalable training signal. The result: a 30% relative reduction in critical menu errors and a production system now live across all US menu traffic. appliedcompute.com/case-studies/d…

.@appliedcompute improved 19% on Corporate Law tasks in APEX Agents. Their model traverses data rooms with hundreds of files to prepare complex legal deliverables. This level of model improvement with just 1000 tasks is incredible and just the beginning. Great work @ypatil125, @rhythmrg, and @lindensli

.@appliedcompute improved 19% on Corporate Law tasks in APEX Agents. Their model traverses data rooms with hundreds of files to prepare complex legal deliverables. This level of model improvement with just 1000 tasks is incredible and just the beginning. Great work @ypatil125, @rhythmrg, and @lindensli


.@appliedcompute improved 19% on Corporate Law tasks in APEX Agents. Their model traverses data rooms with hundreds of files to prepare complex legal deliverables. This level of model improvement with just 1000 tasks is incredible and just the beginning. Great work @ypatil125, @rhythmrg, and @lindensli



Announcing Flapping Airplanes! We’ve raised $180M from GV, Sequoia, and Index to assemble a new guard in AI: one that imagines a world where models can think at human level without ingesting half the internet.


