Harvey

343 posts

Harvey banner
Harvey

Harvey

@harvey

AI for the world’s most complex legal work.

انضم Mart 2023
3 يتبع12.5K المتابعون
Harvey
Harvey@harvey·
Managed agent platforms are now emerging from frontier labs and cloud providers. So why build our own cloud agent platform at Harvey? Because legal agents have three hard requirements: model flexibility, zero data retention, and cost control. 1) Model flexibility. Lawyers can’t be locked into a single model provider. Client conflicts and confidentiality requirements can make a single model off-limits for a matter. Instead, Harvey routes to the best model for a task. Our recent Legal Agent Benchmark (LAB) shows that different frontier models excel in different legal practice areas. The optimal choice is a mix of models across different types of legal work. 2) Zero data retention. Lawyers work with client data that is often privileged, confidential, and subject to strict contractual controls. Zero data retention means that data is never written to persistent storage by the model provider or agent runtime. The major managed agent platforms do not offer ZDR, so we had to build our own. 3) Cost control. Legal agents run over large corpora and can involve many rounds of retrieval, reasoning, and tool use in a single workflow. If every step goes to the largest frontier model, cost runs out of control. Owning the runtime lets Harvey match model capability to task complexity, routing to open-source models when it makes sense. Empirically, we see 3-5x cost reductions versus a frontier-only approach. More from our cofounder @gabepereyra:
Harvey tweet mediaHarvey tweet mediaHarvey tweet mediaHarvey tweet media
Gabe Pereyra@gabepereyra

x.com/i/article/2061…

English
0
2
57
10.2K
Harvey
Harvey@harvey·
We’re partnering with Tirant lo Blanch, the leading legal publisher for Spain, Portugal, and 20 Latin American countries. Tirant’s database of legislation, case law, and more than 24,000 legal forms is now available in Harvey.
Harvey tweet media
English
1
0
22
2K
Harvey أُعيد تغريده
NVIDIA AI
NVIDIA AI@NVIDIAAI·
This is a great read on post-training and open models. @harvey & @trajectorylabs post-trained Nemotron 3 Super on complex legal tasks with some very impressive initial results. All with auditable weights, real security, and clear provenance.
Harvey@harvey

We're partnering with @trajectorylabs to bring sovereign continual learning to legal AI with NVIDIA Nemotron models. Continual learning allows agents to improve over time from feedback on their work: every redline refines the next draft. Open-weight models offer full auditability and data sovereignty over legal agents. Using Trajectory's platform, we post-trained NVIDIA Nemotron 3 Super on our Legal Agent Benchmark (LAB), measuring performance on 1,200+ complex end-to-end legal tasks across 24 practice areas. Initial results show that a post-trained Nemotron 3 Super can match performance of closed-source frontier models. This is just the start: we'll keep pushing the frontier with the more powerful Nemotron 3 Ultra when available.

English
16
36
254
28.8K
Harvey
Harvey@harvey·
We're partnering with @trajectorylabs to bring sovereign continual learning to legal AI with NVIDIA Nemotron models. Continual learning allows agents to improve over time from feedback on their work: every redline refines the next draft. Open-weight models offer full auditability and data sovereignty over legal agents. Using Trajectory's platform, we post-trained NVIDIA Nemotron 3 Super on our Legal Agent Benchmark (LAB), measuring performance on 1,200+ complex end-to-end legal tasks across 24 practice areas. Initial results show that a post-trained Nemotron 3 Super can match performance of closed-source frontier models. This is just the start: we'll keep pushing the frontier with the more powerful Nemotron 3 Ultra when available.
Harvey tweet mediaHarvey tweet mediaHarvey tweet mediaHarvey tweet media
Trajectory@trajectorylabs

Welcome to Day 2. Yesterday, we showed the broader work we're doing with the pioneers of continual learning. Today we'd like to deep dive on one: how we post-trained an open model for legal work, in partnership with @Harvey. We've built a platform where production data is the moat. Every correction, retry, and edit becomes signal you can post-train on, and the models are plug and play: customer's can drop in their model of choice, and improve from there. Fields like legal and finance make those demands absolute, with hard security, sovereignty, and provenance requirements. That's why we post-trained @nvidia 's open-weight Nemotron 3 Super, on Harvey's LAB benchmark. The results, in just hours: post-trained Nemotron 3 Super approaches the closed frontier, matches GPT 5.5, lifts rubric-pass criteria +25%, all while beating the performance-vs-cost frontier. That's the power of our platform. And this is just a glimpse towards what the future of intelligence will look like: continual learning, where products get smarter every time they're used. Thanks to @nikogrupen, @gabepereyra, @ItsJulioPereyra, and the whole Harvey team for their collaboration on this. Much more to come soon on continually learning legal agents

English
8
22
200
54.4K
Harvey أُعيد تغريده
Niko
Niko@nikogrupen·
@AnthropicAI launched Claude Opus 4.8 today. Legal Agent Benchmark is the newest knowledge work benchmark included in their public model card -- awesome to see frontier labs making legal intelligence first-class! Check out the results below:
Niko tweet media
Claude@claudeai

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

English
2
8
25
3.3K
Harvey
Harvey@harvey·
Now live in Harvey: Claude Opus 4.8. Opus 4.8 scored 10.4% on our Legal Agent Benchmark (LAB), which measures end-to-end completion of complex legal tasks across 24 practice areas. Opus 4.8 is the first frontier model to break 10% on our all-pass standard.
Harvey tweet media
English
3
8
89
5K
Harvey
Harvey@harvey·
Harvey Mobile is now generally available on Android. With Harvey Mobile, legal teams can securely access relevant context, capture new information in real time, and stay up to speed wherever work takes them.
English
1
2
23
2.7K
Harvey
Harvey@harvey·
We partnered with @baseten to fine-tune an open-source model in a lightweight agent harness on our LAB legal agent benchmark. It matched frontier performance on complex legal tasks - with the speed and economics of open-weight inference. We're excited to share Part One of these results, with future work focused on more advanced training techniques. The future is multi-model.
Harvey tweet mediaHarvey tweet mediaHarvey tweet mediaHarvey tweet media
Gabe Pereyra@gabepereyra

x.com/i/article/2059…

English
3
5
79
11.7K
Harvey
Harvey@harvey·
We're proud to partner with @trajectorylabs. Legal AI should learn from the work lawyers already do: every redline should improve the next draft. With Trajectory, we’re bringing continual learning to Harvey’s agents.
Harvey tweet media
Ronak Malde@rronak_

Today, @MichaelElabd, @QuantumArjun, and I are excited to announce Trajectory. We are a research lab and product company building the platform for Continual Learning. Our platform unlocks the signal already sitting in product usage, so companies can continuously post-train large-scale agentic models that outperform the frontier. @trajectorylabs We’ve raised $15M from @Conviction, @BessemerVP, @radicalvcfund, @jeffdean, @drfeifei and more. We’re partnering with some of the best AI-native companies: @ClayRunHQ @Harvey, @DecagonAI, @mercor_ai, @RogoAI to power their agentic systems, some of which we are already in production with. We’ve brought together a world class research team from DeepMind, OpenAI, Apple, Meta Superintelligence, Amazon AGI, Scale AI, and an elite product team from Stripe and Figma. AI will never again start on day one. Every correction, every retry, every edit will make products smarter. This is Continual Learning.

English
10
4
71
7.7K
Harvey
Harvey@harvey·
Lawyers live in their inbox. Now Harvey does too. Email ask@harvey.ai.
English
3
7
116
16.5K
Harvey
Harvey@harvey·
We additionally analyzed model behavior over the course of their work, and found common patterns that affect end-to-end legal performance. Models that spent substantial time verifying and revising their work performed best on LAB's task suite.
Harvey tweet mediaHarvey tweet media
English
1
0
7
558
Harvey
Harvey@harvey·
We evaluated frontier models on LAB, our long-horizon legal agent benchmark. Three findings stood out: 1) Legal work is far from saturated by frontier models. 2) Model performance varies sharply by practice area. 3) Cost and latency rise at the frontier. Read more:
Harvey tweet mediaHarvey tweet mediaHarvey tweet media
Gabe Pereyra@gabepereyra

x.com/i/article/2059…

English
5
10
76
18K