Applied Compute

153 posts

Applied Compute

@appliedcompute

We build Specific Intelligence for enterprises.

San Francisco Katılım Temmuz 2012

18 Takip Edilen4K Takipçiler

Sabitlenmiş Tweet

Applied Compute@appliedcompute·8 Nis

x.com/i/article/2041…

ZXX

144

203.1K

Applied Compute@appliedcompute·3d

English

2.1K

Applied Compute@appliedcompute·3d

Furthermore, we find that the token-level granularity in self-distillation is both a strength and a weakness – it's dense, but there is often a great deal of noise in the updates because the teacher and student may disagree on tokens for reasons unrelated to the desired behavior. We want to concentrate our loss on the tokens which matter most for the student's improvement. This is the intuition behind RMSD: we filter for a set of T token positions where the teacher and student disagree the most, then an LLM judge narrows those to a final set of S tokens that are most relevant to the target behavior. In our experiments, RMSD reaches the target behavior in about half the steps of OPSD, making it significantly more data efficient, while also training more stably and reaching a higher performance ceiling. We think that the self-distillation approach and RMSD in particular will be powerful methods for advancing the capabilities of models on specialized settings.

English

2.9K

Applied Compute@appliedcompute·3d

Enterprises increasingly want custom models that are tailored to their internal tools and processes, without sacrificing intelligence or reliability. Often times this involves tasks that are out-of-distribution for existing models: think custom document formats that aren't on the public web or company-specific legacy APIs, things that never appeared in pretraining. Existing techniques struggle here. SFT teaches new behaviors, but causes catastrophic forgetting. RL has no foothold: if the target behavior never appears, there's no reward to reinforce. We investigate self-distillation as a way to elicit these OOD behaviors, using a controlled toy task as a proxy for the enterprise-internal formats we see in practice. We find that self-distillation can: - Better elicit OOD behaviors compared to SFT and RL - Better preserve the model's existing capabilities

English

2.9K

Applied Compute@appliedcompute·3d

Some enterprise tasks are challenging to hill-climb with RL-based methods since they involve very out-of-distribution behavior. On-policy self-distillation (OPSD) gives a model learning signal for every token it writes, far richer than the single scalar reward of RL. But that channel is noisy: most tokens don't reflect the behavior you're after. We introduce Relevance-Masked Self-Distillation (RMSD), which uses a two-step filtered loss mask to cut through the noise and find the tokens with the highest signal. Compared to OPSD it trains more stably, provides higher data efficiency, and reaches a higher performance ceiling.

English

296

82.6K

Applied Compute@appliedcompute·5d

Our team of former founders and researchers all share one belief, that the next decade of AI will be defined by how deeply it shapes the work of the world's most important businesses, and we're building the systems that make that real. Join us. jobs.ashbyhq.com/Applied%20Comp…

English

354

Applied Compute@appliedcompute·5d

"We want to be the bridge between frontier research and real-world problems." Our researchers interface with customers and go deep on real problems. They connect powerful models with things that actually move the business.

English

605

Applied Compute@appliedcompute·5d

Frontier models set the floor. Specialized models raise the ceiling. We provide Specific intelligence: AI for one company. As foundation models continue to improve at such a rapid pace, what becomes most important is retaining your competitive edge.

English

Applied Compute@appliedcompute·5d

The companies that compound advantages from their own data and workflows are the ones who win the next decade. These are the systems we build at Applied Compute with partners like @modal.

Modal@modal

Frontier models set the floor. Specialized models raise the ceiling. With Modal, @AppliedCompute is training custom agent workforces for companies like DoorDash, Mercor, and Cognition.

English

4.1K

Applied Compute@appliedcompute·5d

"The reason it's so hard to train these systems in legal is because you don't have open source repositories. We were finally able to generate synthetic client matter data that looks super realistic. The next step is taking private data and working with an individual client to further train that system." More from @gabepereyra on how @harvey is unlocking the path from evals to client-specific models at our Private Frontier all-hands.

English

13.3K

Applied Compute retweetledi

Nasdaq Exchange@NasdaqExchange·6d

“We’re seeing a ton of demand from companies who want to own their intelligence and not rent it.” In partnership with @Wing_VC's 2026 Enterprise Tech 30, @ypatil125, Co-Founder & CEO of @AppliedCompute, joins @Nasdaq to explore the enterprise infrastructure shift their remarkable trajectory. Watch the full interview here: spr.ly/6016BBvNNg

English

2.8K

Applied Compute@appliedcompute·19 May

"Every company is going to build their own frontier AI unique to their secret sauce. That's exactly what the top law firms do. We're starting to talk with a law firm and their client and ask: how could we build you a joint model?" Thanks @gabepereyra and @harvey for joining us at our Private Frontier all-hands to share how Harvey is defining the future of Specific Intelligence for legal agents.

English

19.4K

Applied Compute retweetledi

Yash Patil@ypatil125·16 May

The real power of forward deployed engineering has always been putting strong technical people directly alongside the operators who own the outcome. That proximity forces the work to solve the actual problem instead of some sanitized version of it. In the AI era this principle has become even more valuable. Agents can now sit inside real workflows and improve from actual decisions, which means the highest-leverage work is extracting the tacit knowledge that lives with subject matter experts, building evaluations that reflect how things actually break, and closing the production feedback loop so agents get better from real outcomes.

English

277

196.2K

Applied Compute retweetledi

Yash Patil@ypatil125·12 May

x.com/i/article/2054…

ZXX

139

25.2K

Applied Compute@appliedcompute·11 May

"We see general models as powerful tools that we use very often, but they’re setting the floor for enterprises. We help companies use their data to build specialized agents that set the ceiling." Thank you @NasdaqExchange and @Wing_VC for having us.

English

4.9K

Applied Compute retweetledi

Sam Denton@samueldenton·8 May

We used ContextEngine to double our critical learnings injected into our own coding workflows at @appliedcompute. If you're interested in applying continual learning in the wild would love to chat!

Applied Compute@appliedcompute

x.com/i/article/2052…

English

2.8K

Applied Compute@appliedcompute·8 May

x.com/i/article/2052…

ZXX

182

49.8K

Applied Compute@appliedcompute·6 May

The leap from "answer a legal question" to "navigate a real client matter and produce reviewable work product" is exactly the leap enterprise AI needs to make. LAB is the first benchmark that captures this. Huge congrats @nikogrupen, @gabepereyra, @ItsJulioPereyra and the @harvey team, proud to be partnered on it!

Gabe Pereyra@gabepereyra

x.com/i/article/2051…

English

3.1K

Applied Compute@appliedcompute·1 May

English

1.6K

Applied Compute@appliedcompute·1 May

Using the Context Engine to build a pipeline on APEX-Agents produces up to 16.9% relative improvement at fixed reasoning, with consistent gains on GDPVal. For enterprises, this turns context into a compounding asset: every production rollout makes the next one better.

English

Applied Compute@appliedcompute·1 May

Introducing the AC Context Engine: enterprise-grade infrastructure to continuously encode nuanced institutional knowledge into a living artifact (Contextbase). We find that our Contextbases can be the unlock to moving the Pareto frontier on cost and intelligence.

English

104

59K

Keşfet

@modal @gabepereyra @harvey @Wing_VC @ypatil125 @Nasdaq @NasdaqExchange @nikogrupen