sri retweetledi
sri
17 posts

sri
@krakqn
@citsecurities, research @togethercompute // prev @apple, research @nvidia
San Francisco Katılım Kasım 2020
746 Takip Edilen161 Takipçiler

Generalists are useful, but it’s not enough to be smart.
Advances come from specialists, whether human or machine.
To have an edge, agents need specific expertise, within specific companies, built on models trained on specific data.
We call this Specific Intelligence.
It's what we're building at Applied Compute.
We unlock the latent knowledge inside a company, use it to train custom models, and deploy an in-house agent workforce that reports to your team.
We work with sophisticated companies that have already captured early gains from general models, like @cognition, @DoorDash, and @mercor_ai. They’re pulling even further ahead with proprietary in-house agents that don’t need to wait for the next public model release.
Together, we are building and validating models and agents in days instead of months, achieving state-of-the-art performance on customer evals.
Our team has high density and low latency. Our founders all worked on different parts of this problem while they were researchers at OpenAI — @ypatil125 as a key member on the agentic software engineer effort (Codex), @rhythmrg as a core contributor to the first RL-trained reasoning model (o1), and @lindensli as a core contributor on ML systems and infrastructure for RL training.
Two-thirds of the team are former founders, and everyone brings a deep technical background, from top AI researchers to Math Olympiad winners.
We are backed by $80M in funding from Benchmark, Sequoia, Lux, Elad Gil, Victor Lazarte, Omri Casspi, and others. With their support, we are growing the team, scaling deployments, and bringing to market the first generation of agent workforces built on specific models.
In short:
1. We are building Specific Intelligence for specific work at specific companies.
2. That will power in-house agent workforces to support their human bosses.
3. That in turn will unlock AI’s full potential through humanity’s greatest engine of progress: thriving corporations in a free market.

English
sri retweetledi

Reinforcement Learning (RL) has long been the dominant method for fine-tuning, powering many state-of-the-art LLMs. Methods like PPO and GRPO explore in action space. But can we instead explore directly in parameter space? YES we can. We propose a scalable framework for full-parameter fine-tuning using Evolution Strategies (ES).
By skipping gradients and optimizing directly in parameter space, ES achieves more accurate, efficient, and stable fine-tuning.
Paper: arxiv.org/pdf/2509.24372
Code: github.com/VsonicV/es-fin…
English
sri retweetledi

LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.
thinkingmachines.ai/blog/lora/

English
sri retweetledi

Deepmind won the moment LLMs became about RL.
Oriol Vinyals@OriolVinyalsML
Ahead of I/O, we’re releasing an updated Gemini 2.5 Pro! It’s now #1 on WebDevArena leaderboard, breaking the 1400 ELO barrier! 🥇 Our most advanced coding model yet, with stronger performance on code transformation & editing. Excited to build drastic agents on top of this!
English
sri retweetledi

@elissaben70 hi, I know you’ll make it through! just stay strong and keep your head up, and show cancer who’s boss on behalf of all of us. cancer won way too many times- it’s not about to happen again. you got this!
English



