Ryan Wolf

59 posts

Ryan Wolf banner
Ryan Wolf

Ryan Wolf

@ryantwolf

AI @figure_robot. ex-NVIDIA. UIUC grad.

Sumali Temmuz 2016
89 Sinusundan160 Mga Tagasunod
Ryan Wolf nag-retweet
Brett Adcock
Brett Adcock@adcock_brett·
In the last 120 days, Figure scaled manufacturing 24x - from 1 robot/day to 1 robot/hour We will manufacture 55 humanoid robots this week
English
332
405
4.3K
438.6K
Ryan Wolf nag-retweet
Brett Adcock
Brett Adcock@adcock_brett·
ZXX
164
222
2.5K
308.6K
Ryan Wolf nag-retweet
Brett Adcock
Brett Adcock@adcock_brett·
So proud to see F.03 make history as the first humanoid robot in the White House 🤖 🇺🇸
English
878
808
7.6K
1.3M
Ryan Wolf nag-retweet
Brett Adcock
Brett Adcock@adcock_brett·
Today, Figure is showing another major milestone towards a robot in every home Running Helix 02, cleaning a living room fully autonomously
English
391
546
4.2K
678.9K
Ryan Wolf
Ryan Wolf@ryantwolf·
This should happen more often. A funny idea would be adding new tasks every time a model gets the top score to deter benchmark-hacking. The ultimate moving goalpost.
Artificial Analysis@ArtificialAnlys

New year, new Artificial Analysis Intelligence Index! Announcing Intelligence Index v4.0: incorporating 3 new evaluations, further aligning to real-word use and reducing saturation The Artificial Analysis Intelligence Index is our synthesis metric for assessing generalist model intelligence and tracking AI progress. We like nuance and breakdowns at Artificial Analysis (and you’ll find plenty of that in the thread below!) but when you want a single number, this is the best one. Artificial Analysis Intelligence Index v4.0 is: ➤ Less saturated: Top models score ≤50 in v4.0 compared to 73 in v3.0. The ten evals still cover a range of difficulties to allow differentiation from small models to frontier models ➤ More agentic: GDPval-AA, our leading metric for general agentic performance, joins Terminal-Bench Hard and Tau2 Telecom ➤ A multi-dimensional view of intelligence: Index V4.0 has four equally weighted categories: Agents, Coding, Scientific Reasoning and General Changes: ➤ Added in v4.0: AA-Omniscience, GDPval-AA, CritPT ➤ Removed in v4.0: MMLU-Pro, AIME 2025 and LiveCodeBench Overview of the three new evals: ➤ AA-Omniscience tests models on knowledge and hallucination across >40 topics ➤ GDPval-AA is our generalist agentic performance eval - testing models on real-world economically valuable tasks leveraging OpenAI’s GDPval dataset, via our reference agent called Stirrup ➤ CritPt tests performance on hard research-level physics research tasks, including condensed matter, quantum physics, and astrophysics Results: OpenAI's GPT-5.2 with xhigh reasoning effort leads the Artificial Analysis Intelligence Index v4.0, followed closely by Anthropic's Claude Opus 4.5 then Google's Gemini 3 Pro. As always, all evaluations have been run independently and using a standardized methodology. We publish our methodology on the Artificial Analysis website. See below for further analysis.

English
0
0
0
179
Ryan Wolf nag-retweet
Brett Adcock
Brett Adcock@adcock_brett·
Figure 03 coming 10/9
English
508
730
7.8K
1.7M
Sahil Jain
Sahil Jain@SahilJain314·
Excited to share that I've joined @xai to work on RL! It's been an amazing few weeks so far and I'm super thrilled to work on pushing the frontier 📈
Sahil Jain tweet media
English
90
60
2K
119.2K
Ryan Wolf nag-retweet
Brett Adcock
Brett Adcock@adcock_brett·
Big news: Figure has exceeded $1B in funding at a $39B post-money valuation
English
375
480
5.9K
1.6M