
anand iyer
17.6K posts

@ai
Managing partner Canonical · Venture Partner Lightspeed · Father · Husband · Few-shotting tech, open source


Harmonic drives. Servo motors. Rare earth magnets. Strain-wave gears. Exoskeletons worn by workers in Chinese factories repeating the same motion hundreds of times a day so a robot somewhere can learn what a hand already knows. Forty state-funded sites. Local governments providing space rent-free. New essay on why the AI competition is expanding. Link in reply.


I’m extremely excited to announce that @try_glimpse has raised a $35M Series A led by @a16z, with continued participation from @8vc & @ycombinator, bringing our total raised to $52M.

At Glimpse, we’re building the AI-native infrastructure for CPG & retail brands. We started by automating the deductions workflow - recovering millions of dollars back into brands’ P&Ls & saving them dozens of hours every week. Along the way, we’ve also built the CPG data layer, which lets us keep expanding into more manual workflows that can be automated. We’re giving CPG brands real operating leverage in the world of AI.

To our 200+ customers, including PLTFRM, Evermark, IQ Bar, Alice Mushrooms, and more: thank you for your faith in us. We have more fuel now to keep supporting & scaling with you all.

We’re hiring - come join us! We’re just getting started.

Scientists discuss whether AI could surpass human contributions to physics by 2035 physicsworld.com/a/is-vibe-phys…

SoftBank is working to build a massive AI data center on federally owned land in Ohio, which it plans to power with roughly $33 billion worth of natural gas-fired electricity to be installed by the end of the decade bloomberg.com/news/articles/…

“If your $500K engineer isn’t burning at least $250K in tokens, something is wrong.”

Today, we’re releasing Migas 1.5: the first foundation model to fuse text and time series. Until now, forecasting models have only looked at historical numbers. Migas 1.5 changes that by letting users incorporate real-world context directly into the forecast. This enables teams to forecast with essential context like earnings reports, policy changes, market events, supply shocks, and more. The result is more accurate forecasting and support for complex scenario analysis, especially when historical data is sparse.

Highlights:
- Highest Elo rating against leading foundation models on 86 real-world datasets
- 75%+ win rate against all baselines (even Migas 1.0!)
- Up to 14.2% lower MAE in short-context forecasting
- Fully open source
- Premade Claude skill to get you started in seconds

We’re excited to open-source Migas 1.5 and eager to see what the community builds with it. Links in comments.

@hausman_k is the co-founder and CEO of @physical_int, a robotics company building a general-purpose “AI brain for the physical world.” The company has raised more than $1 billion in funding to develop foundation models that allow robots to operate across many machines, environments, and tasks rather than being programmed for a single purpose.

In our conversation, we explore:
• The moment a lecture from Sergey Levine convinced him to abandon his PhD research direction and pivot fully to deep learning
• The case for building a general “AI brain” for the physical world rather than a single specialized robot
• The role of real-world data in training robots, the limits of simulation, and how deployment could create a powerful data flywheel
• The unique challenges of physical intelligence and why robots must operate with far higher reliability than language models

Thank you to the partners who make this possible:
- @brexHQ: The intelligent finance platform: brex.com/mario
- @meetgranola: The app that might actually make you love meetings: granola.ai/mario

Timestamps
(00:00) Intro
(04:05) Karol’s early fascination with robots
(18:21) Karol’s entry point to robotics and PhD program
(25:49) Combining robotics with LLMs: The Taylor Swift demo
(30:48) The 1970s SHRDLU AI experiment
(39:40) How research shapes what Physical Intelligence builds
(49:07) The return of reinforcement learning in robotics
(1:00:00) NVIDIA’s simulation engines
(1:07:31) Compensating for missing senses

Design Conductor: an AI agent that can build a RISC-V CPU core from design specs. The agent is given access to a RISC-V ISA simulator and manuals... to enable an end-to-end verification-driven generation. The most important thing for design intelligence is a verifier 😎

I’m looking at 4 key dimensions:
1. Output efficiency = output tokens / total tokens
2. Context amplification = cache reads / cache writes
3. Iteration cost = tokens per agent step
4. Tokens per task

The key insight: LLM cost scales with context size × iterations, not output length. Most token usage isn’t generation. It’s context reuse. Put another way: the model isn’t expensive because it talks a lot. It’s expensive because it repeatedly rereads the same context.

Here is a simple example from our dataset:
- 17.1B cache reads
- 1.36B cache writes
- Amplification ≈ 12.6×

Meaning each prompt is reused ~12 times. The result: output efficiency was only ~0.8%. In other words, less than 1% of tokens were actual model output. The rest were context movement.

The biggest inefficiency we found: large repo contexts + long agent loops.

Example:
120k token repo context × 50 iterations = 6M tokens
But if you reduce context to 40k: 40k × 50 = 2M tokens
67% savings immediately.
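The metrics above are straightforward ratios; here they are as code, using the post's own numbers (function names are mine):

```python
# Token-economics metrics from the post, as plain arithmetic.

def context_amplification(cache_reads: float, cache_writes: float) -> float:
    """How many times, on average, each cached prompt token is reread."""
    return cache_reads / cache_writes

def output_efficiency(output_tokens: float, total_tokens: float) -> float:
    """Fraction of all tokens that are actual model output."""
    return output_tokens / total_tokens

def loop_cost(context_tokens: int, iterations: int) -> int:
    """Tokens consumed by an agent loop that rereads the full
    context every step: cost ~ context size x iterations."""
    return context_tokens * iterations

# Dataset figures: 17.1B cache reads vs 1.36B cache writes
amp = context_amplification(17.1e9, 1.36e9)
print(f"amplification: {amp:.1f}x")           # -> ~12.6x

# Repo-context example: 120k tokens x 50 steps vs 40k x 50 steps
full = loop_cost(120_000, 50)    # 6,000,000 tokens
trimmed = loop_cost(40_000, 50)  # 2,000,000 tokens
print(f"savings: {1 - trimmed / full:.0%}")   # -> 67%
```

Trimming context attacks the multiplicative term, which is why it dominates any attempt to make the model "talk less."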

Companies that now regularly use artificial intelligence are starting to track their workers’ use of tokens, AI’s unit of measurement on.wsj.com/473nAnZ

Agents are the future. Inspired by autoresearch and arbos, we released our deepfake research training toolkit, DFResearch: github.com/BitMind-AI/dfr… Experiment autonomously to train the best deepfake detection models. It’s integrated to download data, outputs submission-ready results, and includes a full guide to adding custom models and datasets.



Robotics lacks infrastructure, not intelligence. Everyone wants to build bigger robot models, but most Physical AI papers complain about the same things: data collection is slow, sim-to-real is fragile, teleop is painful, evaluation is messy, long-horizon control still breaks.

Went to @DvijKalaria's lab @berkeley_ai and played ping pong against his robot, Oreo. I'd played a ton of ping pong as a kid. This felt appropriately surreal, one of those "I wish I could tell my high school self about this" moments.

Table tennis is one of the harder sports for robots to play. The ball can move at 30+ mph with heavy spin, the human opponent's intent is hidden, and the whole body has to coordinate. Oreo is a full humanoid holding a real paddle, and it learned key motions like swings by watching Dvij demonstrate. No robot-collected training data. One person shows the motion, the policy generalizes.

The way it works, as I understood it:
- A smart system (a hierarchical planner) first figures out where the ball is going to fly and picks the best type of hit, like a forehand or backhand swing.
- This plan then helps train the robot's "brain" (an RL policy) in a virtual simulation. The brain learns by trial and error, getting rewards when it mimics a few example moves.
- Once trained in the sim, the whole setup gets applied to the actual physical robot so it can play for real.

The human demonstrations are essentially the reference motions. They are building a robot that has watched more human table tennis than any human has, and uses that to develop its own game.

I still won. (Barely. But that won't last.)
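The two-level structure described in that post can be sketched in a few lines. This is my reading of the architecture, not the lab's actual code: a high-level planner predicts the ball and picks a stroke, while a low-level RL policy is trained in simulation with an imitation-style reward that tracks the human demonstration. All names and numbers here are illustrative.

```python
import math

def plan_stroke(ball_x: float, ball_vx: float) -> str:
    """High-level planner: crudely predict where the ball crosses the
    robot's side (constant velocity, 0.5s horizon) and pick a stroke.
    A real planner would also model spin, bounce, and timing."""
    landing_x = ball_x + ball_vx * 0.5
    return "forehand" if landing_x >= 0 else "backhand"

def imitation_reward(joint_angles: list[float], demo_angles: list[float]) -> float:
    """Low-level RL reward: higher (less negative) when the policy's
    motion tracks the human demonstration, the 'reference motion'."""
    err = math.sqrt(sum((a - d) ** 2 for a, d in zip(joint_angles, demo_angles)))
    return -err  # RL maximizes reward, so penalize tracking error

print(plan_stroke(ball_x=-0.2, ball_vx=0.8))  # lands at +0.2 -> forehand
```

The split matters: the planner handles the fast, geometric part of the problem, and the learned policy only has to solve "execute this stroke like the demo," which is what lets a handful of human demonstrations go a long way.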



