David Luan
@jluan
551 posts

retired!! former cofounder of @adeptailabs, vp engineering @openai and @amazon, and @google LLMs lead. all about type II fun.

beep beep boop boop · Joined March 2009
1.1K Following · 12.3K Followers
David Luan @jluan
@dchaplot devendra you are one of the GOATs in our field and I cannot wait to see what you cook up!
0 replies · 1 repost · 2 likes · 603 views
Devendra Chaplot @dchaplot
I'm joining SpaceX and xAI, working closely with Elon and team to build superintelligence.

Together SpaceX and xAI combine physical and digital intelligence under a leader who understands hardware at the deepest level. Add a high-agency culture with frontier-scale resources, and you get the possibility to achieve something truly unique.

I’m excited to advance the fields I’ve obsessed over for years, from robotics research to building AI models on the founding teams of Mistral and TML. Both were extraordinary journeys with extraordinary people that shaped how I think about building intelligence from the ground up.

Grateful for everything that brought me here and can’t wait to get started.
[image attached]
3K replies · 2.2K reposts · 28.5K likes · 43M views
David Luan @jluan
apparently this is what the chinese AI ecosystem thinks our grand american AI master plan is! (this is not a joke! forwarded to me by an attendee at a real chinese ai conference!)
[image attached]
242 replies · 341 reposts · 3.6K likes · 1M views
David Luan retweeted
Karina Nguyen @karinanguyen
Excited to release PostTrainBench v1.0! This benchmark evaluates the ability of frontier AI agents to post-train language models in a simplified setting. We believe this is a first step toward tracking progress in recursive self-improvement 🧵:
43 replies · 90 reposts · 659 likes · 135.4K views
David Luan @jluan
I’ll be leaving Amazon at the end of this week to cook up something new!

Thanks to the Adept deal, I’ve spent the last ~2 years learning from @ajassy et al while leading Amazon’s agents R&D effort and our San Francisco AI lab. As a childhood EC2 Micro Instance fanboy, it was fun to speedrun launching our own tier-1 AWS service. We scaled up the Adept agent recipes, did new RL research, and shipped it to AWS customers like Hertz, 1Password, and Amazon.com itself. And it's cool to see Nova Act on top of realevals.xyz (at least for now).

There’s incredible work to be done at Amazon and I'm grateful for the opportunities to take on more here. But with AGI so close, I want to spend 100% of my time on teaching AI systems brand new capabilities. At OpenAI, I was lucky to incubate the first GPTs; at Adept, we went all-in on agents before anyone else–our tech/people now drive computer-use efforts at every major lab. I have a bet for what's next. ;)

This wasn't an easy decision, and I'm sad to leave this wonderful team. I’m grateful for the trust our execs placed in me during an important moment for Amazon and the field. I'm excited to swing at the next idea!
[image attached]
21 replies · 2 reposts · 297 likes · 31.1K views
David Luan @jluan
Reiner is perhaps the strongest chip+LLM person I have ever gotten to work with, and he and @MikeGunter_ have built just as legit of a team. I'm stoked to have been an investor in an early round. Tapeout in under a year! 😅
Reiner Pope @reinerpope

We’re building an LLM chip that delivers much higher throughput than any other chip while also achieving the lowest latency. We call it the MatX One.

The MatX One chip is based on a splittable systolic array, which has the energy and area efficiency that large systolic arrays are famous for, while also getting high utilization on smaller matrices with flexible shapes. The chip combines the low latency of SRAM-first designs with the long-context support of HBM. These elements, plus a fresh take on numerics, deliver higher throughput on LLMs than any announced system, while simultaneously matching the latency of SRAM-first designs. Higher throughput and lower latency give you smarter and faster models for your subscription dollar.

We’ve raised a $500M Series B to wrap up development and quickly scale manufacturing, with tapeout in under a year. The round was led by Jane Street, one of the most tech-savvy Wall Street firms, and Situational Awareness LP, whose founder @leopoldasch wrote the definitive memo on AGI. Participants include @sparkcapital, @danielgross and @natfriedman’s fund, @patrickc and @collision, @TriatomicCap, @HarpoonVentures, @karpathy, @dwarkesh_sp, and others. We’re also welcoming investors across the supply chain, including Marvell and Alchip.

@MikeGunter_ and I started MatX because we felt that the best chip for LLMs should be designed from first principles with a deep understanding of what LLMs need and how they will evolve. We are willing to give up on small-model performance, low-volume workloads, and even ease of programming to deliver on such a chip.

We’re now a 100-person team with people who think about everything from learning rate schedules, to Swing Modulo Scheduling, to guard/round/sticky bits, to blind-mated connections, all in the same building. If you’d like to help us architect, design, and deploy many generations of chips in large volume, consider joining us.

1 reply · 4 reposts · 63 likes · 11.9K views
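MatX hasn't published its microarchitecture, so treat the following as a toy model only: a rough sketch of why a splittable systolic array helps. One monolithic array wastes most of its lanes padding small matrices, while the same silicon split into independent sub-arrays pads far less. All dimensions and the 4-way split below are invented for illustration, not MatX's design.

```python
def utilization(array_dim: int, m: int, n: int) -> float:
    """Fraction of a square systolic array's lanes doing useful work on an m x n output."""
    tiles_m = -(-m // array_dim)   # ceil(m / array_dim): tiles needed along rows
    tiles_n = -(-n // array_dim)   # ceil(n / array_dim): tiles needed along columns
    useful = m * n
    provisioned = tiles_m * tiles_n * array_dim * array_dim
    return useful / provisioned

# Same silicon budget: one 256x256 array vs four independent 128x128 sub-arrays.
for m, n in [(256, 256), (96, 96)]:
    print(f"{m}x{n} output: monolithic {utilization(256, m, n):.0%}, "
          f"split {utilization(128, m, n):.0%}")
```

On the large matmul both configurations reach full utilization; on the small one the split configuration roughly quadruples it, which is the efficiency-versus-flexibility trade the announcement describes.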
David Luan @jluan
We're working on a really cool agent RL training recipe across a bajillion gym environments with core contributors from verl, sglang, Adept, and replay.io! Shoot me a message if you're interested :)
1 reply · 0 reposts · 12 likes · 2.1K views
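The recipe itself isn't described in the tweet; as a generic sketch of what "agent RL across many gym environments" involves, here is a minimal Gymnasium-style rollout collector. The CartPole environment and random policy are placeholders, not the team's actual setup:

```python
import gymnasium as gym

def collect_rollout(env, policy, max_steps: int = 256):
    """Roll a policy through one episode, returning (obs, action, reward) tuples."""
    obs, _ = env.reset()
    trajectory = []
    for _ in range(max_steps):
        action = policy(obs)                     # in an agent recipe: a model-chosen action
        obs, reward, terminated, truncated, _ = env.step(action)
        trajectory.append((obs, action, reward))
        if terminated or truncated:
            break
    return trajectory

env = gym.make("CartPole-v1")                    # stand-in for one of the "bajillion" gyms
rollout = collect_rollout(env, policy=lambda obs: env.action_space.sample())
env.close()
```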
David Luan @jluan
Since launch, it's been really cool to see Nova Act handle real agentic workflows for real enterprises, such as scaling out public benefits and QA testing. Knowledge work is much bigger than just chatting and coding! You can now take these agents to production. Use cases here:
Amazon Science @AmazonScience

Nova Act is now ⚡️ enterprise ready ⚡️ and we've added new capabilities to our preview to help you take your prototype to production—with 90%+ reliability across our early enterprise customer use cases!

2 replies · 1 repost · 30 likes · 8.2K views
David Luan @jluan
Stoked about the first release from our new lab: our browser use agent lets you MapReduce over the web! This early preview moves us closer to reliable agents that learn from rewards across a wide range of digital and physical environments. Love our Adept+Amazon team so much!
Amazon Science @AmazonScience

Meet Amazon Nova Act — an effortless way to build AI agents that can reliably use browsers 🧑‍💻 With our new model, compose robust steps into complex workflows; handle everything from bookings to QA testing. Getting started takes just 3 lines of code. See what Nova Act can do 🧵👇

4 replies · 7 reposts · 77 likes · 11.3K views
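The "just 3 lines of code" presumably refers to the Nova Act Python SDK. A minimal sketch in the spirit of its public quickstart; the starting page and instruction here are illustrative, and the exact API may have changed since the preview:

```python
from nova_act import NovaAct  # pip install nova-act

# Open a browser session and drive it with a natural-language step.
with NovaAct(starting_page="https://www.amazon.com") as nova:
    nova.act("search for a coffee maker and open the first result")
```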
David Luan @jluan
Incredible what happens when you bring the existing world knowledge and intuition of VLMs to the physical world... only a few more missing pieces in the recipe for AGI... really feeling the agi with this one
Google DeepMind @GoogleDeepMind

Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…

3 replies · 1 repost · 17 likes · 4.6K views
David Luan retweeted
Quinten Farmer @quintendf
I’m excited to announce Tolan, our first Embodied Companion. With no launch or press we’ve quietly hit 500,000+ downloads, over $1m in ARR, and a #1 app store category ranking. Today I’m also announcing our $10m seed round (more on that below) and sharing some of what we’ve learned building an AI companion for consumers.
[image attached]
60 replies · 51 reposts · 840 likes · 219K views
Mira Murati @miramurati
Follow us @thinkymachines for more updates over the coming weeks
Thinking Machines @thinkymachines

Today, we are excited to announce Thinking Machines Lab (thinkingmachines.ai), an artificial intelligence research and product company. We are scientists, engineers, and builders behind some of the most widely used AI products and libraries, including ChatGPT, Character.ai, PyTorch, and Mistral.

Our mission is to make artificial intelligence work for you by building a future where everyone has access to the knowledge and tools to make AI serve their unique needs. We are committed to open science through publications and code releases, while focusing on human-AI collaboration that serves diverse domains. Our approach embraces co-design of research and products to enable learning from real-world deployment and rapid iteration.

This work requires three core foundations: state-of-the-art model intelligence, high-quality infrastructure, and advanced multimodal capabilities. We are committed to building models at the frontier of capabilities to deliver on this promise.

If you’re interested in joining our team, consider applying here: 6wajk07p.paperform.co

123 replies · 107 reposts · 2.3K likes · 470.2K views
David Luan retweeted
Reiner Pope @reinerpope
1. Breakdown of DeepSeek V3 efficiency vs Llama 3:
- Better: 11x fewer FLOPs per token, thanks to MoE [37B vs 405B activated params]
- Better: 2x faster numerics [fp8 vs bf16 training]
- Worse: 0.5x flops utilization [16% vs 33% end-to-end MFU*]
- Neutral: similar hardware platform [H800 and H100 both have 2Pflops/s dense fp8]
- Neutral: same training data volume [14.8T vs 15T tokens]

Llama 3’s design was obviously and intentionally conservative: dense model (not MoE), bf16 training (not fp8), GQA attention (not cheaper alternatives). DeepSeek benefited by being aggressive on all these fronts, at the cost of being later to market.

2. The core algorithmic improvements were already known; the closed source LLM labs were probably already doing similar things. DeepSeek’s improvements are real, but far more modest than the Llama comparison would suggest; my wild guess is closer to 1.5x improvement. MoE was published in 2017; in 2021 Switch Transformer reported 7x speedups vs dense models, similar to DeepSeek’s 11x. OpenAI is widely rumored to have been using MoE models for years. NVIDIA published their fp8 training paper in 2022.

3. NVIDIA’s stock price is down 15% after DeepSeek. Should it be? LLM compute is like a gas: it expands to fill the available budget. Over the last 3 years the labs have grown their budgets, despite algorithms and hardware improving. There’s no reason to expect this to change now: you win by making the best model, not by shrinking your budget.

The more meaningful question: do algorithmic improvements like DeepSeek’s mean that margins will shift from hardware vendors to labs? Hard to see why. Algorithmic improvements are quickly copied from one lab to another, making it hard for them to maintain technological differentiation. Hardware improvements take much longer to copy.
7 replies · 41 reposts · 384 likes · 68.1K views
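Treating the thread's factors as independent multipliers (a simplification, since they interact in practice) makes the headline numbers easy to sanity-check:

```python
# Per-factor ratios from the thread, DeepSeek V3 relative to Llama 3 405B.
flops_per_token = 405 / 37   # ~11x fewer activated params per token (MoE)
numerics = 2.0               # fp8 vs bf16 training throughput
mfu = 16 / 33                # ~0.5x end-to-end model FLOPs utilization
hardware = 1.0               # H800 vs H100: similar dense fp8 peak
data = 1.0                   # 14.8T vs 15T training tokens

speedup = flops_per_token * numerics * mfu * hardware * data
print(f"~{speedup:.1f}x less compute for a comparable run")  # roughly 11 * 2 * 0.5
```

The MoE sparsity term dominates; the numerics gain and the utilization loss roughly cancel each other out.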
Ansh Arora @Ansh7xpex
@jluan @pabbeel DMs closed. Can you open them up for the general accounts please?
1 reply · 0 reposts · 0 likes · 672 views
David Luan retweeted
Nathan Benaich @nathanbenaich
new sf ai lab at amazon led by @jluan (ex-adept) and @pabbeel (ex-covariant/berkeley)! exciting to see :)
David Luan @jluan

!! @pabbeel and I are building a new AI research lab in SF for Amazon! We’re focused on the remaining major problems to build generally intelligent agents and are looking for a few dozen intrinsically motivated people to join our team and work with the Adept folks here. DM me!

1 reply · 1 repost · 22 likes · 8.3K views
David Luan @jluan
@pabbeel It’s been cool to get the space to do longer-term R&D here, and we’re interested in problems ranging from combining LLMs with RL to world models for computer agents!
1 reply · 0 reposts · 20 likes · 3K views
David Luan @jluan
@pabbeel By putting together a super talent-dense team focused on a small number of targeted research bets, we’re able to maximize the amount of compute per capita -- aka do more wild experiments faster.
3 replies · 2 reposts · 37 likes · 8.9K views
David Luan @jluan
@shaneguML @ctnzr and I got kicked out of Jensen's keynote at GTC this year because it was too full!
1 reply · 0 reposts · 1 like · 365 views
Shane Gu @shaneguML
My favorite NeurIPS moment was seeing Rich Sutton get locked outside the first deep RL workshop in 2015 by security guards. A complete failure of credit assignment.
2 replies · 7 reposts · 184 likes · 18.7K views