David McAllister
173 posts

David McAllister
@davidrmcall
PhD Student @berkeley_ai


FPO++! We got RL on flow policies working on real robot tasks. Sim2real on humanoids trained from scratch + manipulation finetuning in sim with action chunking. Excited about this direction because we can now use RL with expressive policies to discover new behaviors!


Reve's new text-to-image model is here. Really proud of the team to rank at #3 with the big labs. To my knowledge, we are the first lab to use native pixel space diffusion without latent autoencoder at 4k (16MP) resolution for production level image generation.


New project! Flow Policy Gradients for Robot Control tldr; a simple online RL recipe for training and fine-tuning flow policies for robots co-led w/ @redstone_hong: hongsukchoi.github.io/fpo-control








Introducing Terminal Velocity Matching: a scalable, single-stage generative training method that delivers diffusion-level quality with a 25× fewer inference steps, now trained at 10B+ scale. lumalabs.ai/blog/engineeri…




We are looking for contributors for World Model Post-Training of foundational video models at Meta @AIatMeta! We are looking for talent with expertise in RL post-training, distillation, attention sparsification, diffusion model, and more to hop onboard. Candidates at all career levels are welcomed, whether students or not. We have immediate and flexible start dates for contractor positions. Onsite collaboration is possible in Zurich 🇨🇭, London 🇬🇧, or New York 🇺🇸. If you’re driven about advancing interactive spatial intelligence, we are here to talk - feel free to DM me and @ethanjohnweber.


one of the best moments at BAIR lab actually never imagined to spot prof sergey levine from physical intelligence on a random day




