Dan Kondratyuk

521 posts


@hyperparticle

Co-Founder. Prev. Research Scientist at @LumaLabsAI (Realtime Video World Models, Ray), @GoogleAI (VideoPoet). Let's automate research!

Mountain View, CA · Joined March 2015

651 Following · 2.2K Followers

Pinned Tweet

Dan Kondratyuk@hyperparticle·12 Jun

Today we are launching Dream Machine, our first AI model that generates cinematic and fluid videos from text instructions and images. I generated this 1-minute 60 fps video entirely from our model. Try Dream Machine → lumalabs.ai/dream-machine Join us → lumalabs.ai/join


415

39K

Dan Kondratyuk@hyperparticle·1h

Coding agents are great at writing new code, but pretty bad at deleting code. It's what inevitably leads to a lot of bloat over time. Deleting code is the hallmark of a great senior engineer, i.e., one who can write the least amount of code to get the job done. In my mind that's what's missing to make them robust at building good software.


Dan Kondratyuk@hyperparticle·6 May

@alex_whedon Let me guess, you're already thinking of breaking the 100M token context barrier :)


122

Alexander Whedon@alex_whedon·5 May

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), and the first frontier model with a 12 million token context window, which is:
- 52x faster than FlashAttention at 1MM tokens
- Less than 5% the cost of Opus
Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.


1.5K

2.9K

23.1K

12.7M

Dan Kondratyuk@hyperparticle·4 May

@willccbb If you make compaction differentiable, maybe you can learn the optimal compaction strategy across most tasks you care about


263

will brown@willccbb·4 May

why aren't more people studying self-compaction at artificially low context lengths. there's no reason you can't benchmaxx math RL with 4k tokens across many turns


527

50.1K

Dan Kondratyuk@hyperparticle·3 May

@JiaweiYang118 Nice result. Makes me wonder what is the "ultimate form" of loss we should be optimizing


179

Jiawei Yang@JiaweiYang118·1 May

Two months ago, I vaguely posted a number: 0.9 FID, one-step, pixel space. Now it is 0.75, and can be even lower. Many wonder how. I thought it might end as a small FID prank: simple and deliberate. It started with one question: can FID be optimized directly, and what does it reveal? Introducing FD-loss.


156

922

211.1K

Dan Kondratyuk@hyperparticle·2 May

@vai_viswanathan I like to think of world models as simulations of some environment, able to represent what comes next. Most commonly it's seen as something visual or tangible (video/3D/robotics/etc), but perhaps LLMs that simulate OS (e.g., shell envs) might also be considered as world models.


Vai Viswanathan@vai_viswanathan·1 May

@hyperparticle thanks for the read! how do you define world models?


Dan Kondratyuk@hyperparticle·1 May

When I ask people what they mean when they say they are working on "World Models", I get a very different response every time. It's always fun to see all the different perspectives.

Vai Viswanathan@vai_viswanathan

x.com/i/article/2047…


2.1K

Dan Kondratyuk@hyperparticle·1 May

As with any new software, it's still going to have some rough edges. But I put a lot of checks/manual reviews in place to make sure the code quality is to a good standard: 90+% test coverage, fully typed, docstrings that explain intent, and lots of examples.


Dan Kondratyuk@hyperparticle·1 May

Technically it did take another half day to polish things up, but the core implementation was fully operational in just a few hours. Software package: github.com/rekursiv-ai/co… Blog post: rekursiv.ai/blog/i-built-c…


109

Dan Kondratyuk@hyperparticle·1 May

I wanted to speedrun how fast I could OSS a complete Python package that solves a non-trivial, important job, and I managed to pull it off in about a day. I've never felt so productive writing software, especially complete packages. The most joy I've felt in a long time.


236

Dan Kondratyuk@hyperparticle·29 Apr

The cost of software is starting to get really cheap. What took an entire dev team months can soon be accomplished with a single determined person. I suspect we're going to start seeing a proliferation of apps with weird and crazy ideas that wouldn't have been tried until now.


170

Dan Kondratyuk@hyperparticle·16 Apr

One thought that scares me a bit: the proliferation of "AI Viruses": tiny coding agents which can break into unsecured systems, replicate themselves, and adapt/mutate over time. And like real viruses, they might hide, spread repeatedly like a botnet, and be impossible to fully eradicate.


178

Dan Kondratyuk@hyperparticle·6 Mar

Really excited to share our newest release on a unified model that does understanding and generation all in one. The team really did a great job here!

Luma@LumaLabsAI

Introducing Uni-1, Luma’s first unified understanding and generation model, our next step on the path towards unified general intelligence. lumalabs.ai/uni-1


2.2K

Dan Kondratyuk@hyperparticle·27 Nov

Our team has developed a new diffusion distillation technique which is overall much simpler and more robust than prior methods, and scales well to large model training. We are making the code and paper freely available: github.com/lumalabs/tvm

Luma@LumaLabsAI

Introducing Terminal Velocity Matching: a scalable, single-stage generative training method that delivers diffusion-level quality with 25× fewer inference steps, now trained at 10B+ scale. lumalabs.ai/blog/engineeri…


1.2K

Dan Kondratyuk@hyperparticle·4 Oct

Me and the boys


277

Dan Kondratyuk@hyperparticle·24 Sep

Duck hunt


234

Dan Kondratyuk@hyperparticle·19 Sep

@gravicle Ray2 was able to animate some pretty great anime, and we're keeping the tradition! x.com/seiiiiiiiiiiru…

SEIIIRU😈動画生成AIを使う映像クリエイター@seiiiiiiiiiiru

Experimenting with animals in Luma AI's Ray2 is fun 🐺✨


101

amit@gravicle·19 Sep

Ray2 was the best anime model. Ray3 is better.

Aiden Guo@aidenguoai

Luma Ray 3 test. The running has improved a lot from the last model. As with any AI models, there will be inconsistencies, but I think it inches ever closer to a quality anime sakuga action scene with actions that make sense.


2.7K

Dan Kondratyuk@hyperparticle·18 Sep

It took an incredible amount of energy to get here, but now we're ready to unleash Ray3, our new frontier video model with reasoning capabilities. I especially love the HDR video generations, the colors and lighting just pop in ways that make SDR look dull. Check it out!

Luma@LumaLabsAI

This is Ray3. The world’s first reasoning video model, and the first to generate studio-grade HDR. Now with an all-new Draft Mode for rapid iteration in creative workflows, and state of the art physics and consistency. Available now for free in Dream Machine.


Dan Kondratyuk@hyperparticle·7 Aug

I see they went to the Intel/Nvidia school of cooking charts

Emad@EMostaque

what in the chart crime


534

Dan Kondratyuk@hyperparticle·27 Jul

For those who have ever wondered how video generation works, this video is a fantastic look into how these models operate from a geometric perspective

Grant Sanderson@3blue1brown

New video on the details of diffusion models: youtu.be/iv-5mZ_9CPY Produced by @welchlabs, this is the first in a small series of 3b1b this summer. I enjoyed providing editorial feedback throughout the last several months, and couldn't be happier with the result.


382

Dan Kondratyuk@hyperparticle·8 Jun

@Yiheng_Li_Cal Congrats Yiheng!


105

Yiheng Li@Yiheng_Li_Cal·8 Jun

🎉Introducing Improved Immiscible Diffusion - Accelerating Diffusion Training by Reducing Its Miscibility. 🔥 Supported by detailed feature analysis, we further clarify that the miscibility problem, i.e. the mix of diffusion paths of different images during training, reduces the training efficiency. 🤔 Based on this, we design a new KNN implementation, which not only is efficient (unrelated to batch sizes) but also performs well in diverse baseline models, especially in flow matching. 🤩 We hope our miscibility problem lights the way for further improving diffusion training efficiency. ✈️ arxiv.org/abs/2505.18521


105

16.4K
