Robin Rombach
@robrombach
717 posts

Krawallkrümel. Generative Models at https://t.co/1xqMb617gc, made with ❤️
Black Forest · Joined July 2019
549 Following · 12.9K Followers

Pinned Tweet
Robin Rombach @robrombach
🔥 I am so damn excited to announce the launch of Black Forest Labs. We set ourselves on a mission to advance state-of-the-art, high-quality generative deep learning models for images and video, and make them available to the broadest audience possible. Today, we release FLUX.1
Black Forest Labs@bfl_ml

We are excited to announce the launch of Black Forest Labs. Our mission is to develop and advance state-of-the-art generative deep learning models for media and to push the boundaries of creativity, efficiency and diversity.

Lucas Beyer (bl16) @giffmana
haha love the GTC panel "black sunglasses + black dress" trend, you can see who likes having fun:
Robin Rombach reposted
Sayak Paul @RisingSayak
The @bfl_ml team released Klein KV and showed how KV-caching can be incorporated into a flow pipeline 🤯 The idea is simple and elegant: in the first denoising step, reference image tokens are included in the full DiT forward pass, and their per-layer KVs are computed and cached. In the subsequent steps, KVs are computed only for the noisy latents, while the cached reference KVs are injected when computing attention. As a result, it delivers up to 2.5x speedups for multi-reference editing tasks over Klein.

I basically learned about it from this PR: github.com/huggingface/di… The PR is poetry in motion and is from the BFL team itself! Kudos to them for always being the best when it comes to designing codebases for flow and diffusion models. The best!

Check out the model here: huggingface.co/black-forest-l…
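The caching scheme Sayak describes can be sketched in a few lines. This is a hypothetical toy, not BFL's actual code: `denoise_step`, the weight matrices, and the single-head attention are all invented for illustration. Only the idea itself, compute the reference tokens' K/V once in the first step and reuse them in every later step, comes from the post.

```python
import numpy as np

# Toy sketch of KV-caching for multi-reference editing (illustrative only).
# Step 1: reference tokens go through the full pass; their K/V are cached.
# Steps 2..N: compute K/V only for the noisy latents and prepend the cache.

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention(q, k, v):
    # Plain scaled dot-product attention, single head.
    return softmax(q @ k.T / np.sqrt(k.shape[-1])) @ v

def denoise_step(latents, ref_tokens, wq, wk, wv, cache=None):
    q = latents @ wq
    if cache is None:
        # First step: compute and cache the reference tokens' K/V once.
        cache = (ref_tokens @ wk, ref_tokens @ wv)
    ref_k, ref_v = cache
    k = np.concatenate([ref_k, latents @ wk])  # cached K + fresh latent K
    v = np.concatenate([ref_v, latents @ wv])  # cached V + fresh latent V
    return attention(q, k, v), cache

rng = np.random.default_rng(0)
d = 16
wq, wk, wv = (rng.normal(size=(d, d)) * 0.1 for _ in range(3))
latents, refs = rng.normal(size=(8, d)), rng.normal(size=(32, d))

out, cache = denoise_step(latents, refs, wq, wk, wv)        # step 1 fills the cache
out2, _ = denoise_step(out, refs, wq, wk, wv, cache=cache)  # step 2 reuses it
print(out2.shape)  # (8, 16); the reference K/V were never recomputed
```

The speedup comes from skipping the reference-token projections (and their share of the attention inputs) in every step after the first, which matters most when the reference tokens far outnumber the latents, as in multi-reference editing.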
Robin Rombach reposted
Linoy Tsaban @linoy_tsaban
My favorite editing model, FLUX.2 [klein] 9B, just got 2x faster: meet FLUX.2 [klein] 9B-KV 😍💨 > Using KV-cache optimization to reduce computation & speed up inference by up to 2.5x for multi-reference editing. Love how well it edits "around" the bullets
Seungwook Han @seungwookh
Can language models learn useful priors without ever seeing language? We pre-pre-train transformers on neural cellular automata — fully synthetic, zero language. This improves language modeling by up to 6%, speeds up convergence by 40%, and strengthens downstream reasoning. Surprisingly, it even beats pre-pre-training on natural text! Blog: hanseungwook.github.io/blog/nca-pre-p… (1/n)
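The post's key claim is that fully synthetic, language-free sequences can serve as pretraining data. The paper uses *neural* cellular automata; as a stand-in, a simple elementary cellular automaton below shows what such a zero-language token stream can look like. Everything here (`ca_step`, `synthetic_sequence`, the rule choice) is an illustrative assumption, not the paper's setup.

```python
# Illustrative only: an elementary CA (rule 110) generating structured,
# non-linguistic binary token sequences, the kind of fully synthetic data
# the post says transformers were pre-pre-trained on (theirs is a *neural* CA).

def ca_step(cells, rule=110):
    # One update of a 1D elementary cellular automaton with wraparound.
    n = len(cells)
    return [
        (rule >> (cells[(i - 1) % n] * 4 + cells[i] * 2 + cells[(i + 1) % n])) & 1
        for i in range(n)
    ]

def synthetic_sequence(seed_cells, steps):
    # Flatten successive CA states into one token stream (vocab = {0, 1}).
    cells, tokens = list(seed_cells), []
    for _ in range(steps):
        cells = ca_step(cells)
        tokens.extend(cells)
    return tokens

tokens = synthetic_sequence([0] * 15 + [1], steps=8)
print(len(tokens))  # 128 tokens, structured but containing zero language
```

Rule 110 is a convenient choice here because it produces non-trivial, non-repeating structure from a single seed cell, so the next-token prediction task on such streams is neither trivial nor random.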
Nataniel Ruiz @natanielruizg
Excited to show some surprising inventions in generative multiplayer games we made at Google with Stanford. We call the work MultiGen.

I've always been inspired by early studios like id Software with Doom or Blizzard with Warcraft bringing networked video games to the next level. We are at the point in history where we can make strides like them, but for generative games. It's a strange feeling to be in the age of generative video games while still discovering how exactly to train the models and design the tools that make them useful.

All of the tools that have been invented for classic game engines need to be redesigned for generative games. For example, level and world design is not entirely possible with existing technology. We introduce editable memory for diffusion game engines, which allows new levels to be designed via a minimap, and we can easily imagine how this can be expanded with different creation tools. The end goal of this research direction is to allow game designers to guide the generation process of their world at whatever granularity they prefer.

Editable memory also allows us to add multiplayer to Generative Doom. We were amazed when we saw GameNGen some years ago, and now you can play it live with friends in real time, on your couch or even online. Shared representations like our editable memory seem like the future for this type of experience. Models are, in some cases, expensive and approximate encoders, but great interpolators and extrapolators. Leveraging their strengths lets you have completely new experiences that can be realized now and not in the distant future.

This work was started at my previous team and continued in collaboration with Stanford. Congratulations to all for the discoveries.
Robin Rombach @robrombach
New paper out! We present a training method for multimodal generative models, called Self-Flow, which combines classic flow matching and representation learning.

Why? Unlike most representation alignment methods, our new approach does not require external, pretrained models and thus scales gracefully to joint multimodal training on images, videos and audio.

How? It combines per-timestep flow matching with dual-timestep representation learning, improving the models' internal representations. This approach outperforms prior methods and shows promising scaling behavior in multimodal pretraining. It also enables downstream applications such as action prediction for embodied AI.

webpage+paper: bfl.ai/research/self-…
code: github.com/black-forest-l…

Credit to @hila_chefer, @pess_r, Dominik, @dustin_podell, Vikash, @Vinh_Suhi and Antonio. If you enjoy doing open research like this, come and join BFL! We are actively hiring🌲
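To make "per-timestep flow matching with dual-timestep representation learning" concrete, here is a rough, hypothetical sketch of how those two ingredients could be combined in one objective. The exact Self-Flow formulation is in the paper; the toy heads, the interpolation path, and the weighting `lam` below are all invented for illustration.

```python
import numpy as np

# Hypothetical sketch, not the paper's objective:
# (1) a standard flow-matching loss on a linear noise-to-data path, plus
# (2) an auxiliary representation term computed at two different timesteps.

rng = np.random.default_rng(0)
d = 8
W_v = rng.normal(size=(d + 1, d)) * 0.1  # toy velocity head, input [x_t, t]
W_h = rng.normal(size=(d + 1, d)) * 0.1  # toy feature head, input [x_t, t]

def velocity(x_t, t):
    return np.concatenate([x_t, [t]]) @ W_v

def features(x_t, t):
    return np.tanh(np.concatenate([x_t, [t]]) @ W_h)

def self_flow_loss(x0, x1, t1, t2, lam=0.5):
    # Flow matching at timestep t1: the target on a linear path is the
    # constant velocity x1 - x0.
    x_t1 = (1 - t1) * x0 + t1 * x1
    fm = np.mean((velocity(x_t1, t1) - (x1 - x0)) ** 2)
    # Dual-timestep representation term: features of the same sample taken
    # at two timesteps t1 and t2 are pushed to agree.
    x_t2 = (1 - t2) * x0 + t2 * x1
    rep = np.mean((features(x_t1, t1) - features(x_t2, t2)) ** 2)
    return fm + lam * rep

x0, x1 = rng.normal(size=d), rng.normal(size=d)
loss = self_flow_loss(x0, x1, t1=0.2, t2=0.8)
print(loss)
```

Note how this matches the "no external models" claim in the post: both terms are computed from the model's own heads, so no pretrained encoder like DINO is needed to supply the representation target.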
Robin Rombach reposted
Hila Chefer @hila_chefer
New research from @bfl_ml 🥳 Meet Self-Flow: our self-supervised framework for image, audio, video & world models 🤖 bfl.ai/research/self-… Do generative models really need DINO to learn strong representations? We propose teaching them directly via a joint framework instead 🧵
Maciej Kilian @kilian_maciej
excited to share that i'm joining the @AnthropicAI pretraining team! claude is by far my favorite model and it brings me so much joy to get to be part of this. everyone i've met here is brilliant and incredibly kind and i'm really excited to be working with them :)
Anjney Midha @AnjneyMidha
if you were teaching a class to 500+ cs students on how to prepare for takeoff, what would you call it?
Robin Rombach reposted
Tomáš Procházka @tomasproc
pencil autocomplete #3 realtime model: FLUX.2 [klein] by @bfl_ml via @fal
Peter Steinberger 🦞 @steipete
@sabinedoering Also, mindset. How I found purpose again after 3 years of searching: USA: "oh man this is so great, let's build sth cool!" AT: "Yeah, but do take care of yourself so you don't get burnout again, right? So slow down a bit."
Sabine Döring @sabinedoering
The OpenClaw founder Peter Steinberger calmly debunks many doomer threat scenarios here. Now he is moving to OpenAI. Why couldn't, or wouldn't, Europe hold on to this talent? It quite obviously wasn't just about the money for him. on.orf.at/video/14311959…
Cristóbal Valenzuela @c_valenzuelab
Within two years, 90% of the pixels you see on screen, from images and videos to games and software, will be generated by AI.
Robin Rombach reposted
Dev Ed @developedbyed
Flux 2 Klein 4B param, dropped at 2 steps with much higher FPS! I also added a couple of LoRAs I'll mess around with. Such a good diffusion model!
Justine Moore @venturetwins
@jnack @krea_ai It’s powered by a new image model that’s much higher quality! But yes, if you’re using it professionally, you’d probably want to enhance ☺️
Justine Moore @venturetwins
Real-time image editing is insanely good for architecture. You can take a sketch, render a photorealistic building, and then change the materials, weather, or environment by simply adjusting the prompt. Done with the new @krea_ai model in seconds 🤯