Dmitry Baranchuk

21 posts

Dmitry Baranchuk

Dmitry Baranchuk

@DmitryBaranchuk

GenCV Researcher

Katılım Aralık 2021
53 Takip Edilen100 Takipçiler
Dmitry Baranchuk
Dmitry Baranchuk@DmitryBaranchuk·
Closing thoughts: unlike cascaded DMs or continuous-token AR, which run diffusion separately at each stage, SwD unifies scaling and diffusion into a single process. Thus, SwD can be seen as a natural continuous variant of next-scale prediction models (VAR/Switti/Infinity) (9/9)
English
0
0
4
90
Dmitry Baranchuk
Dmitry Baranchuk@DmitryBaranchuk·
I'd like to share our new diffusion distillation method, SwD, which produces few-step generators with progressive resolution scaling over the diffusion process. On SD3.5, SwD matches the speed of two full-size steps but with much better quality. Demo & models are released. (1/9)
Dmitry Baranchuk tweet media
English
2
2
9
1.2K
Dmitry Baranchuk
Dmitry Baranchuk@DmitryBaranchuk·
@ab_testing53 flux is clearly in a higher league. We acknowledge that it is better. Switti should be considered as a step in developing the new paradigm but not as a production-grade model like flux, midjorney, ideogram, etc
English
0
0
10
321
Dmitry Baranchuk
Dmitry Baranchuk@DmitryBaranchuk·
We introduce Switti, a scale-wise transformer for T2I generation. Our 2.5B model outperforms existing AR models and competes with SOTA diffusion models of the same size, while being up to 7x faster! We release our models and code, along with a demo to play around.(1/10)
Dmitry Baranchuk tweet media
English
5
21
167
54.8K
Dmitry Baranchuk
Dmitry Baranchuk@DmitryBaranchuk·
There is still much work to be done to achieve top-tier t2i model quality such as Midjorney, FLUX, etc. As the next step forward, we hope to integrate Switti with higher quality scale-wise image tokenizers, e.g., Infinity [1] released today. [1] arxiv.org/pdf/2412.04431 (10/10)
English
0
2
13
894