bagel.com

652 posts

@bageldotcom

Decentralized Diffusion Models. We're hiring: https://t.co/zFBqJxUhaq

Joined June 2023
8 Following · 12K Followers
Pinned Tweet
bagel.com (@bageldotcom)
Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.
145
168
986
602.2K
bagel.com (@bageldotcom)
In town for NVIDIA GTC? If you're building generative world models or investing in the people who are - we're putting the right people in one room for you tomorrow night in Palo Alto. Co-hosted by Alumni Ventures. Signup link below.
2
7
17
9.2K
bagel.com retweeted
bidhan @ NVIDIA GTC (@bidhan)
Excited to share that Bagel Labs' paper got accepted at CVPR 2026. A lot of the most important diffusion model research has historically stayed inside frontier labs. We're bringing more of that in the open through open science and open infrastructure. In this work we showcase the very counterintuitive advantage of mixing different training objectives (DDPM and Flow-Matching) through an ensemble of diffusion models. This is one of the first ever works to successfully combine diffusion models trained with heterogeneous objectives. See details here: blog.bagel.com/p/heterogeneou…
4
7
21
2.8K
bagel.com (@bageldotcom)
Diffusion models are becoming the foundation for image, video, and world models. We are hosting a founders and investors gathering on that topic during NVIDIA GTC week, co-hosted by our friends at Alumni Ventures. Mar 16, Menlo Park. Sign up below. luma.com/nvidia-gtc-gen…
3
8
17
4.9K
Deedy (@deedydas)
The Ultimate List of Artificial Intelligence "Neolabs". A Neolab is a pre-revenue scale startup working on long-term AI breakthroughs. Here's all 50 of them.
91
136
1.5K
236.8K
bagel.com retweeted
bidhan @ NVIDIA GTC (@bidhan)
Being at the frontier - by the definition of it - means creating the frontier. You don't get to be at the frontier by following someone else. And creating the frontier often means discoveries that go against the established knowledge. We recently made such a discovery about distributed diffusion model training. A common way to optimize diffusion model training is by ensuring the numerical stability of their generation paths. We found that that's not true for the most efficient distributed diffusion model training architecture. We shared what works instead in our blogpost below. blog.bagel.com/p/stability-qu…
13
32
456
50.2K
bagel.com retweeted
Yacine Mahdid (@yacinelearning)
alright folks, tomorrow, January 22, from 10h-12h AM EST, we're going to dive into decentralized diffusion models by reviewing the Paris model from Bagel Labs. I even managed to lock in @bidhan for an interview on why, how, and what makes that a good thing to even do. tune in!
Yacine Mahdid (@yacinelearning)

this weekend I'll be diving deep into decentralized training for diffusion models with the paris model and the bagel team (what a sentence)

3
8
91
25.9K
bagel.com retweeted
mirian (@mirimayer)
no better way to celebrate bagel day 🥯
4
3
14
2K
bagel.com retweeted
bidhan @ NVIDIA GTC (@bidhan)
NeurIPS takeaways (better late than never)
1. real AGI needs real continual learning - models that can keep learning without catastrophic forgetting.
2. model architectures need to be "stateful" for building accurate world models for games and robotics.
3. diffusion models are superior for solving both 1 & 2.
4. @bageldotcom's distributed diffusion training architecture is SOTA among both open and closed source frontier lab comparables.
5. the age of research is back, and no better place to do frontier diffusion model research than Bagel Labs.
join us - jobs.bagel.com
7
8
54
7.2K
bagel.com retweeted
nisten🇨🇦e/acc (@nisten)
cooking some stuff in distributed diffusion, there's quite a lot of unturned stones left in local and network inference when the architecture is made for it. (also we're training a video MoE)
bidhan @ NVIDIA GTC (@bidhan)

👀

2
4
19
4.3K
bagel.com retweeted
bidhan @ NVIDIA GTC (@bidhan)
I’m going to give a talk at NeurIPS on decentralized diffusion models later today, come by if you’re around!
4
6
43
3.6K
bagel.com retweeted
Tanishq Mathew Abraham, Ph.D. (@iScienceLuvr)
Noticing this release now (h/t @yacinelearning). This decentralized training approach is so cool! I will note I shared this alpha back when Luma released its paper in Jan. If you're not following me you're going to be missing alpha 😄
bagel.com (@bageldotcom)

Introducing Paris - world's first decentralized trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.

3
10
67
11.8K
bagel.com retweeted
eigenron (@eigenron)
@bageldotcom bagel labs, just like what it's named after, has impeccable taste
1
1
4
790
bagel.com (@bageldotcom)
Latent Diffusion
8
8
74
10.8K