Tal Daniel

113 posts

Tal Daniel banner
Tal Daniel

Tal Daniel

@TalDaniel8

Postdoc @ CMU Robotics Institute Ph.D. from the Technion ECE Research interests include self-supervsied learning, generative modeling, RL, robotics.

Katılım Kasım 2019
385 Takip Edilen296 Takipçiler
Sabitlenmiş Tweet
Tal Daniel
Tal Daniel@TalDaniel8·
🚀 #ICLR2026 Oral 💥 How can we design world models that capture object interactions directly from pixels? Introducing Latent Particle World Models-the first end-to-end self-supervised, object-centric world model, trained from videos, supporting action/img/lang conditioning. 1/n
English
4
43
266
22.2K
Tal Daniel
Tal Daniel@TalDaniel8·
During training, the posterior latent actions condition the dynamics module that predicts the next-frame prior. A KL regularization term aligns this prediction with the latent policy’s output, forming a VAE-style objective over particle transitions. 7/n
English
1
0
8
663
Tal Daniel
Tal Daniel@TalDaniel8·
The inverse dynamics observes particles at t and t+1, inferring the latent actions that caused the change. The latent policy sees only particles at t and outputs a distribution over possible latent actions from the current state. 6/n
English
1
0
6
572
Tal Daniel
Tal Daniel@TalDaniel8·
🚀 #ICLR2026 Oral 💥 How can we design world models that capture object interactions directly from pixels? Introducing Latent Particle World Models-the first end-to-end self-supervised, object-centric world model, trained from videos, supporting action/img/lang conditioning. 1/n
English
4
43
266
22.2K
Tal Daniel
Tal Daniel@TalDaniel8·
To address this, we introduce a context module that predicts latent actions per particle, enabling fine-grained, multi-entity dynamics It has two heads: (1) an inverse dynamics (posterior) and (2) a latent policy (prior). 5/n
Tal Daniel tweet media
English
1
0
8
628
Tal Daniel
Tal Daniel@TalDaniel8·
Building a world model means capturing stochastic particle dynamics. Existing “latent action” models help, but (1) need strong regularization (e.g., VQ) and (2) rely on a single global latent—missing interactions among multiple entities. 4/n
English
1
0
6
643
Tal Daniel
Tal Daniel@TalDaniel8·
DLP decomposes scenes to particles with several attributes (keypoints, bounding-boxes, masks), fully unsupervised. These act as visual “tokens,” making cross-modal long-horizon reasoning (vision ↔ language) far more natural than the standard pixel patches. 3/n
Tal Daniel tweet media
English
2
0
8
786
Tal Daniel
Tal Daniel@TalDaniel8·
Self-supervised object-centric models decompose scenes into entities without any supervision—a key step toward visual understanding. In this work, we extend Deep Latent Particles (DLP) to world-modeling on real-world datasets. 2/n
GIF
English
2
0
11
922
Tal Daniel retweetledi
Tal Daniel
Tal Daniel@TalDaniel8·
We have an amazing line-up of rockstar speakers, and we will also debut our “Latent Particle World Models” work (Thursday Feb 5, 11am EST). Schedule and livestream links: world-model-mila.github.io
English
1
0
0
70
Tal Daniel
Tal Daniel@TalDaniel8·
Interested in world models? The first “World Modeling” workshop is happening this week (Feb 4-6, Wed-Fri) at @Mila_Quebec , and it will be streamed on YouTube so you can follow along live!
Tal Daniel tweet mediaTal Daniel tweet media
English
1
0
7
147