Michael Drolet

21 posts

Michael Drolet banner
Michael Drolet

Michael Drolet

@mdrolet01

PhD student at @ias_tudarmstadt | Interested in latent representations & robotics

Katılım Ağustos 2024
93 Takip Edilen57 Takipçiler
Sabitlenmiş Tweet
Michael Drolet
Michael Drolet@mdrolet01·
Interested in discrete latents? I’ll be presenting a new method for training discrete VAEs at ICLR in Rio 🇧🇷 📍 Pavilion 3, P3-605 🗓 Thu, Apr 23 @ 11:15 More details below 👇
Michael Drolet@mdrolet01

DAPS: Discrete Variational Autoencoding via Policy Search (accepted @iclr_conf) A principled way to train discrete autoregressive encoders — without straight-through gradients. Entropy reg. + KL trust region for stable training and compact latents. 🔗 drolet.io/daps

English
1
1
11
456
Michael Drolet
Michael Drolet@mdrolet01·
Correction: Poster session is at 3:15 🙂
English
0
0
0
9
Michael Drolet
Michael Drolet@mdrolet01·
Interested in discrete latents? I’ll be presenting a new method for training discrete VAEs at ICLR in Rio 🇧🇷 📍 Pavilion 3, P3-605 🗓 Thu, Apr 23 @ 11:15 More details below 👇
Michael Drolet@mdrolet01

DAPS: Discrete Variational Autoencoding via Policy Search (accepted @iclr_conf) A principled way to train discrete autoregressive encoders — without straight-through gradients. Entropy reg. + KL trust region for stable training and compact latents. 🔗 drolet.io/daps

English
1
1
11
456
Michael Drolet
Michael Drolet@mdrolet01·
Step 3: Optimal target distribution. Solve a KL-constrained optimization problem to get a closed-form non-parametric target q*. Update the parametric encoder toward q*.
English
1
0
4
203
Michael Drolet
Michael Drolet@mdrolet01·
DAPS: Discrete Variational Autoencoding via Policy Search (accepted @iclr_conf) A principled way to train discrete autoregressive encoders — without straight-through gradients. Entropy reg. + KL trust region for stable training and compact latents. 🔗 drolet.io/daps
GIF
English
1
8
24
2.5K
Michael Drolet retweetledi
Ahmed Hendawy | أحمد هنداوى
🧵 Accepted at @iclr_conf ! Target networks stabilize bootstrapping in RL 🛡️ But induce slow-moving targets 🐢 Online networks adapt fast ⚡ But can diverge with function approximation 💥 𝗠𝗜𝗡𝗧𝗢🌿 uses the online network 𝗼𝗻𝗹𝘆 𝗶𝗳 𝗶𝘁 𝗰𝗮𝗻 — yielding faster and more stable RL. Here’s how 👇
English
1
6
35
2.2K
Davide Tateo
Davide Tateo@davide_tateo·
My life chapter in Darmstadt is coming to an end. Today, I got a goodbye present from my dear friend (and amazing researcher) @liu_puze
Davide Tateo tweet mediaDavide Tateo tweet mediaDavide Tateo tweet media
English
5
0
10
575
Haitham Bou Ammar
Haitham Bou Ammar@hbouammar·
I read this paper in detail, and I am very sad! They literally re-do the optimal reward baseline work that we have known since forever, without even crediting the true authors in their derivations. The third screenshot is taken from: ieeexplore.ieee.org/stamp/stamp.js… As you see, they are almost identical! This is so wrong!! Stop this behavior @Microsoft ... sounds novel doesn't it ? Ridiculous to say the least - it has been done DECADES AGO! #AI #MachineLearning
Haitham Bou Ammar tweet mediaHaitham Bou Ammar tweet mediaHaitham Bou Ammar tweet media
English
4
11
124
20.4K
Michael Drolet retweetledi
Firas Al-Hafez
Firas Al-Hafez@firasalhafez·
🚀 We’re releasing #LocoMuJoCo2 — An extensive library for whole-body imitation & RL. 🔥 Highlights: - Supports MuJoCo & MJX - Clean JAX impl of PPO, GAIL, AMP, DeepMimic - 22K+ trajectories per humanoid - Add your humanoid easily 🔗 github.com/robfiras/loco-… 🧵Details 👇
English
14
98
471
43.5K
Michael Drolet retweetledi
Intelligent Autonomous Systems Group
Intelligent Autonomous Systems Group@ias_tudarmstadt·
Something exciting just arrived at our lab! Here's a hint: precision mechanics, intricate wiring, and a touch of power. What could it be? A) High-end lab equipment B) Experimental space probe component C) Something that might walk past you in the hallway. Share your guesses!
Intelligent Autonomous Systems Group tweet media
English
2
2
17
2.3K