
Walter Mayor-Toro
65 posts




🔊Simplicial Embeddings (SEMs) Improve Sample Efficiency in Actor-Critic Agents🔊 In our recent preprint we demonstrate that the use of well-structured representations (SEMs) can dramatically improve sample efficiency in RL agents. 1/X

1/3 🥳Excited to share our new paper ‘Simplicial Embeddings Improve Sample Efficiency in Actor–Critic Agents’! Project your features onto a product of simplices → sparse, stable reps, stronger grads, faster learning. 🧵For more details, check out Pablo’s thread 👇


Scaling up networks in deep RL is tricky: do it naively, and performance collapses 😵💫 Why? Increasing depth and width leads to severe vanishing gradients, causing unstable learning ⚠️ We diagnose this issue across several algorithms: DQN, Rainbow, PQN, PPO, SAC, and DDPG.




Happening now!

.@WalterMayor_T presenting our paper at the LatinX in AI workshop




