

Thang Doan

@doan_tl
Pr. Applied Scientist @Oracle | Postdoc @Mila_Quebec, PhD @mcgillu, MSc @ucl




CSP: our new algorithm for scalable continual reinforcement learning. CSP allows users to control the trade-off between performance and model size by adaptively learning a subspace of policies. SOTA on Continual World and Brax. Interactive Website: bit.ly/3tRb9ri






Quick reminder that the deadline for the Self-Supervision for RL (sslrlworkshop.github.io) workshop at ICLR'21 is tomorrow (Feb 26th) AoE! Looking forward to everyone's submissions, and feel free to email us with any questions!






In the last @ContinualAI Reading Group, @Mehdi_A_Bennani presented his recent work, "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent"! youtube.com/watch?v=UOwgkB…
