Thang Doan

18 posts

Thang Doan

@doan_tl

Pr. Applied Scientist @Oracle | Postdoc @Mila_Quebec, PhD @mcgillu, MSc @ucl

LAX Katılım Aralık 2020

153 Takip Edilen135 Takipçiler

Sabitlenmiş Tweet

Thang Doan@doan_tl·25 Oca

Delighted to share our work “A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix “ accepted at AISTATS 2021 w/ @Mehdi_A_Bennani, @bogdan_mazoure, @grwip, @PierreAlquier Paper: arxiv.org/pdf/2010.04003… Code: github.com/tldoan/PCA-OGD (1/n) 👇

English

Thang Doan retweetledi

Jean-Baptiste Gaya@jb_gaya·2 May

I will present our paper Building a Subspace of Policies for Scalable Continual Learning at #ICLR23 at 3:10pm in room AD10. If you're interested in the topic, come to meet and chat about it ! @doan_tl @LucasPCaccia @LudovicDenoyer @LaureSoulier @robertarail

GIF

English

2.2K

Thang Doan retweetledi

Roberta Raileanu@robertarail·21 Oca

CSP accepted as spotlight at #ICLR2023. Kudos to @jb_gaya who led this project and to all the other wonderful co-authors @doan_tl @LucasPCaccia @LaureSoulier @LudovicDenoyer

Roberta Raileanu@robertarail

CSP: our new algorithm for scalable continual reinforcement learning. CSP allows users to control the trade-off between performance and model size by adaptively learning a subspace of policies. SOTA on Continual World and Brax. Interactive Website: bit.ly/3tRb9ri

English

11.3K

Thang Doan retweetledi

Roberta Raileanu@robertarail·26 Kas

Building a Subspace of Policies for Continual Reinforcement Learning When: Fri Dec 9, 8:25am PST Where: Virtual Paper: arxiv.org/abs/2211.10445 Website: bit.ly/3tRb9ri Thread: twitter.com/robertarail/st… led @jb_gaya w @doan_tl @LucasPCaccia @LaureSoulier @LudovicDenoyer

Roberta Raileanu@robertarail

English

Thang Doan@doan_tl·19 Tem

[Worskhop on Theory of Continual learning (ICML'21)] Thrilled to announce our awesome speaker line up: sites.google.com/view/cl-theory… You can ask questions to our speakers here: docs.google.com/forms/d/e/1FAI… Happening this Friday 23rd !🔥🚀

English

Thang Doan@doan_tl·7 Haz

We [EXCEPTIONALLY] extend the deadline of our workshop on "Theory of Continual Learning" (ICML'21) to June 11th (AoE) more information at: sites.google.com/view/cl-theory…

English

Thang Doan@doan_tl·31 May

We have extended the deadline of our workshop on "Theory and Foundation of Continual Learning", ICML'21 to June 07th (AoE) website: sites.google.com/view/cl-theory…

English

Thang Doan@doan_tl·7 May

If you want to know more about Self-supervision methods in RL, join our workshop today ! Happening in 1 hour 🔥🎉

Ankesh Anand@ankesh_anand

Quick reminder that the deadline for the Self-Supervision for RL (sslrlworkshop.github.io) workshop at ICLR'21 is tomorrow (Feb 26th) AoE! Looking forward to everyone's submissions, and feel free to email us with any questions!

English

Thang Doan@doan_tl·26 Mar

Glad to share my talk at the @ContinualAI reading group hosted by @v_lomonaco ! :)

Vincenzo Lomonaco@v_lomonaco

🔥 The recording of the latest @ContinualAI seminar with @doan_tl is here! 🔥 youtube.com/watch?v=iUlOxl…

English

Thang Doan@doan_tl·26 Oca

@LucasPCaccia thanks Lucas ! 😀

English

Lucas Caccia@LucasPCaccia·26 Oca

@doan_tl cool work :)

English

Thang Doan@doan_tl·25 Oca

English

Thang Doan@doan_tl·25 Oca

We propose a variant of Orthogonal Gradient Descent (OGD) which leverages the structure of the data through Principal Component Analysis called PCA-OGD. Our method is twice as memory efficient as OGD. (4/n)

English

Thang Doan@doan_tl·25 Oca

We introduce the “NTK Overlap matrix” as a task similarity measure that governs the CF. Unlike Vanilla SGD, projection based methods mitigate CF by operating orthogonally to previous task subspaces. This reduces the overlap between tasks leading to less forgetting. (3/n) 👇

English

Thang Doan retweetledi

CIFAR@CIFAR_News·19 Oca

Canada welcomes 29 newly-appointed Canada CIFAR AI Chairs at @AmiiThinks @Mila_Quebec @VectorInst - congratulations to all! #AICan cifar.ca/cifarnews/2021…

English

Thang Doan retweetledi

Ankesh Anand@ankesh_anand·18 Oca

The RL formalism is powerful in its generality, but poses a hard problem: how can we design agents that learn efficiently & generalize well, given only sensory info and a reward signal? Self-supervision might be the answer, join us at the ICLR workshop: sslrlworkshop.github.io

English

313

Thang Doan retweetledi

Sébastien Couasnon @SebCouasnon·16 Oca

Population déjà vaccinée : 🇮🇱24% 🇬🇧5,5% 🇺🇸4% 🇩🇰2,5% 🇮🇹1,7% 🇪🇸1,6% 🇨🇦1,3% 🇩🇪1,2% 🇪🇺1,1% 🇵🇹1% 🇷🇺1% 🇫🇷0,6%

Français

109

Thang Doan retweetledi

Mehdi Bennani@MehdiBennaniML·12 Oca

Thanks for hosting the reading group and for the discussions @v_lomonaco :)

Vincenzo Lomonaco@v_lomonaco

In the last @ContinualAI Reading Group @Mehdi_A_Bennani presented his recent work on "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent"! youtube.com/watch?v=UOwgkB…

English

Thang Doan retweetledi

Vincenzo Lomonaco@v_lomonaco·5 Oca

🔥 The @ContinualAI Reading Groups are back! 🔥 This Friday, 17.30 CET we will host Mehdi Abbana Bennani (Aqemia) who will present the paper: "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent" eventbrite.it/e/generalisati…

English

Keşfet

@LucasPCaccia @LudovicDenoyer @LaureSoulier @robertarail @jb_gaya @ContinualAI @v_lomonaco @bogdan_mazoure