CleanRL

@cleanrl_lib

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Philadelphia, PA Katılım Haziran 2022

0 Takip Edilen659 Takipçiler

Sabitlenmiş Tweet

CleanRL@cleanrl_lib·25 Tem

Yay!

Costa Huang@vwxyzjn

Happy to share my @nvidia internship's work: @cleanrl_lib's PPO now supports Isaac Gym! 📜 docs: #ppo_continuous_action_isaacgympy" target="_blank" rel="nofollow noopener">docs.cleanrl.dev/rl-algorithms/… A short 🧵

QST

CleanRL retweetledi

Chang Ye@yooceii·26 Ağu

Happy to share that @cleanrl_lib now supports Random Network Distillation + envpool, it's 3× faster than our first version without envpool and still have comparable performance to the original implementation, say 👋 to the long training time on hard-exploration games! Details👇

English

CleanRL retweetledi

Costa Huang@vwxyzjn·1 Ağu

Thanks to @_joaogui1's awesome contribution 🙏, @cleanrl_lib now has a TD3 + JAX implementation that is 2-4x faster than the TD3 + @PyTorch equivalent 🔥. Running on TPU is now possible, too 🚀! 📜 docs: #td3_continuous_action_jaxpy" target="_blank" rel="nofollow noopener">docs.cleanrl.dev/rl-algorithms/… 💾 code: github.com/vwxyzjn/cleanr… A short 🧵1/x

English

102

Keşfet

@PyTorch @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine