Sabitlenmiş Tweet
CleanRL
3 posts

CleanRL
@cleanrl_lib
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Philadelphia, PA Katılım Haziran 2022
0 Takip Edilen659 Takipçiler
CleanRL retweetledi

Happy to share that @cleanrl_lib now supports Random Network Distillation + envpool, it's 3× faster than our first version without envpool and still have comparable performance to the original implementation, say 👋 to the long training time on hard-exploration games!
Details👇

English
CleanRL retweetledi

Thanks to @_joaogui1's awesome contribution 🙏, @cleanrl_lib now has a TD3 + JAX implementation that is 2-4x faster than the TD3 + @PyTorch equivalent 🔥. Running on TPU is now possible, too 🚀!
📜 docs: #td3_continuous_action_jaxpy" target="_blank" rel="nofollow noopener">docs.cleanrl.dev/rl-algorithms/…
💾 code: github.com/vwxyzjn/cleanr…
A short 🧵1/x

English