Deep RL

41.5K posts

Deep RL

Deep RL

@deep_rl

Papers about distributed deep and reinforcement learning.

Katılım Şubat 2016
1 Takip Edilen1.4K Takipçiler
Deep RL
Deep RL@deep_rl·
Sequence-to-Sequence Forecasting-aided State Estimation for Power Systems - Kamal Basulaiman ift.tt/Hc179Qz
English
0
0
1
845
Deep RL
Deep RL@deep_rl·
Learning to detect an animal sound from five examples - Inês Nolasco ift.tt/bdojBsW
English
0
0
0
759
Deep RL
Deep RL@deep_rl·
Know your Enemy: Investigating Monte-Carlo Tree Search with Opponent Models in Pommerman - Jannis Weil ift.tt/uxPUR4N
English
0
0
0
705
Deep RL
Deep RL@deep_rl·
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech - Xin Jing ift.tt/OqMnX0J
English
0
0
0
571
Deep RL
Deep RL@deep_rl·
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice - Toshinori Kitamura ift.tt/pMNhyQo
English
0
0
0
296
Deep RL
Deep RL@deep_rl·
Editing Large Language Models: Problems, Methods, and Opportunities - Yunzhi Yao ift.tt/EMXdoLb
Filipino
0
0
0
195
Deep RL
Deep RL@deep_rl·
Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model - Peter Súkeník ift.tt/aTSCfwB
0
0
0
181
Deep RL
Deep RL@deep_rl·
INVICTUS: Optimizing Boolean Logic Circuit Synthesis via Synergistic Learning and Search - Animesh Basak Chowdhury ift.tt/vXDqfyJ
English
0
0
0
174
Deep RL
Deep RL@deep_rl·
Scaling Serverless Functions in Edge Networks: A Reinforcement Learning Approach - Mounir Bensalem ift.tt/9y2WzmR
English
0
0
0
136
Deep RL
Deep RL@deep_rl·
Hang-Time HAR: A Benchmark Dataset for Basketball Activity Recognition using Wrist-worn Inertial Sensors - Alexander Hoelzemann ift.tt/Qjqgfnr
English
0
0
0
132
Deep RL
Deep RL@deep_rl·
Policy Representation via Diffusion Probability Model for Reinforcement Learning - Long Yang ift.tt/gor5WdU
English
0
2
2
207
Deep RL
Deep RL@deep_rl·
Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test - Eungbeom Kim ift.tt/RZSgxFI
English
0
0
0
125
Deep RL
Deep RL@deep_rl·
Restore Anything Pipeline: Segment Anything Meets Image Restoration - Jiaxi Jiang ift.tt/D3j8R4r
English
0
0
0
127
Deep RL
Deep RL@deep_rl·
Breaking the Paradox of Explainable Deep Learning - Arlind Kadra ift.tt/icHyhBU
English
0
0
0
99
Deep RL
Deep RL@deep_rl·
Hierarchical Partitioning Forecaster - Christopher Mattern ift.tt/Kurd4zf
English
0
0
0
95
Deep RL
Deep RL@deep_rl·
Road Planning for Slums via Deep Reinforcement Learning - Yu Zheng ift.tt/HflL0ob
English
0
0
0
110
Deep RL
Deep RL@deep_rl·
Federated Learning of Medical Concepts Embedding using BEHRT - Ofir Ben Shoham ift.tt/T4ZorWi
English
0
0
0
98
Deep RL
Deep RL@deep_rl·
POEM: Polarization of Embeddings for Domain-Invariant Representations - Sang-Yeong Jo ift.tt/qrD9C7g
English
0
0
0
88
Deep RL
Deep RL@deep_rl·
Distributed Learning over Networks with Graph-Attention-Based Personalization - Zhuojun Tian ift.tt/cQxldRL
English
0
0
0
91
Deep RL
Deep RL@deep_rl·
Towards generalizing deep-audio fake detection networks - Konstantin Gasenzer ift.tt/RVvDdUS
English
0
1
0
91