Junwei Lu

3 posts

@lu_junwei

Assistant Professor of @HarvardBiostats

Joined April 2020
16 Following · 70 Followers
Junwei Lu retweeted
Zhaoran Wang (@zhaoran_wang):
We know optimism is provably efficient for online RL. What about offline RL? It turns out simply flipping the sign of the bonus is minimax optimal! Given a dataset, pessimism is the best effort we can make. arxiv.org/abs/2012.15085 Just leave pessimism to 2020. Happy new year~!
Junwei Lu retweeted
Tuo Zhao (@tourzhao):
Struggling with fine-tuning BERT models? Overfit your tasks again? Check our recent work on ACL 2020 "SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization" arxiv.org/abs/1911.03437