Quan Wei

6 posts

@quanwei0

PhD student @UMNews

Joined June 2023
113 Following · 24 Followers
Quan Wei retweeted
Siliang Zeng @ZengSiliang
Interesting project with @willccbb @quanwei0 @Mingyi552237 and other collaborators. We show that turn-level credit assignment indeed helps a lot for multi-turn tasks. Multi-turn task is the setting where we may think bringing MDP and other classic RL thoughts back to the table.
will brown @willccbb

new paper with @ZengSiliang and other collaborators about some of our multi-turn findings from the past couple months :) lots of experiments about the pros and cons of structured rewards and credit assignment, particularly relevant when doing tool-use RL with 7B models

Quan Wei retweeted
will brown @willccbb
new paper with @ZengSiliang and other collaborators about some of our multi-turn findings from the past couple months :) lots of experiments about the pros and cons of structured rewards and credit assignment, particularly relevant when doing tool-use RL with 7B models
Quan Wei @quanwei0
I am honored to receive the Best Student Paper Award at #ICASSP2023 with Prof. Ziping Zhao for our paper "Large covariance matrix estimation with oracle statistical rate"!