Emil Wärnberg

10 posts

Emil Wärnberg

Emil Wärnberg

@emil_warnberg

Computer scientist venturing into the exciting world of neuroscience

เข้าร่วม Mart 2017
215 กำลังติดตาม106 ผู้ติดตาม
Emil Wärnberg
Emil Wärnberg@emil_warnberg·
@xpuente @KordingLab Nevertheless, a more complicated model of RL in the brain probably needs to train some kind of policy network as one component. And for that having access to an error gradient is very beneficial, especially if the action space is continuous.
English
0
0
0
43
Emil Wärnberg
Emil Wärnberg@emil_warnberg·
@xpuente @KordingLab In our Discussion, we point at some ideas of how that gradient might appear, but realistically I think that would require a much more complete model of the whole brain - perhaps also including hippocampus for e.g. off-policy learning with some smart sampling strategy.
English
1
0
0
40
Emil Wärnberg
Emil Wärnberg@emil_warnberg·
@wegmor @Inscopix In summary, all three striatal pathways independently broadcast a similar, complete representation of the different levels of the task (choice value, trial phase) to downstream targets. This representation looks more like RL state, than actor or critic [8/9]
Emil Wärnberg tweet media
English
1
1
5
0
Moritz Weglage
Moritz Weglage@wegmor·
@emil_warnberg @Inscopix Trial progression: ✓. But can we decode win/switch strategy? Focusing on the right to center port turn, we indeed found neurons selective for win-stay vs lose-switch trials. Moreover, these neurons did not simply encode strategy; their activity increased with choice value. [7/9]
Moritz Weglage tweet media
English
1
0
1
0
Moritz Weglage
Moritz Weglage@wegmor·
In our new bioRxiv preprint, we used @Inscopix miniscopes to calcium-image neurons of 3 striatal output pathways (matrix direct/D1, matrix indirect/A2A and striosome/Oprm1) in mice making decisions. Hypothesis: the matrix serves as RL-actor and the striosome as critic. [1/9]
Moritz Weglage tweet media
English
1
35
86
0
Emil Wärnberg
Emil Wärnberg@emil_warnberg·
@wegmor @Inscopix Our division of the task into 12 discrete phases was somewhat arbitrary. Could we have chosen a finer division? Turns out the answer is yes: we can accurately decode the animals’ progression through individual phases (example: right to center port turn). [6/9]
Emil Wärnberg tweet media
English
1
0
2
0
Moritz Weglage
Moritz Weglage@wegmor·
@emil_warnberg @Inscopix The neurons kept their tunings across days and weeks. In fact, tunings were stable enough to support accurate, trial-by-trial decoding of the trial phase across days. [5/9]
Moritz Weglage tweet media
English
1
0
2
0
Emil Wärnberg
Emil Wärnberg@emil_warnberg·
@wegmor @Inscopix Let’s unpack that a bit. Here’s a video of three Oprm1-neurons during the task: the red one fires in the center to right port turn; the purple one fires in the center port prior to left choices; the cyan one when getting a reward from the left port. [3/9]
English
1
3
10
0
Emil Wärnberg
Emil Wärnberg@emil_warnberg·
@wegmor @Inscopix Choice port outcomes were occasionally swapped covertly (75% vs 0% reward). In response, mice adapted a win-stay (blue), lose-switch strategy (red). In short, the striatal activity reflected both the mice’s progression through trials and their win/switch strategy. [2/9]
Emil Wärnberg tweet media
English
1
1
3
0
Emil Wärnberg รีทวีตแล้ว
Sandro Galea
Sandro Galea@sandrogalea·
Determining what matters and does not is more complicated than accepting binary “statistical” significance. I signed on to this @nature paper, with 800+ other scientists, which eloquently urges we stop talking about statistical significance: go.nature.com/2TkvLDN @societyforepi
English
2
26
80
0