RunRL

23 posts

@runrl_com

Making reward go up

Joined January 2025
13 Following 179 Followers
RunRL retweeted
Goliath
Goliath@zero_goliath·
new @runrl_com blog post: i pretrained a tiny transformer model on perfect tic tac toe moves and measured how much it affects RL compute requirements
Goliath tweet media
English
2
1
3
769
RunRL retweeted
Dyusha Gritsevskiy
Dyusha Gritsevskiy@dyushag·
A cool technique from the RunRL research team: using PD controllers to balance multiobjective loss functions! Suppose you want to train a model to give short yet relevant answers. For a given minimum level of relevance, this lets us improve on the other reward terms much more!
Dyusha Gritsevskiy tweet media
English
1
3
11
1.1K
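The idea in the post above can be illustrated with a minimal sketch. This is not RunRL's implementation; the class `PDWeightController`, the function `combined_reward`, the gains `kp` and `kd`, and the bias `base_weight` are all hypothetical names and values chosen for illustration. It assumes a reward made of a brevity term plus a weighted relevance term, where a PD controller raises the relevance weight when measured relevance falls below a target and lowers it otherwise.

```python
from dataclasses import dataclass


@dataclass
class PDWeightController:
    """Hypothetical PD controller for the weight on one reward term."""
    target: float             # desired minimum relevance (assumed setpoint)
    kp: float = 0.5           # proportional gain (illustrative value)
    kd: float = 0.1           # derivative gain (illustrative value)
    base_weight: float = 1.0  # weight applied when the error is zero
    prev_error: float = 0.0   # error from the previous update

    def update(self, measured: float) -> float:
        """Return a new relevance weight given the measured relevance."""
        error = self.target - measured
        derivative = error - self.prev_error  # change in error per step
        self.prev_error = error
        weight = self.base_weight + self.kp * error + self.kd * derivative
        return max(0.0, weight)  # keep the weight non-negative


def combined_reward(relevance: float, brevity: float, weight: float) -> float:
    """Scalar reward: brevity plus the controller-weighted relevance."""
    return brevity + weight * relevance


if __name__ == "__main__":
    controller = PDWeightController(target=0.8)
    for measured in (0.6, 0.7, 0.8, 0.9):
        w = controller.update(measured)
        print(f"relevance={measured:.1f} weight={w:.3f}")
```

In a training loop, `update` would be called once per batch with the batch-mean relevance, and the returned weight used in `combined_reward` for the next batch. The derivative term damps oscillation when relevance swings quickly around the target.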
RunRL
RunRL@runrl_com·
It's time to Run RL!
RunRL tweet media
English
0
0
2
275
RunRL retweeted
Dyusha Gritsevskiy
Dyusha Gritsevskiy@dyushag·
if you've ever wanted to run rl there's never been a better time to do it than now
Dyusha Gritsevskiy tweet media
English
2
1
9
455
will brown
will brown@willccbb·
if you’re doing research in LLM reinforcement learning what are your pain points? what are the things that you feel like should really be easier, but they aren’t?
English
54
18
420
76.5K
maddie rune🪰
maddie rune🪰@maddierune·
What is the most pleasant, non-sexual, non-drug, experience a human can have?
English
1.9K
49
1.6K
3.5M
RunRL retweeted
Dyusha Gritsevskiy
Dyusha Gritsevskiy@dyushag·
opus 4 generating the RunRL headquarters in tikz
Dyusha Gritsevskiy tweet media
English
0
1
6
678
RunRL
RunRL@runrl_com·
and here it is...the funniest joke of all time!!!
RunRL tweet media
English
4
0
25
1K
RunRL
RunRL@runrl_com·
Have you ever wondered what the funniest joke in the world is? Well, with the power of RL, we can find out!
RunRL tweet media
English
7
5
61
9K