Clemens Winter

33 posts

Clemens Winter

Clemens Winter

@ClemensWinter

Maker. Distinguished member of the Rust Evangelism Strikeforce. Aspiring mad scientist-engineer. I currently make GPUs go brrr at OpenAI. Views my own.

Katılım Temmuz 2018
8 Takip Edilen904 Takipçiler
Sabitlenmiş Tweet
Clemens Winter
Clemens Winter@ClemensWinter·
1/ Excited to share my new blog post on Deep Reinforcement Learning and the Entity Neural Network (ENN) project! It's all about making RL easier to apply in complex simulated environments. Check it out: clemenswinter.com/2023/04/14/ent…
English
2
30
139
30K
Clemens Winter
Clemens Winter@ClemensWinter·
8/ A big thank you to my incredible collaborators: @Bam4d , @vwxyzjn , Théo Matricon, and Anssi Kanervisto. Your expertise and hard work have been invaluable in bringing the ENN project to life. It's been a pleasure working with you all!
English
0
0
3
951
Clemens Winter
Clemens Winter@ClemensWinter·
7/ I'm looking forward to seeing the amazing projects and applications you'll create using the ENN project. Let's unlock the full potential of Deep Reinforcement Learning together!
English
1
0
3
1K
Clemens Winter
Clemens Winter@ClemensWinter·
1/ Excited to share my new blog post on Deep Reinforcement Learning and the Entity Neural Network (ENN) project! It's all about making RL easier to apply in complex simulated environments. Check it out: clemenswinter.com/2023/04/14/ent…
English
2
30
139
30K
Clemens Winter retweetledi
Benedikt Winter
Benedikt Winter@benedikt_winter·
Excited about our new model SPT-NRTL to predict thermodynamically consistent molecular properties using physics-guided machine learning. Do you ever struggle to get high-accuracy predictions for activity coefficients? Check out our new pre-print: arxiv.org/abs/2209.04135 1/8
English
5
5
21
0
Clemens Winter retweetledi
andy jones
andy jones@andy_l_jones·
🚨 I've a paper out today: Scaling Scaling Laws with Board Games! 🚨 arxiv.org/abs/2104.03113 Principle result is that by studying a sequence of small problems in ML, I could predict the outcome of experiments on orders-of-magnitude larger problems 🤯
andy jones tweet media
English
4
71
396
0
Clemens Winter
Clemens Winter@ClemensWinter·
@vwxyzjn Actually, correction: I think the latest version might not compute the policy network for inactive drones. Inactive drones will still be observed by other drones and the value function.
English
0
0
1
0
Clemens Winter
Clemens Winter@ClemensWinter·
@vwxyzjn When a drone is building I mask out every action other than the "no-action" action. I still compute the latent state for the drone because it's useful for the value function.
English
1
0
0
0
Clemens Winter
Clemens Winter@ClemensWinter·
@vwxyzjn larger maps you probably want to focus a lot more heavily on building out production capacity in the early game.
English
1
0
0
0
Clemens Winter
Clemens Winter@ClemensWinter·
@vwxyzjn Agents can handle arbitrarily large maps in principle. I have not studied this rigorously, but I would expect them to do quite well on maps several times larger they weren't trained on. This breaks down somewhat once maps require fundamentally different strategies. E.g. on 10x...
English
1
0
1
0
Clemens Winter
Clemens Winter@ClemensWinter·
@vwxyzjn It does scout, but without the tile feature scouting doesn't work that well on larger maps. Agents explore much less efficiently and usually just end up moving around in the same areas rather than exploring every part of the map.
English
0
0
1
0
Costa Huang
Costa Huang@vwxyzjn·
@ClemensWinter Another interesting observation feature I found was the "tile" feature that helps with scouting. Does the agent scout at all without this feature?
English
2
0
0
0
Clemens Winter
Clemens Winter@ClemensWinter·
@vwxyzjn Nice, thanks for the recommendation! My REST API turned out to be a major pain point and significantly restricts performance now, so jpype would likely have been a much better choice.
English
1
0
0
0