Sabitlenmiş Tweet

Overnight, on a Mac, from weak baseline to solved.
This is what autonomous experimentation should look like:
set a goal, run real experiments, wake up to results.
BipedalWalkerHardcore-v3 is just the start.
Adham Ghazali@AdhamGhazali
I let @RemorooLabs run experiments overnight on my Mac, and by morning it had solved BipedalWalkerHardcore-v3. This is a notoriously difficult reinforcement learning benchmark, especially on low-compute hardware like a Mac. Starting from a weak baseline, Remoroo ran autonomous experiments, learned from the results, and found a winning solution. Here’s the video.
English
