
🧊 Ablation
We ablate key design choices of ZPRL, including whether the bottleneck is used, the perturbation scale λ, the dimension of z, and the offline data size.
Here is the takeaway:
1. A compact, task-relevant bottleneck latent matters.
2. λ must be large enough to steer, but small enough to stay local.
3. More offline data gives RL a better latent manifold to exploit.

English


