
Paper: huggingface.co/papers/2605.00…
Project page: finch.agibot.com/research/lwd
LWD uses DIVL and QAM to learn from successes, failures, and human interventions across the fleet, continuously improving a single generalist policy without imitating only demonstrations.
English









