Ke Wang retweetledi

🤖Low-data post-training can teach a VLA policy a new robot skill. But it also makes it too attached to the training demos.
We call this lock-in🔒: the policy can execute the post-training task, yet fails to respond to seemingly obvious prompt changes.
DeLock preserves steerability using only the policy’s own pretrained knowledge. No extra supervision needed!🚀🚀🚀
#Robotics #AI #EmbodiedAI #VLA
English
