Roy Fox (@[email protected]) retweetledi

Excited to share our work, "Skill Set Optimization", a continual learning method for LLM actors that:
- Automatically extracts modular subgoals to use as skills
- Reinforces skills using environment reward
- Facilitates skill retrieval based on state
allenai.github.io/sso
🧵

English











