Haibin@eric_haibin_lin
Recent updates on @verl_project (RL lib for LLMs):
Engine:
- Megatron qwen & GRPO support, v0.11 upgrade
- vllm v0.7 integration with v1 mode
- experimental sglang integration
Algorithm & recipes:
- vision language reasoning with qwen2.5-vl
- PRIME, RLOO, remax, math-verify rewards, etc
Docs:
- tutorial for distributed training setup and debugging
- programming model tutorial
Hardware:
- experimental AMD support
And many awesome community projects such as code-R1, Easy-R1, Search-R1, RAGEN, etc. Big thank you to the community!
Working on multi-turn & environment/tool supports. Stay tuned...