Hamza Benchekroun
68 posts

Hamza Benchekroun
@hparams
Core Team Researcher @hcompany_ai. Into anything that starts with Reinforcement and ends with Learning.

Today we're releasing prime-rl v0.6.0 — enabling RL at trillion-parameter MoE scale on agentic workloads at the highest efficiency. We've relentlessly optimized our RL infra. The result: GLM-5 on agentic SWE tasks at 131k context and sub-5-minute step time.














We’ve been cooking this summer: Holo1.5 is here! SOTA UI localization + QA, 3× gains vs Qwen-2.5 VL 🍳 Now up to 72B 💥 — a strong base for computer-use agents like Surfer. • Open weights on HuggingFace 🤗 huggingface.co/Hcompany/Holo1… • Blog post 📝 hcompany.ai/blog/holo-1-5 (1/n 🧵)







🚀 We’re excited to introduce Qwen3-235B-A22B-Thinking-2507 — our most advanced reasoning model yet! Over the past 3 months, we’ve significantly scaled and enhanced the thinking capability of Qwen3, achieving: ✅ Improved performance in logical reasoning, math, science & coding ✅ Better general skills: instruction following, tool use, alignment ✅ 256K native context for deep, long-form understanding 🧠 Built exclusively for thinking mode, with no need to enable it manually. The model now natively supports extended reasoning chains for maximum depth and accuracy. Hugging Face: huggingface.co/Qwen/Qwen3-235… or huggingface.co/Qwen/Qwen3-235… ModelScope: modelscope.cn/models/Qwen/Qw… or modelscope.cn/models/Qwen/Qw… API Doc: #16ff9753e1ctz" target="_blank" rel="nofollow noopener">alibabacloud.com/help/en/model-…







