

Sergei Vecherenko
635 posts

@serg_vecher
Building https://t.co/txo1elNnVc | Architecting AI Search 🤖 | Lead Frontend Engineer | Sharing insights on engineering leadership & AI workflows | CA 📍











Thank you so much for all the feedback on the Grok Build Beta. Some of you reported hitting limits quickly. Our team found areas to improve caching, so we've reset Grok Build usage limits for all accounts. Please keep sharing feedback - the team is here to help.

Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.
