Gemini 3.5 Flash scores 55.1% on SWE-Bench Pro.
Claude Opus 4.7 scores 64.3%.
Not even close.
Google just made a Flash model that beats their own Pro in tool use and agentic tasks.
But on real world coding?
Still 9 points behind Opus 4.7.
GPT 5.5 beats it too at 58.6%.
If this is the model Google needed to make a comeback with, it's not there yet on coding.
Waiting on Gemini 3.5 Pro.
That's where the real test is.
Add a fresh challenge to your Minecraft world!
With THIRST ON, you’ll need to manage your hydration to avoid fainting.
Purify water, build wells, and keep your water bottles within easy reach on your new belt.