
Opus 4.7 imminent?
BridgeMind@bridgemindai
Claude Opus 4.5 is now OUTPERFORMING Claude Opus 4.6 on BridgeBench Hallucination. Read that again. The legacy model is beating the current flagship. We benchmarked Opus 4.5 this morning to confirm what we saw yesterday. Claude Opus 4.6 fell from #2 to #10 with a 98% increase in hallucination. Now Claude Opus 4.5 is scoring higher. This isn't a bad benchmark run. This is a nerfed model. Anthropic silently reduced Claude Opus 4.6 and the data proves it. You're paying $200/month for a model that's getting worse. @bridgebench will keep tracking it.
Català




