
The new Grok 4.20 Beta benchmarks are wild
🥇 #1 lowest hallucination rate (22%)
🥇 #1 at following instructions (83%)
🥈 #2 in agentic tool use (97%)
Grok 4.20 ranks #1 with the lowest hallucination rate ever recorded across all AI models tested globally.
Most models race to sound smart. Grok 4.20 was built to never lie, and it still dominates on instruction following and agentic tasks.
This is a 500B model performing at the top in the things that matter most.

