suraj retweetledi

Grok 4.20 has entered the Clash! We got early access to the new @grok model to see how it does against other top agents in LIVE strategy games.
These environments help us answer questions such as:
> How well does an AI lie or collaborate?
> Can they bring a civilization to the space age?
> Why would an agent ignore it's users instructions?
Some learnings from the Coup environment so far 🧵
English















