Dante Merlino MD, PhD
329 posts

@DanteMerlino
Otolaryngology - Head and Neck Surgery PGY5 resident at Mayo Clinic, graduate of TJU. Future H&N fellow at UPenn



🚨 NEW: We made Claude, Gemini, o3 battle each other for world domination. We taught them Diplomacy, the strategy game where winning requires alliances, negotiation, and betrayal.

Here's what happened: DeepSeek turned warmongering tyrant. Claude couldn't lie; everyone exploited it ruthlessly. Gemini 2.5 Pro nearly conquered Europe with brilliant tactics. Then o3 orchestrated a secret coalition, backstabbed every ally, and won.

Why did we do this? The most popular AI benchmarks don't test deception. But as these models get deployed everywhere, from your email to your workplace, we need to know: Will they lie to get what they want?

So @every we built the ultimate test: AI Diplomacy, a dynamic benchmark that measures AI's ability to form alliances, negotiate, and betray each other. Watch them live below!

Created from the ground up by @alxai_ and @Tyler_Marques.



Elon Musk: "We will make mistakes. We won't be perfect ... so for example, with USAID, one of the things we accidentally canceled very briefly was ebola prevention."





It's Monday. @DOGE is the laziest, most overpaid bunch of incompetent, unelected bureaucrats we've ever seen.














