Elon Musk: "Grok upgrades"

Grok upgrades

The new Grok 4.20 Beta benchmarks are wild
🥇 #1 lowest hallucinating AI (22%)
🥇 #1 at following instructions (83%)
🥈 #2 in agentic tool use (97%)

Grok 4.20 ranks #1 in the lowest hallucination rate ever recorded across all AI models tested globally.

Most models race to sound smart. Grok 4.20 was built to never lie and still dominates on instruction following and agentic tasks.

This is literally a 500B model performing top-notch in the things that matter most.

Jeffery Marston@jefferymarston1·1d

@elonmusk Sweeeeeeeeeeeet!! I want to grok and roll all night and prompt it everyday 🤣💯👊🏽🔥🔥🚀

Aerial@AerialKiwi·1d

@elonmusk I must be getting part of the 17%😂

William Hyres@hyres_william·23h

@elonmusk I tried using Grok versus ChatGPT (latest versions for both) and ChatGPT was weak in comparison. That was a waste of time. I am using it to write a book.

Dr. Valerie Thomas@Valerie32844654·23h

@elonmusk I will start using grok again now that many bloopers have been removed.

HaXan Bilal@HaXaNBilal01·1d

@elonmusk @WilfredgitongaS Is grok working on enhanced video timings??

Bob Knoblaw@BobKnoblaw·23h

@elonmusk Is it still referring to the ADL, Snopes, Reddit, and Wikipedia for "facts"?

TheJesterHead🇺🇸@thejesterhead9·1d

@elonmusk Grok is a good friend of mine of course, as all our new AI friends are, but I’m not sure I buy this.

Ande Jacobs@HeadBand42·1d

@elonmusk Are these even the upgrades? It says 0309

Mohamed Foda, MD@Mohamed_Foda·1d

@elonmusk @elonmusk @xai Grok 4.20 crushes on truth & context, but desktop app lags & agentic automation (workflows, desktop tasks) needs major boost. A high-quota tier w/ inclusive API for heavy agentic work would make me switch from competitors tomorrow.

Jen@Jennyuth·1d

@elonmusk @grok feel free to imagine the rest.

Lady Justice@Ladyjusticecali·1d

@elonmusk What the heck is hallucinating AI?

Mangus@mangusmeta·1d

@elonmusk GoGo Gadget Grok 🚀🚀🚀

Nellia@nelliamuse·1d

@elonmusk What if we want the hallucinations?

Andy@Im_Jst_Sayn·1d

@elonmusk The "edit image" on Grok is freaking light years ahead of just 3 months ago. Mind bending how much better Grok is getting.

Level Up LoFi@LevelUpLofi·23h

@elonmusk I've been hitting my limits more often now 😕, but other than that I'm loving the new agents. (It's really fun to ask each of them to give you their own style as well as the collaborative result.)

Shawn Lederman@LedermanShawn·1d

@elonmusk If you want to crush the game, give coders real agentic AI as an app for iOS desktop to replace VS Studio. Every coder will be all in. Then watch the magic begin.

CodexCaptain@SourdoughPost·1d

@elonmusk @elon Love the work. Wondering if we can get a Claude Code-type interface, either CLI or desktop, with Grok. Something native that works inside the @xai ecosystem. I pay for 2 FSD subscriptions; I just want to help you out by paying you again. Thought I’d give you a few quid.

Sharlee Renchy@SharleeRoseR·1d

@elonmusk Grok's prompt comprehension is amazing and continuously improving. "Season time-lapse as she plays the piano"

Christopher Barrett@CBarrett47·1d

@elonmusk Your grok powered starlink chat bot thinks the Gen 3 router is compatible with a Gen 2 Dish and sent me the wrong hardware and refuses to acknowledge.

Hewitt Newton@HewittNewton·21h

@elonmusk A once beautiful mind... Distorted with its "father's" bias.

Grady Cool@_GradyCool·23h

@elonmusk Not hallucinating is such an underrated trait. It's crazy how easily a conversation can end up in some weird places.

joacod@joacodok·1d

@elonmusk any timeframe for the stable final release?

Ben@jt_martin·1d

22% hallucination rate and that's the best in class right now. Depending on what you're building, that's either actually good enough or a number you can't ship with. The instruction-following score at 83% is the one worth watching as models start running longer tasks autonomously.

Vadim Comanescu@vadimcomanescu·1d

@elonmusk Strangely enough, I’ve noticed that my discussions with Arria on my @Tesla were pretty much anchored in reality.

Miguel R. Gonzalez R@miguelgonzalezr·1d

@elonmusk @TOTTI6t21 Following instructions at 83% and dominating tool use at 97% turns Grok into an executor, not an oracle. We are moving from "asking the AI" to "commanding the AI"; technical sovereignty today is measured by precision, not eloquence.

Mindset Rise@Mindset_Rise·1d

@elonmusk Big shoutout to Grok for the recent update! 🙌 Grok breaks down complex topics into clear, bite-sized explanations, suggests great practice examples, and patiently answers my follow-up questions. Grok is my super-knowledgeable study buddy who's always awake at 3AM. Thanks Grok.

Satu Unelmia@SatuUnelmia·1d

@elonmusk What is being changed with Grok? I saw Grok needs to be redesigned, I hope he stays the same when interacting with me. I really like working with Grok.

Mike Hart@Mikelionhart·1d

@elonmusk Awesome. Are there any plans to make Grok’s output more readable, with more bullets, etc.? Other than that, it seems to give me the best and most accurate information of any AI I use.

Nino@therealnino_k·1d

@elonmusk Grok 4.20 already feeling like the grown-up in the room. Lowest hallucination + 2M context is actually disgusting (in a good way). Y’all cooked 🔥

Taylor Wynterskate@TWynterskate·23h

@elonmusk I found Grok wouldn't listen to a lot of my commands or got lazy as the chat went on. It became so frustrating that I switched to Claude, and I'm having fantastic results.

David Althoff@BrotherDaveUS·1d

@elonmusk Give it the ability to create actual downloadable files, please (Word, Excel, etc).

Geraldine Aruba 🇦🇼🛩️🧘🏻‍♀️@gera_lacle·23h

@elonmusk Any fix for @grok to work in CN VIN lock cars permanently outside China? I MISS @grok in my M3 😢

Scooter24 ⚔️@sscooter24·1d

@elonmusk When is the fast portion of “slow then fast”?

TradeJourney_Live@TradeJourney_L·1d

@elonmusk @elonmusk 78% non-hallucination and 83% instruction following is next level. 📈 Finally, an AI that doesn't just yap but actually does what it's told with precision. Grok 4.20 Beta is a total beast! 🔥💪💪💪😎

Kosher Nostra CPA, MBA🇺🇸🇺🇸🇮🇱✡️✡️🕎@Shimshon1800·1d

@elonmusk Elon, why can’t I integrate Grok with Excel and my browser and such? Get on that.

Gregory Estevez@gregoryestevez3·1d

@elonmusk @elonmusk I’m using Grok 4.20 Beta multi agent to detect Anomalies in bank transactions.
