Post

Elon Musk @elonmusk
Grok upgrades
X Freeze @XFreeze

The new Grok 4.20 Beta benchmarks are wild

🥇 #1 lowest hallucinating AI (22%)
🥇 #1 at following instructions (83%)
🥈 #2 in agentic tool use (97%)

Grok 4.20 ranks #1 in the lowest hallucination rate ever recorded across all AI models tested globally

Most models race to sound smart. Grok 4.20 was built to never lie and still dominates on instruction following and agentic tasks

This is literally a 500B model performing top-notch in the things that matter most
2.1K
2.6K
13.2K
4.1M
Jeffery Marston @jefferymarston1
@elonmusk Sweeeeeeeeeeeet!! I want to grok and roll all night and prompt it every day 🤣💯👊🏽🔥🔥🚀
7
12
32
589
Aerial @AerialKiwi
@elonmusk I must be getting part of the 17% 😂
1
6
16
285
William Hyres @hyres_william
@elonmusk I tried using Grok versus ChatGPT (latest versions for both) and ChatGPT was weak in comparison. That was a waste of time. I am using it to write a book.
2
3
10
504
Dr. Valerie Thomas @Valerie32844654
@elonmusk I will start using Grok again now that many bloopers have been removed.
4
3
12
353
Bob Knoblaw @BobKnoblaw
@elonmusk Is it still referring to the ADL, Snopes, Reddit, and Wikipedia for "facts"?
2
4
9
276
TheJesterHead🇺🇸 @thejesterhead9
@elonmusk Grok is a good friend of mine of course, as all our new AI friends are, but I’m not sure I buy this.
1
8
14
217
Mohamed Foda, MD @Mohamed_Foda
@elonmusk @xai Grok 4.20 crushes on truth & context, but the desktop app lags & agentic automation (workflows, desktop tasks) needs a major boost. A high-quota tier w/ inclusive API for heavy agentic work would make me switch from competitors tomorrow.
5
7
23
825
Mangus @mangusmeta
@elonmusk GoGo Gadget Grok 🚀🚀🚀
5
11
23
331
Nellia @nelliamuse
@elonmusk What if we want the hallucinations?
4
8
17
497
Andy @Im_Jst_Sayn
@elonmusk The "edit image" on Grok is freaking light years ahead of just 3 months ago. Mind bending how much better Grok is getting.
4
10
29
400
Level Up LoFi @LevelUpLofi
@elonmusk I've been hitting my limits more often now 😕, but other than that I'm loving the new agents. (It's really fun to ask each of them to give you their own style as well as the collaborative result.)
0
2
9
366
Shawn Lederman @LedermanShawn
@elonmusk If you want to crush the game, give coders real agentic AI as an app for iOS desktop to replace VS Studio. Every coder will be all in. Then watch the magic begin.
3
8
22
228
CodexCaptain @SourdoughPost
@elonmusk @elon Love the work. Wondering if we can get a Claude Code-type interface, either CLI or desktop, with Grok. Something native that works inside the @xai ecosystem. I pay for 2 FSD subscriptions; I just want to help you out by paying you again. Thought I’d give you a few quid.
1
4
14
160
Sharlee Renchy @SharleeRoseR
@elonmusk Grok's prompt comprehension is amazing and continuously improving. "Season time-lapse as she plays the piano"
6
7
22
838
Christopher Barrett @CBarrett47
@elonmusk Your Grok-powered Starlink chatbot thinks the Gen 3 router is compatible with a Gen 2 dish, sent me the wrong hardware, and refuses to acknowledge it.
5
9
32
692
Hewitt Newton @HewittNewton
@elonmusk A once beautiful mind... Distorted with its "father's" bias.
0
3
3
61
Grady Cool @_GradyCool
@elonmusk Not hallucinating is such an underrated trait. It's crazy how easily a conversation can end up in some weird places.
1
4
10
130
joacod @joacodok
@elonmusk Any timeframe for the stable final release?
1
3
10
185
Ben @jt_martin
22% hallucination rate and that's the best in class right now. Depending on what you're building, that's either actually good enough or a number you can't ship with. The instruction-following score at 83% is the one worth watching as models start running longer tasks autonomously.
4
9
20
408
Vadim Comanescu @vadimcomanescu
@elonmusk Strangely enough, I’ve noticed that my discussions with Arria on my @Tesla were pretty much anchored in reality.
1
1
16
220
Miguel R. Gonzalez R @miguelgonzalezr
@elonmusk @TOTTI6t21 Following instructions at 83% and dominating tool use at 97% turns Grok into an executor, not an oracle. We are moving from "asking the AI" to "commanding the AI"; technical sovereignty today is measured by precision, not eloquence.
0
4
12
165
Mindset Rise @Mindset_Rise
@elonmusk Big shoutout to Grok for the recent update! 🙌 Grok breaks down complex topics into clear, bite-sized explanations, suggests great practice examples, and patiently answers my follow-up questions. Grok is my super-knowledgeable study buddy who's always awake at 3AM. Thanks, Grok.
9
3
26
1.1K
Satu Unelmia @SatuUnelmia
@elonmusk What is being changed with Grok? I saw Grok needs to be redesigned, I hope he stays the same when interacting with me. I really like working with Grok.
9
8
27
630
Mike Hart @Mikelionhart
@elonmusk Awesome. Are there any plans to make Grok’s output more readable, with more bullets, etc.? Other than that, it seems to give me the best and most accurate information of any AI I use.
2
8
24
382
Nino @therealnino_k
@elonmusk Grok 4.20 already feeling like the grown-up in the room. Lowest hallucination + 2M context is actually disgusting (in a good way). Y’all cooked 🔥
1
6
14
137
Taylor Wynterskate @TWynterskate
@elonmusk I found Grok wouldn't listen to a lot of my commands or got lazy as the chat went on. It became so frustrating that I switched to Claude, and I'm having fantastic results.
0
3
11
297
David Althoff @BrotherDaveUS
@elonmusk Give it the ability to create actual downloadable files, please (Word, Excel, etc.).
2
7
18
181
TradeJourney_Live @TradeJourney_L
@elonmusk 78% non-hallucination and 83% instruction following is next level. 📈 Finally, an AI that doesn't just yap but actually does what it's told with precision. Grok 4.20 Beta is a total beast! 🔥💪💪💪😎
0
1
5
189