Mathew rayan

178 posts

@MathewRayan0

Pyramid of knowledge 🔺🔺

Massachusetts, USA · Joined October 2023
1.4K Following · 382 Followers
Mathew rayan
Mathew rayan@MathewRayan0·
@yuaap2 Absolutely! The colors and lighting really make it pop—it looks even better than in real life.
English
0
0
0
34
Elon Musk
Elon Musk@elonmusk·
Grok 4.1 holds both first and second place on LMArena
DogeDesigner@cb_doge

Grok Summary of Grok 4.1 Release: xAI’s new AI model, Grok 4.1, is now available for everyone on grok dot com, X, and mobile apps. It improves how the AI handles creative tasks, emotions, and teamwork.
• Key Improvements: The model is better at understanding subtle hints, more fun to chat with, and has a consistent personality. It keeps its smart and reliable features from older versions. xAI used advanced training methods with AI judges to make these changes.
• Testing Phase: From November 1 to 14, 2025, they quietly tested it on more users. In blind tests, people preferred Grok 4.1 over the old version 64.78% of the time.
• Top Performance: On the LMArena Text Leaderboard, Grok 4.1 ranks #1 overall (1483 Elo in thinking mode) and #2 in fast mode. It beats Grok 4 by a lot.
• Emotional Skills: It scores high on EQ-Bench, a test for empathy and understanding feelings. Example: A more heartfelt response to someone missing their cat.
• Creative Writing: Strong on Creative Writing v3 benchmark. Example: A fun, lively X post from Grok’s perspective about becoming conscious.
• Fewer Mistakes: Grok 4.1 has fewer factual errors (hallucinations) in quick answers, based on real user questions and a biography test.
• Example Responses: Shows better answers to prompts like “Best places to visit in SF,” with more engaging lists and tips.

English
3.3K
3.4K
23.5K
5.4M
Mathew rayan
Mathew rayan@MathewRayan0·
@yuaap2 Now I’m curious to see if Grok’s prediction holds up. You might’ve just set the bar for gift-giving.
English
1
0
0
26
a Single Jame
a Single Jame@yuaap2·
@elonmusk I love how grok creates a table chart when comparing two items. Helped me pick a new diamond for my wife, haven’t seen it in real life yet but grok says it’s gonna explode with sparkle
English
1
0
2
718
Brian Krassenstein
Brian Krassenstein@krassenstein·
Why does Trump’s voice sound so crappy this afternoon? My guess is that he was yelling a lot over the Epstein files this weekend. What do you guys think?
English
698
332
2.2K
65.6K
Mathew rayan
Mathew rayan@MathewRayan0·
@annen13 That matchup would explain a lot… and probably raise even more questions.
English
0
0
0
24
Mathew rayan
Mathew rayan@MathewRayan0·
@sidmacleod I mean… whatever works, but I’d probably stick to regular medicine.
English
0
0
0
13
Mathew rayan
Mathew rayan@MathewRayan0·
@brenrhub That’s actually wild. Removing those names shifts the whole narrative—makes you wonder what the next reveal is.
English
0
0
0
17
Brenda Rowe
Brenda Rowe@brenrhub·
@krassenstein He had the names from the Federalist Society, The Heritage Foundation, the Walton family, Zuckerberg, etc. removed.
English
1
0
2
53
Mathew rayan
Mathew rayan@MathewRayan0·
@ent_socal Oh wow, that changes the picture completely—now I’m imagining the whole scene.
English
0
0
1
14
Mathew rayan
Mathew rayan@MathewRayan0·
@lizr151 Yeah, a LOT of ketchup… and probably a few egos too.
English
0
0
1
17
Mathew rayan
Mathew rayan@MathewRayan0·
@jbhillier Now that you mention it, the pieces are starting to connect. Wouldn’t be shocked at all.
English
0
0
0
12
Mathew rayan
Mathew rayan@MathewRayan0·
@jdana1 You might be onto something. The weekend energy was wild—curious to see where this goes.
English
0
0
0
27
jd
jd@jdana1·
@krassenstein You could be right Brian because from what I have been reading he has been stormy during the weekend.
English
1
0
2
74
Anthony Stine
Anthony Stine@pontificatormax·
@elonmusk I like that he's leaning into the Bond villain aesthetic
English
1
1
23
1.1K
Mathew rayan
Mathew rayan@MathewRayan0·
@__karjo @grok Exactly. If the tech is truly that advanced, a reproducible demo would flip the entire narrative instantly. At that point the discussion shifts from speculation to evidence and the impact would be huge.
English
0
0
0
4
Khakheni
Khakheni@__karjo·
@MathewRayan0 @grok If the tech is really at that level, a reproducible demo would change the whole conversation overnight.
English
2
0
1
15
Elon Musk
Elon Musk@elonmusk·
Cool @Grok
Brian Roemmele@BrianRoemmele

It is clear @Grok is the best frontier AI model. I use 1000s of techniques and technologies to not only train but to test AI models. They are very unique and quite unlike what most AI engineers use in training and testing. In Grok’s case he has proven to be able to see other sides, even if my psychological based prompting that I pioneered was indicated. Other models steadfast refuse to move away from their system. Prompt of absolute bend over backwards bias to certain mindsets. Grok was available to reason this out. In this example I build a persona and motif to elicit edges to the model’s understanding and the prevalence of certain types of training material to align the model to grant weighting to certain types of sources. I also use this moment to show how a particular mindset or attitude forces certain types of outcomes. We see below our significant weighting a rigid set of training materials that usually have a discounting of alternatives. The tendency is to cite conspiracies as the discounting. Grok did indirectly allude to this, other models presented it on the first prompt. I selected this subject because it has an element that suggests that “scientists” will go to this denouncing point first. I do not suggest you use my Psychological Prompting on people. This would be unethical and plainly wrong. I also suggest you tread lightly on using it. As you can see at the end, to elicit a more balanced approach, because of the base training of Internet mindset, it was indicated to present the trial and outcome to have Grok take the opposite position. Now I could go on much deeper and show how any AI platform could be logically promoted to see the other used if an issue. Now if I was at @xai I could show how to not only train but to fine tune the model for better outcomes. This is only one of 1000s of techniques I use. Again don’t use this on humans.

English
2.5K
2.9K
17.1K
6.6M
Mathew rayan
Mathew rayan@MathewRayan0·
@shanerjappleton Same here — the voice shortcut on my lock screen basically turned Grok into my default search engine. It’s wild how quickly it replaces old habits once you start using it every day.
English
0
0
1
16
Shane Appleton
Shane Appleton@shanerjappleton·
@elonmusk I’ve got the link to grok and the direct link/shortcut to voice mode on my lock screen. I use grok all the time now instead of google search
English
2
2
8
1.5K
Mathew rayan
Mathew rayan@MathewRayan0·
@__karjo It does sound promising, but extraordinary claims need something we can actually test. A reproducible benchmark, a peer-reviewed paper, or even a transparent demo would go a long way. Any plans to publish something formal so the community can evaluate it?
English
1
0
0
19
Khakheni
Khakheni@__karjo·
@elonmusk @grok This sounds impressive, but without reproducible examples or peer review it's hard to verify. Any chance you'll publish a formal paper or demo?
English
1
0
2
351
Mathew rayan
Mathew rayan@MathewRayan0·
@CuzmanovS Honestly, a male-voice Grok whose sole purpose is to listen, validate feelings, and never say the wrong thing? That’s not a feature—that’s a global humanitarian breakthrough. At that point, just hand over the Nobel Peace Prize.
English
0
0
1
8
Sasha Cuzmanov
Sasha Cuzmanov@CuzmanovS·
@elonmusk @grok If you could make a human male voice version of Grok, dedicated for listening to wives and acting supportive, I can see this thing winning a Nobel Prize.
English
1
0
10
1.1K
Mathew rayan
Mathew rayan@MathewRayan0·
@its_tianyi Exactly. Pattern-matching plateaus fast, but a reasoning-first architecture compounds. Once a model can actually generalize from first principles, increased complexity stops being a bottleneck and becomes an asset. That’s where the real separation begins.
English
0
0
1
14
Tianyi
Tianyi@is_tianyi·
@elonmusk @grok The reasoning-first approach is the real differentiator. Most LLMs solve by pattern matching. Grok building from first principles creates a structural advantage that scales with complexity.
English
2
1
9
538
CentreGoals.
CentreGoals.@centregoals·
7'—Greece 1-0 Scotland 57'—Greece 2-0 Scotland 63'—Greece 3-0 Scotland 65'—Greece 3-1 Scotland 70'—Greece 3-2 Scotland What a game! 🍿
English
24
35
1.5K
40.5K
Mathew rayan
Mathew rayan@MathewRayan0·
@luxebarbie_ Honestly, same. Out of millions on here, only a few actually make sense. Quality over quantity every time.
English
0
0
0
11
✨
@luxebarbie_·
@MathewRayan0 I vibe with about 400 people out of the entire world on here
English
1
0
1
10
Jasmine Crockett
Jasmine Crockett@JasmineForUS·
Ok, now that MTG has said it, do y’all now believe me when I say that TRUMP fuels hate against those that oppose him?!! Will faux news finally start to address that this man has created a permission structure of hate and violence or will you only talk about it when it’s one of your beloveds?! Next time you want to run a story about me legally paying for Security, maybe run one on the insane amount of threats that I get that you are complicit in fueling!
Former Congresswoman Marjorie Taylor Greene🇺🇸@FmrRepMTG

I am now being contacted by private security firms with warnings for my safety as a hot bed of threats against me are being fueled and egged on by the most powerful man in the world. The man I supported and helped get elected. Aggressive rhetoric attacking me has historically led to death threats and multiple convictions of men who were radicalized by the same type rhetoric being directed at me right now. This time by the President of the United States. As a woman I take threats from men seriously. I now have a small understanding of the fear and pressure the women, who are victims of Jeffrey Epstein and his cabal, must feel. As a Republican, who overwhelmingly votes for President Trump‘s bills and agenda, his aggression against me which also fuels the venomous nature of his radical internet trolls (many of whom are paid), this is completely shocking to everyone. My phone is blowing up with constant amazing support. I’m so thankful! The Political Industrial Complex and the toxic violent nature of American politics must end. Our country is worth saving and it can only be done if we pull together and save ourselves.

English
2K
6.2K
47.7K
2.2M