
Jason April
12.8K posts

Jason April
@Jason_April
Politically independent maker of music, art, prose, and code. Accidental polymath. Fight the ruling class, not each other.
Boston شامل ہوئے Temmuz 2011
3K فالونگ1.1K فالوورز
پن کیا گیا ٹویٹ

The United States is by far the greatest threat to world peace.
brilliantmaps.com/threat-to-peac…
English

@mark_k @_Ian__Brown_ So you're an idiot.
Socialism is when workers own the means of production.
English

@mark_k @koltregaskes You're welcome to run your own local model or pay API fees.
English

@mark_k Feel free to use another service.
But they all do this because it's a decent thing to do.
English

@kimmonismus What kind of person tells ChatGPT that their baby just walked?
English

@mark_k Put your money where your mouth is on Polymarket if you're actually confident.
English

@Maga_Trigger The bullet killed someone. This being staged is a liberal conspiracy theory.
English

@mtracey You're right, but this isn't the hill I'd die on.
What stands out to me is that these same people justify Biden's far creepier public interactions with children.
English

Trump putting his arm around his own daughter is supposed to tell us "there's much more to this Epstein story than meets the eye." 30K likes. What utter garbage
KT "Special MI6 Operation"@KremlinTrolls
There's much more to this Epstein story than meets the eye, let me tell you.
English

@scaling01 Who came up with these numbers, and why are all variants of every model included *except* for GPT-5?
This looks like it was made by someone with an agenda.
English

@rohanpaul_ai So you have to pay more for the lesser models. Got it.
English


@fchollet Everyone's sentimental about their childhood experiences, not just millennials.
English

I dunno man, it's just not very cash money when after GPT-4, finished mid-2022, had "sparks of AGI" evidenced by physical intuition, your "Ph.D level intelligence" GPT-5, mid-2025, can't figure out how to flip a mug
I suppose it's good at excel tho! I just doubt this merits $7T

Henry Shevlin@dioscuri
@teortaxesTex I don’t get what these riddle-me-this questions are supposed to prove, especially when we already know that reasoning versions of the models can solve them. 98% of the business use cases for LLMs are “do this excel or coding thing for me”.
English

@sama Oh, double the mystery rate to twice the mystery rate! How mysterious!
English

GPT-5 rollout updates:
*We are going to double GPT-5 rate limits for ChatGPT Plus users as we finish rollout.
*We will let Plus users choose to continue to use 4o. We will watch usage as we think about how long to offer legacy models for.
*GPT-5 will seem smarter starting today. Yesterday, the autoswitcher broke and was out of commission for a chunk of the day, and the result was GPT-5 seemed way dumber. Also, we are making some interventions to how the decision boundary works that should help you get the right model more often.
*We will make it more transparent about which model is answering a given query.
*We will change the UI to make it easier to manually trigger thinking.
*Rolling out to everyone is taking a bit longer. It’s a massive change at big scale. For example, our API traffic has about doubled over the past 24 hours…
We will continue to work to get things stable and will keep listening to feedback. As we mentioned, we expected some bumpiness as we roll out so many things at once. But it was a little more bumpy than we hoped for!
English

@rohanpaul_ai "Model trained on benchmarks holds marginal lead on benchmarks."
English

However xAI's Grok 4 Heavy continues to be the number one on HLE (Humanity’s Last Exam)
Grok 4 Heavy achieves 44.4% accuracy on HLE
GPT-5 Pro (using Python and search tools with blocklist) achieves 42.0% on the same benchmark.
And also note due to Continual Reinforcement Learning Grok 4 learns literally everyday. i.e. Grok 4 is smarter now vs 2 weeks ago.
I wrote a detailed post on this here.
rohan-paul.com/i/169268692/ho…
---
Backgournd on Humanity’s Last Exam, or HLE,
It'ss a 2,500-question multimodal test designed to measure expert-level reasoning.
- xAI states that Grok 4 Heavy is the first model to cross the 50% mark on HLE,
- Earlier OpenAI’s o3 (high configuration) records 20.32% on the same leaderboard, while an o3 variant optimized for data retrieval reaches 26.6% in OpenAI’s own release notes.
@xai @grok


Elon Musk@elonmusk
Grok 5 will be out before the end of this year and it will be crushingly good
English

@donkoclock @SouthPark A lowbrow dick joke isn't exactly bravery.
English

South Park has more courage than our MSM.
Drop a 💙 for @SouthPark! Let them know we love their work!

English

@glenn_tunes Tell that to the person on the other side of Trump who *was shot and killed* by the bullet that grazed Trump, you conspiracy theorist moron.
English

@mtaibbi Just instruct it differently, and it acts differently.
Have you actually used it? Because it sounds like you haven't.
English















