Zsolt Ero

910 posts

Zsolt Ero banner
Zsolt Ero

Zsolt Ero

@hyperknot

Building https://t.co/CUfyhT0Ura and https://t.co/GTLrvnmS0h Writing on https://t.co/irgNrwubhY Loves paragliding

Europe Katılım Temmuz 2012
1.1K Takip Edilen3.6K Takipçiler
Zsolt Ero
Zsolt Ero@hyperknot·
@levelsio @parasight And for dark style, there is "dark" and "fiord" styles, just put them in the URL, it'll work.
English
1
0
1
531
Zsolt Ero retweetledi
@levelsio
@levelsio@levelsio·
✨ I just replaced Mapbox on all my sites with OpenFreeMap by @hyperknot and my map bill is now $0 Mapbox's pricing is getting increasingly extortionary (which is fine, it's capitalism) but at some point you have to think, $857/month for what? A map? Really? A map is that expensive? How can loading a map be that expensive? It's just some PNG tiles you host somewhere? Why? @OpenFreeMapOrg is 100% free and all you do is point your AI to openfreemap(dot)org and tell it to replace Mapbox with that 5 minutes and you save thousands $$$ per year! Apparently @Cloudflare sponsors its bandwidth which is very cool and keeps it online!
@levelsio tweet media
English
40
120
3.3K
471.4K
Zsolt Ero
Zsolt Ero@hyperknot·
@levelsio Welcome on board! Making web maps free and fun again!
English
0
0
155
11.8K
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
What's the smallest (but comparatively coolest) extension you built for pi?
English
74
2
104
36.3K
Mario Zechner
Mario Zechner@badlogicgames·
i'm now convinced they switched opus 4.6 to be haiku. only half joking.
English
13
0
183
15.1K
Zsolt Ero
Zsolt Ero@hyperknot·
@theo Its not the model, it's CC which they keep slopnerfing then fixing. The same model is served on Azure, Bedrock, Google, how could they nerf them at the same time?
English
0
0
0
101
Theo - t3.gg
Theo - t3.gg@theo·
Serious question. Has anyone ever noticed meaningful regressions in Codex/OpenAI models? I feel like we talk about this a lot w/ Anthropic but I've never seen a similar discussion with OAI.
English
306
11
1.7K
142K
Zsolt Ero
Zsolt Ero@hyperknot·
@mitsuhiko Models should remember thinking mode. I like Sonnet + medium and GPT + high. Painful to cycle through it every time I switch models.
English
2
0
1
179
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
If something really bothers you about pi, give us feedback please :)
English
91
1
156
15.3K
Zsolt Ero
Zsolt Ero@hyperknot·
@zeeg @viggy28 Put pi in SSE mode, it's ridiculously fast with GPT models.
English
0
0
2
549
David Cramer
David Cramer@zeeg·
@viggy28 I'm unlikely to move off the OpenAI/Anthropic ecosystsems, but Pi is hands down the best coding harness alternative out there
English
8
10
209
14.5K
David Cramer
David Cramer@zeeg·
anthropic being down, rate limits, etc is understandable. huge hardware reqs and insane growth. claude code being bug filled is just a result of people vibe coding slop at anthropic and accepting it as ok. it is not ok you will lose your customers just as fast as you've gained them.
English
89
35
1.1K
118.1K
Zsolt Ero
Zsolt Ero@hyperknot·
@ryanflorence I went through all, @kobaltedev has the highest quality code. You need to ask Claude to translate the CSS to Tailwind though. Corvu is also good. The others are just wrappers/porting efforts and are mostly overcomplicated or abandoned. Ark is architecture astronaut stuff.
English
0
0
2
242
Ryan Florence
Ryan Florence@ryanflorence·
What's the Base UI of Solid.js?
English
14
1
69
20.6K
Zsolt Ero
Zsolt Ero@hyperknot·
@zeeg I was thinking of making a complexity bench. Instead of a zero-shotting eval, ask for a complex program in 10-20 steps, each time adding a new requirement. I bet Codex would end up with the most over-engineered code. Claude is better, but also very far from a good human.
English
0
0
0
191
David Cramer
David Cramer@zeeg·
Has anyone done any kind of research on the complexity of outputs from different models? Anecdotal, but I'm convinced that Codex is significantly wrose than Claude. I dont know how you'd reliably measure it, but even a subjective "yeah most humans would agree this is better"..
English
32
0
58
11.2K
Zsolt Ero
Zsolt Ero@hyperknot·
@tenobrus They are already not serving their best models publicly. The ones on the API are not the ones winning math and programming Olympiad problems. Those are internal-accessible only. Also, GPT 5.4 is a Sonnet sized model, their "Opus" version is not public either.
English
0
0
1
193
Tenobrus
Tenobrus@tenobrus·
i'll reiterate since it's buried in this longpost: how much longer do you *really* think OpenAI and Anthropic will continue to serve their raw frontier models through publicly accessible APIs? that was always a revenue and data bootstrap. it's ending within the next two years.
Tenobrus@tenobrus

"who cares if Cursor used Kimi 2.5 as a base, starting with a commoditized pretrained model was always the right move anyway" nah, sorry, what it proves is Cursor is still fundamentally reliant on frontier labs. Kimi 2.5 is only as capable as it is because it's a distill of Opus 4.5. the only open model that ever showed it was capable of trading blows w the frontier was deep seek, and it really seems that moment has passed. the question was whether Cursor could really break the dependency chain and start building improvements based entirely on their own expertise and data. and Composer 2 shows that they *can't*, that they need the general model quality and intelligence from 4.5 to get anywhere, and that really what they're doing is laundering culpability through Chinese labs so they don't have to get their hands dirty doing distillation themselves. when Opus 5 and GPT 6 are significantly more capable along many dimensions, more RL with coding rollouts aren't going to be enough to save Composer 3, they'll either need to have caught up with whatever the frontier labs are doing internally, which right now we have pretty strong evidence they just don't have the research capacity for or... wait for another distill. and how much longer do you *really* think OpenAI and Anthropic will continue to serve their frontier models through publicly accessible APIs? that was always a revenue and data bootstrap. it's ending within the next two years.

English
53
13
692
67.1K
Arvid Kahl
Arvid Kahl@arvidkahl·
If you do AI inference via OpenAI’s API, you should use the flex tier for half price. My requests always try to use flex tier first, and on 429 / 500 errors, I use the default service tier. 95% of my requests are flex. 2 tries flex, then fall back to standard. Massive cost cut.
Arvid Kahl tweet media
English
31
6
172
20.2K
Zsolt Ero retweetledi
NIK
NIK@ns123abc·
🚨 MICROSOFT ABOUT TO SUE OPENAI & AMAZON >be microsoft >invest $1B in openai >gets exclusive azure cloud deal >invest another $10B+ >gets rights to 49% of profits +IP >Azure goes brrrrrr >Altman lies to board, quietly launches ChatGPT >board fires him for being a lying manipulative snake >Satya goes to war for Altman. saves his entire career >Altman retvrns in 5 days >immediately purges everyone who purged him >full control. no oversight. thanks Satya! >fast forward to 2025 >OpenAI restructures from non-profit to PBC >MSFT $13.8B is now worth $135B. 10x return >plus 27% of OpenAI >but gives up cloud exclusivity + profit share >KEEPS API clause >all API calls contractually MUST route through Azure >Satya thinks life is good lol >5 months later >Sam Altman becomes strong enough to betray you >"raises $110B round" >doesn't need satya daddy's money anymore >announces $50B deal with AMAZON >$138B in AWS cloud commitments >amazon and openai claim they built some cope called a "Stateful Runtime Environment" >Microsoft lawyers hmmm >Altman: it's not what it looks like. i can totally explain >so it's technically not an API call because it's "stateful" >and it's a... "Runtime Experience" >totally di!erent thing >pls ignore the TCP packets lol >Microsoft engineers look at the SRE architecture >"THIS IS NOT TECHNICALLY POSSIBLE without violating the contract." *Satya finds out he's been cucked* Microsoft exec literally tells FT: "We know our contract. We will sue them if they breach it." >AWS quietly gives employees a memo on which words are legally safe lmao >can say: "powered by" or "enabled by" or "integrates with" OpenAI >cannot say: "enables access to" or "calls on" ChatGPT >also cannot suggest frontier models are "available on AWS" Microsoft: "If Amazon and OpenAI want to take a bet on the creativity of their contractual lawyers, I would back us, not them." Scam Altman strikes AGAIN.
NIK tweet media
Financial Times@FT

Microsoft weighs legal action over $50bn Amazon-OpenAI cloud deal ft.trib.al/6LZe39E

English
465
1.5K
14.3K
2.1M
Zsolt Ero
Zsolt Ero@hyperknot·
@Rasmic @romainhuet They usually say the one you give is better. Ask them in a new conversation which one is better from the two.
English
0
0
0
68
Micky
Micky@Rasmic·
I got both gpt-5.4 and opus 4.6 to generate a plan... I gave gpt's plan to opus and it admitted that it was a better plan lol
Micky tweet media
English
71
18
580
68.4K
Zsolt Ero
Zsolt Ero@hyperknot·
@OfficialLoganK 3.1 is unusable for creative writing. We have an email drafting pipeline carefully tuned over months, which worked perfectly on 3.0. 3.1 is unusable, it's a codemaxxed model like some of the GPT 5 series. Where 3.0 writes 4 beautiful paragraphs, 3.1 writes 4 bullet points.
English
0
0
4
291
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
PSA: we are turning down Gemini 3 Pro next Monday March 9th. You can upgrade to 3.1 Pro Preview which improves on lots of the things folks gave feedback about on the first Gemini 3 rev. Please keep the feedback coming : )
English
262
81
1.8K
594.5K
Zsolt Ero retweetledi
Justin Starner
Justin Starner@Justin_Starner·
I am excited to announce that BigBoy Charging is now officially powered by OpenFreeMap! This latest addition, combined with the incredible mapcn, completes my goal of being able to provide this service for free to all EV drivers 🔋⚡ Huge thanks to @hyperknot and @sainianmol16 for making this dream of mine a reality 🫶
Justin Starner tweet media
English
1
1
8
1.6K
Zsolt Ero retweetledi
Sam Rose
Sam Rose@samwhoo·
Reminder before Sonnet 5 drops: SWE-bench tests a model’s ability to fix small Python bugs in 12 repos in one-shot with appropriate context fed to it. It’s not a measure of agentic coding ability. I wrote in detail about a bunch of benchmarks and what they mean, link below.
English
5
8
73
6.6K
ben hylak
ben hylak@benhylak·
it's silly that google charges ~50% more when you call a model through vertex
English
11
0
51
14.6K
Ole Lehmann
Ole Lehmann@itsolelehmann·
what is the country in europe with the most pro-entrepreneur/pro-growth mindset? talking about general sentiment, not tech bubble sentiment in these countries
English
226
3
121
48.7K