UhuBuhu

482 posts

@kecksbe

Joined April 2022
49 Following, 15 Followers
UhuBuhu
UhuBuhu@kecksbe·
@Jobox05 @flash_canadian There is no big news. OpenAI is stopping Sora to get more compute so it doesn't fall behind on the LLM side, since Google and Claude caught up. The only real news is that other labs are making faster progress on AI than OpenAI, and people think this is a sign of the AI downfall.
English
1
0
1
35
CanadianFlash
CanadianFlash@flash_canadian·
Watching the downfall of AI is so satisfying
English
4
0
20
602
UhuBuhu
UhuBuhu@kecksbe·
@nicdunz Didn't they announce an update was coming, or weren't there leaks about a voice mode 1.5 or something like that?
English
0
0
3
905
nic
nic@nicdunz·
chatgpt voice is so good now. what did they do?? why haven't they said anything about it??
English
8
1
100
10.1K
UhuBuhu
UhuBuhu@kecksbe·
@alexgrama @kimmonismus Depends. In my opinion it shows that LLMs still lack a really basic part of reasoning, but how relevant that part really is, I don't know. We will see in the coming years.
English
0
0
0
36
gramanoid
gramanoid@alexgrama·
@kimmonismus is this really a relevant test? it's absolutely fantastic at what matters. who the fuck cares about this stupid ass question? same with the strawberry test and all the other retarded ones before that.
English
4
0
2
625
Chubby♨️
Chubby♨️@kimmonismus·
ChatGPT 5.4 still doesn't get it.
Chubby♨️ tweet media
English
113
24
700
80.5K
UhuBuhu
UhuBuhu@kecksbe·
@thsottiaux I still don't get why you name the models Codex, the CLI Codex, and the web thing Codex. Name the model GPT 5.3 Code or Coding.
English
0
0
0
45
Tibo
Tibo@thsottiaux·
Our naming team has been cooking
English
130
15
1.1K
93K
UhuBuhu
UhuBuhu@kecksbe·
@emollick For me Codex was a bigger leap than o3, because Sonnet was already pretty close, at least in coding.
English
0
0
0
50
Ethan Mollick
Ethan Mollick@emollick·
From an AI user perspective, the four big leaps so far in ability: 1. GPT-3.5 (ChatGPT, November 2022) 2. GPT-4 (Spring 2023) 3. Reasoners (starts with o1-preview, but the real deal was o3, Spring 2025) 4. Workable agentic systems (Harness + good reasoner models, December 2025)
English
117
139
2.5K
242K
Justin Schroeder
Justin Schroeder@jpschroeder·
Sonnet 4.6 and Opus 4.6: Anthropic trained Sonnet 5, and it almost outperformed Opus 4.5. With some RL on benchmarks, it could even outperform 4.5, but it was now dramatically smaller (cheaper). So? They renamed Sonnet 5 → Opus 4.6. But what of Sonnet? They distilled "Sonnet 5" into an even smaller model and rebadged it Sonnet 4.6. So now both models are just a fraction of their original size and cost to run. Even better, it left a little bit of extra compute overhead, which can be sold for 6x the cost as "fast mode." The models are not *actually* better than what they replace but...margin. I can't blame them too much for that.
English
66
33
970
175.2K
UhuBuhu
UhuBuhu@kecksbe·
@OP__Nico @Angaisb_ I am fine, I already have ChatGPT for Codex and Claude for Opus. The free tier of Gemini gives me everything I need from Gemini, but ty for the tip.
English
1
0
1
20
OP Nico
OP Nico@OP__Nico·
@kecksbe @Angaisb_ As a subscription it's really valid tho! And u get really generous 4.6 opus use in antigravity with your subscription! Give it a try, to me it's really the ultimate deal :)
English
1
0
1
31
Angel 🌼
Angel 🌼@Angaisb_·
Time to go back to Plus. Pro was amazing while it lasted.
Angel 🌼 tweet media
English
15
3
153
30.1K
UhuBuhu
UhuBuhu@kecksbe·
@OP__Nico @Angaisb_ The problem with Gemini is that Claude is worse in Antigravity and that Codex is smarter than Claude, but if dev isn't your main focus it's a great deal.
English
1
0
1
22
OP Nico
OP Nico@OP__Nico·
@Angaisb_ Man, since I switched to gemini I don't think I can go back to anything. You get really good limits within the UI + Antigravity use. Antigravity opens up doors to opus 4.6 too, which I had running for at least 4 hours yesterday building Swift code. It's the ultimate deal really
English
2
0
0
442
adi
adi@adonis_singh·
achieves about gpt-5.3-codex-low levels of accuracy (at xhigh itself) at a noticeably faster speed. way faster and smarter than the previous 5.1-codex-mini model though, which is a big win
adi tweet media
English
2
0
15
1K
UhuBuhu
UhuBuhu@kecksbe·
@thsottiaux The terminal UI sucks. In Claude Code I get a way better overview of what the model is actually doing. In Codex, especially back when the model was slow, I often asked myself: is the model working or did it get stuck? But overall the model is great; maybe try to speed it up even more.
English
0
0
0
37
Tibo
Tibo@thsottiaux·
What could we do better on Codex? App, model, strategy and features… what’s wrong in how we approach things that we should improve immediately?
English
1.2K
11
948
101.2K
UhuBuhu
UhuBuhu@kecksbe·
@adonis_singh I want to use 5.2 instant for quick, easy searches, but the model being this much worse than the thinking model and even 4o defeats the whole purpose if I need to write 5 prompts for it to do what I want.
English
0
0
0
6
UhuBuhu
UhuBuhu@kecksbe·
@adonis_singh I feel like it's always dumber. Simple example: I asked both for the current state of a tournament named jbb. 5.2 instant told me 3 times that I probably meant jbbl. I told it no, search for jbb, and it searched again for jbbl. After 5 attempts I went to 4o and it instantly searched for jbb by itse
English
1
0
0
19
ThePrimeagen
ThePrimeagen@ThePrimeagen·
it would be funny if 5.3 is just 5.2 and it's an experiment to see how much psychosis there really is
English
86
38
3.2K
99.2K
TestingCatalog News 🗞
TestingCatalog News 🗞@testingcatalog·
BREAKING 🚨: CLAUDE OPUS 4.6 IS ROLLING OUT ON THE WEB, APPS AND DESKTOP! TESTING TIME 🔥
TestingCatalog News 🗞 tweet media
English
29
38
776
185.2K
Wolf
Wolf@KryptoWolfGER·
Are we going live or are we giving up? #Bitcoin
German
164
11
412
23.8K
Angel 🌼
Angel 🌼@Angaisb_·
No Sonnet 5 today. Patience Cave won this time.
Angel 🌼 tweet media
English
19
1
391
20.3K
UhuBuhu
UhuBuhu@kecksbe·
@kimmonismus I am missing the new Gemini agentic flash, which is in my opinion the best one right now, and mixtral ocr, which was the best one before Gemini flash agentic vision. Paddle ocr is a good metric but not on mixtral or Gemini level.
English
0
0
2
527
Chubby♨️
Chubby♨️@kimmonismus·
So we got SOTA OCR with just 0.9B params. GLM-OCR is a lightweight (0.9B params) multimodal OCR system built on the GLM-V encoder–decoder stack. Love it!
Chubby♨️ tweet media
Z.ai@Zai_org

Introducing GLM-OCR: SOTA performance, optimized for complex document understanding. With only 0.9B parameters, GLM-OCR delivers state-of-the-art results across major document understanding benchmarks, including formula recognition, table recognition, and information extraction. Weights: huggingface.co/zai-org/GLM-OCR Try it: ocr.z.ai API: docs.z.ai/guides/vlm/glm…

English
15
44
783
69.9K
UhuBuhu
UhuBuhu@kecksbe·
@TheRealSynetos @camsoft2000 We need the Claude Code UI with a router that actually works, routing between Opus and Codex. Oh, and Codex needs a speed boost.
English
0
0
0
24
Synetos
Synetos@TheRealSynetos·
@camsoft2000 We just need the cozy feeling of Claude Code UI/UX mixed with Codex capabilities and that's it. I just love how CC feels to use. And also we need Codex models that are better at frontend. Or if anyone has a workflow to make better frontend, I'm begging you to share it please
English
2
0
2
902
camsoft2000
camsoft2000@camsoft2000·
Having used Claude Code with Opus 4.5 a lot recently, I can tell you that for me, Codex CLI with GPT-5.2-Codex wins. It's a relief to go back to Codex; it feels like home. Not saying CC is rubbish, it's just Codex gets it done. Claude Code is too lazy, replies to my queries without re-checking code, and requires more steering and planning. Codex just doesn't need all that. I'm sure some will disagree, and it's subjective for sure, but I much prefer Codex to Claude Code. It suits the way I work.
English
34
3
234
23.9K
Apoorv
Apoorv@apoorvdarshan·
@theCTO isn't it slow?
English
3
0
2
2.3K
adam
adam@theCTO·
i miss the real Opus 4.5. rip legend
English
34
8
799
82.8K
Sarcastic Sharma
Sarcastic Sharma@sarkasticsharma·
Girls left or right?
Sarcastic Sharma tweet mediaSarcastic Sharma tweet media
English
2K
256
8.2K
13.2M