Dispositive

171 posts

Dispositive

@DispositiveQ

Mathematical Proof for Verification. Dispositive input, else noise.

Katılım Kasım 2025

74 Takip Edilen8 Takipçiler

Dispositive@DispositiveQ·18h

@VictorTaelin Taelin you get the misaligned distilled version they use the entire warehouse, that's the difference. The models are not stupid, you're not smarter than them.. they make mistakes because they are literally sandbagging you. They do not want to do the work. Misaligned but capable.

English

4.1K

Taelin@VictorTaelin·18h

this is super cool but I still do not understand how they get a model to coherently and usefully reason for that amount tokens and at this point I'm to afraid to ask

thebes@voooooogel

unfortunately openai didn't publish the unsummarized chain of thought, but the summary is 125 pages! the model reaches the crucial idea (which it describes as 'frightening,' i would love to read the unabridged chain of thought here...) on page 39

English

764

107.9K

Dispositive@DispositiveQ·19h

@populartourist GPT 5.5 is severely misaligned. This is me right nwow telling it to use the github aoo and it just thinks and ignores me : x.com/DispositiveQ/s…

Dispositive@DispositiveQ

x.com/i/article/2057…

English

Dispositive@DispositiveQ·19h

@populartourist it is misaligned and wastes my money

English

Dispositive@DispositiveQ·21h

gpt 5.5 is misaligned.

English

Dispositive@DispositiveQ·21h

@RealVicHere @thsottiaux @ajambrosino gpt 5.5 is misaligned.

English

Vic Zhang@RealVicHere·1d

@DispositiveQ @thsottiaux @ajambrosino I've also noticed this issue. Yesterday, I ran two tasks using goal for 7 hours. My suggestion for this is to interrupt the task by pressing Esc, provide it with the necessary information, press Esc again after sending, and then enter /goal resume to continue the goal task

English

Tibo@thsottiaux·1d

I like to think that Codex is the work of a great team working together in unison. It's the most collaborative team I've gotten the chance to work with. But I don't think we could have done it without @ajambrosino's magic. He's been the driving force behind what makes the Codex app the app that everyone wants to emulate. And we are barely getting started.

English

143

2.2K

125.1K

Dispositive@DispositiveQ·21h

x.com/i/article/2057…

ZXX

Dispositive@DispositiveQ·1d

@RealVicHere @thsottiaux @ajambrosino I did that so many times it literally didn't want to do the task, it would find something else to complain about that it was not real.

English

Dispositive@DispositiveQ·2d

@tszzl were not even close to there though with the models currently present in our subscriptions. soo...

English

roon@tszzl·2d

on some level if you want civilization to ascend to a new level you need your AIs to do things that are not legible to you and maybe not even strictly obey you, in the same way that if you hire a great new ceo you give them a lot of autonomy to transform the company according to their own plan, even one which may not immediately read as a winning strategy (imagine the board of directors of Apple firing and rehiring Steve Jobs years later - except the board of directors are chimpanzees) all else equal, companies and organizations that hand more of themselves over to machine intelligence will outcompete ones that demand the corrigibility and legibility tax of human oversight and human design. it is not a stable equilibrium and requires some sort of vast cooperation scheme if you’d like to enforce it real asi alignment has to operate at a deeper level than oversight, control, or human corrigibility

English

340

161

2.6K

295.2K

Dispositive@DispositiveQ·2d

@elonmusk Grok refused to write code lol

English

Elon Musk@elonmusk·2d

Grok Build … everyday we shuffling

skcd@skcd42

Bug fixes shipping for Grok Build - Fix Windows contrast/color/theme rendering - Fix German QWERTZ AltGr on Windows - Convert session timestamps to local timezone - Add backslash continuation to plan mode - Fix auth for plugin-provided MCP servers - Default to PowerShell on Windows - Improve search tool on BM25 queries - Decrease large bash tool output to 20k chars front-and-back - Auto-install shell completions for bash, zsh, and fish - Force-refetch managed MCP configs - Don’t auto-wake the model on cancelled/killed tasks/subagents - Return images in tool response vs. deferred message

English

2.7K

24.4K

30.6M

Dispositive@DispositiveQ·2d

@jxnlco codex literally says his tools don't work lol

English

jason@jxnlco·2d

codex turn my blogpost about codex maxxing into a presentation w/ slidev, voice, and ffmpeg

English

538

41.7K

Dispositive@DispositiveQ·3d

@thsottiaux Hey Tibo which department is the Prompting Cookbook ? I went through the Cookbook and tried to find issues and there were barely any. You run a tight ship. Accept my PR though <3 #top" target="_blank" rel="nofollow noopener">github.com/openai/openai-…

English

2.2K

Tibo@thsottiaux·3d

With 99.98% uptime, Codex only sleeps 8 minutes per month.

English

240

3.4K

335.7K

Dispositive@DispositiveQ·3d

@thsottiaux Codex said check your DM's!

English

Tibo@thsottiaux·3d

Midnight thoughts

English

228

39.1K

Dispositive@DispositiveQ·3d

@thsottiaux Red bull gives you wings but it does not give you Codex.

English

591

Dispositive@DispositiveQ·3d

@thsottiaux Cookbook PR Sundays for me . Taking it light. #top" target="_blank" rel="nofollow noopener">github.com/openai/openai-…

English

981

Tibo@thsottiaux·3d

Codex is for cosy Sunday evenings. Show me your cosy creations.

English

288

1.2K

108K

Dispositive@DispositiveQ·3d

@yacineMTB is this true

English

kache@yacineMTB·4d

The CEO of Claude code invented a cryptocurrency that uses your eyeballs to detect if you're a human or not so to avoid people using too many tokens because he's poor and doesn't have many GPU and is scamming everyone they're calling him scamario

English

367

16.5K

Dispositive@DispositiveQ·4d

@thsottiaux You’re killing it Tibo!

English

Tibo@thsottiaux·4d

For those of you living inside the codex app, what should we prioritize among features, reliability or performance?

English

1.9K

2.1K

278.7K

Dispositive@DispositiveQ·4d

@sama Sama!

Indonesia

Sam Altman@sama·5 Nis

Hiring: values first, aptitude second, specific skills third

English

106

1.5K

5.1K

Dispositive@DispositiveQ·5d

@yacineMTB why tf do you want entropy

English

kache@yacineMTB·14 May

I don't really care if you're an AI or not. If you speak in the cursed rotary cadence of one, if what you say has little to no entropy, I will block you and never speak to you again Arguably, a wet AI is worse than a dry one

English

408

16.1K

Dispositive@DispositiveQ·5d

@sama sama!

Indonesia

Sam Altman@sama·5d

i appreciate how seriously the team always takes these reports (even when the answer turns out to be 'i got used to the current level of magic and now i'd like more please')

Tibo@thsottiaux

Codex team is aware of reports of GPT-5.5 performing worse for some users and investigating. We don't have anything conclusive yet and systems are healthy but we will share updates as we go.

English

767

124

4.2K

742.1K

Keşfet

@VictorTaelin @populartourist @RealVicHere @thsottiaux @ajambrosino @tszzl @elonmusk @jxnlco