Jeff Harris

1.4K posts

@jeffintime

the malleability of minds. Codex @openai

Oakland · Joined January 2008

1.1K Following · 3.9K Followers

Jeff Harris reposted

Boaz Barak@boazbaraktcs·2d

I think preserving models for internal deployment is risky. I encourage Anthropic to release Mythos, even if it’s a version that over refuses on cyber tasks or routes risky responses to a weaker model, as we did with codex.

Jeff Harris reposted

Tibo@thsottiaux·4d

Three million people are now using Codex weekly - up from two million a little under a month ago. Incredible to see the growth. Thank you to all of you and to the ecosystem we’re part of. To celebrate, we’re resetting rate limits so you can keep building, and we’ll reset them every additional 1M users until we reach 10M, so we can keep celebrating along the way. Enjoy and thank you!

Jeff Harris@jeffintime·3 Apr

This is one of the biggest unsolved product problems in AI. Users know urgency. Platforms know when capacity is scarce and abundant. Building products that can turn those two signals into coordinated behavior is going to matter a ton.

Tibo@thsottiaux

With Codex there is quite the gulf in load between peak and off-peak times, and we would like to achieve more of a smoother traffic pattern as that would be a more optimal use of our compute. We have ideas, but curious what you all think we should do? Would more usage during off-peak and surge multiplier during peak times make sense?

Jeff Harris@jeffintime·17 Mar

@SrinivasSi78619 @dkundel @ajambrosino are you still running into this? we landed a few fixes that may have solved it, but would love to know for sure

Srinivas Sivaratri@SrinivasSi78619·14 Mar

@dkundel @ajambrosino current update after signing in -

dominik kundel@dkundel·14 Mar

Sometimes we ship features so fast we forget to share them 😅 Codex in the app can now read the integrated terminal! Thanks @ajambrosino!

Leon@LeonKuzmin_

@dkundel @OpenAIDevs @raycast I’m constantly copying command output from the app command window. Shouldn’t codex be aware of the command output?

Jeff Harris@jeffintime·16 Mar

why i love OAI: there are so many excitements cooking, that i can't even tell what my boss is vagueposting about

Tibo@thsottiaux

Working at OpenAI is fun because questioning everything and taking risks is part of the culture. Within Codex, the team asks itself how we could make it an order of magnitude better every few months and then sets most things aside to go and do it across the entire stack. Some examples were the Codex App and our first deployment of Cerebras inference with WebSockets. We are now well under way on the next bet and it’s making even our best engineers nervous as it’s at the edge of what’s possible today.

Jeff Harris reposted

Dwayne@CtrlAltDwayne·11 Mar

No serious dev should be using Claude Code at this point, just use Codex. Look how OpenAI handles issues transparently and then resets when something goes wrong. Instant accountability. This is setting the standard for how all labs should treat customers.

Tibo@thsottiaux

OK, Codex is back and stable and we should be good for a while. Reset button pressed, should see it in a bit

Jeff Harris reposted

Rohan Varma@rohanvarma·10 Mar

If you want AI Code Review, but don't want to pay $25 per review (not a typo), check out Codex Review! It leverages frontier Codex models, finds complex issues, and is 100% usage-based. Most runs should cost ~$1 or less. developers.openai.com/codex/integrat…

Jeff Harris@jeffintime·3 Mar

@piquopiquo nudging the model to use subagents is also helpful

Piquo@piquopiquo·3 Mar

@jeffintime don’t even think you would need to speed the token/sec even further at this point. Lightning fast compared to the rest of the market. Do you have suggestions how to handle huge codebases with the current context windows? MCPs perhaps? Thank you!

Jeff Harris@jeffintime·3 Mar

Spark’s the fastest model we’ve ever made. Now rolling out to the heaviest Codex users on Plus.

Jeff Harris@jeffintime·3 Mar

@andrewgazelka spark is powered by a new kind of capacity from our partners at Cerebras, so supply is more limited than on other models. more details openai.com/index/introduc…

Andrew Gazelka@andrewgazelka·3 Mar

@jeffintime why not give unbatched regular codex like claude /fast ?

Jeff Harris@jeffintime·3 Mar

@ManiacSelinsky we had a brief sev with overtriggering this check last night ... really sorry about the disruption 😔 x.com/thsottiaux/sta…

Tibo@thsottiaux

Codex is back up. We introduced an issue that caused requests to be blocked and rejected as high cyber risk. This was fixed in ~8 mins and we are now back online and fully operational. Team will reset rate limits, that’s been a while. Enjoy one on us.

Maniac Selinsky@ManiacSelinsky·3 Mar

@jeffintime what is going on with your cybersecurity checks. Massive amounts of false positives being reported.

Jeff Harris@jeffintime·3 Mar

@MoonGotArt yep 128K. we'll keep pushing on context in future fast models

MoonGotArt@MoonGotArt·3 Mar

@jeffintime Does it still only have 100k context?

Jeff Harris@jeffintime·3 Mar

@colinsolvely for this round we look at established tenure on Codex (at least 1mo) and then usage over the last week. we'll keep adjusting the qualifying threshold as capacity permits

Colin@colinsolvely·3 Mar

@jeffintime What’s the definition of a heavy user?

Jeff Harris@jeffintime·3 Mar

@shailesz Huh shouldn't be the case. DM me the email address for your pro account?

misato 🌸@shailesz·3 Mar

@jeffintime bro im on pro, i dont have spark. why?

Jeff Harris@jeffintime·3 Mar

@piquopiquo helpful to know. we'll keep pushing on context in future fast models

Piquo@piquopiquo·3 Mar

@jeffintime really love it, the context window atm is sadly my biggest bottleneck with it!

Jeff Harris@jeffintime·3 Mar

@leodev helpful feedback 🙏

Leo@leodev·3 Mar

@jeffintime Can you increase the limits for it for Pro users. It runs out so quickly... especially in the 5 hour window

Jeff Harris@jeffintime·3 Mar

@tina_jjjj it's the same, so unfortunately no: 5.3-spark is text only. but stay tuned for future fast models 👁️

JT@tina_jjjj·3 Mar

@jeffintime can images be uploaded on this ? or is this different from codex-spark?

Jeff Harris reposted

Peter Bakkum@pbbakkum·25 Feb

gpt-realtime-1.5 is the best native audio model on the Scale AudioMultiChallenge benchmark -- this is a significant jump in capability by this measure. There are models that outperform it but they are reasoning models without native audio output.

Jeff Harris reposted

adi@adonis_singh·25 Feb

uhhh WTF?! gpt-5.3-codex gets 86% on IBench, beating out all other models massively. I was NOT expecting this

Jeff Harris@jeffintime·24 Feb

it’s early innings for optimizing LLM latency in long-running harnesses

Cline@cline

We tested @OpenAI's new WebSocket connection mode for the Responses API in Cline and the early numbers are wild.
Instead of resending full context every turn, WebSocket mode keeps a persistent connection and sends only incremental inputs.
With 5.2 Codex, results vs the standard API:
→ ~15% faster on simple tasks
→ ~39% faster on complex multi-file workflows
→ Best cases hitting 50% faster
WebSocket handshake adds slight TTFT overhead on short tasks, but it gets amortized fast. On heavier workloads with dozens of tool calls, the speed gains are massive.
Still expanding our test sample, but this is a very promising step forward for every Cline user. Faster AI coding is coming.
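A rough way to see why incremental inputs help more on long, tool-heavy sessions: the sketch below is a toy model only, not the Responses API or Cline's harness. The function names and the 2 KB per-turn size are assumptions made up for illustration. It counts upload volume only and says nothing about the latency figures quoted above.

```python
# Toy model (hypothetical, not any real API): compare total bytes uploaded
# over a session when every turn resends the whole history versus when
# each turn sends only its new input over a persistent connection.

def full_resend_bytes(turn_sizes):
    """Total upload when each turn resends all prior inputs plus the new one."""
    total = 0
    history = 0
    for size in turn_sizes:
        history += size   # context grows by this turn's input
        total += history  # the whole context is uploaded again
    return total


def incremental_bytes(turn_sizes):
    """Total upload when each turn sends only its new input."""
    return sum(turn_sizes)


if __name__ == "__main__":
    turns = [2000] * 30  # assumed: 30 tool-call turns of about 2 KB each
    print("full resend:", full_resend_bytes(turns))
    print("incremental:", incremental_bytes(turns))
```

Under these made-up numbers the full-resend total grows roughly with the square of the turn count while the incremental total grows linearly, which fits the observation that the gap widens on workloads with dozens of tool calls.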
