Eric

971 posts

Eric

@ericmitchellai

chatgpt posttraining @openai. building personal agi. I like ai and music and some other stuff

United States เข้าร่วม Aralık 2017

583 กำลังติดตาม10.9K ผู้ติดตาม

Eric@ericmitchellai·51m

@eddiboi @atulit_gaur can you provide any convo share links where this happened?

English

EdDiboi@eddiboi·15h

@ericmitchellai @atulit_gaur Thank you!

English

atulit@atulit_gaur·1d

seriously WHO TF is responsible for chatgpt's post training? i want to have a fucking word with you

English

158

12.6K

Eric@ericmitchellai·12h

@DicksonPau @atulit_gaur What needs to be better dickson

English

107

Dickson Pau@DicksonPau·1d

@ericmitchellai @atulit_gaur Please please do better!!!

English

540

Eric@ericmitchellai·15h

@charliermarsh @andrew_n_carr wow, sick, welcome Charlie! Go tigers!

English

350

Charlie Marsh@charliermarsh·17h

We've entered into an agreement to join OpenAI as part of the Codex team. I'm incredibly proud of the work we've done so far, incredibly grateful to everyone that's supported us, and incredibly excited to keep building tools that make programming feel different.

English

275

140

395.5K

Eric@ericmitchellai·15h

@Kates_AI @atulit_gaur OOC do custom instructions help here?

English

106

Noah Gordon@Kates_AI·1d

@ericmitchellai @atulit_gaur Can chat not speak in bullet points for pages. Thanks :)

English

510

Eric@ericmitchellai·15h

@eddiboi @atulit_gaur Thanks for the report, will flag this!

English

137

EdDiboi@eddiboi·1d

@ericmitchellai @atulit_gaur Hi eric, catastrophic things happen when I ask GPT-5.4 Pro to generate an image - it spams the imagen tool and I end up hitting the rate limit

English

758

Eric@ericmitchellai·15h

@theo @atulit_gaur thanks theo, glad you really liked 5.4. Really proud of the team for that one

English

136

Theo - t3.gg@theo·1d

@ericmitchellai @atulit_gaur You did a great job

English

924

Eric@ericmitchellai·6d

@thsottiaux @DavidOndrej1 lace up i want to see some steps tibo

English

280

Tibo@thsottiaux·6d

@DavidOndrej1 Smoking incredible code I would walk a mile for code from GPT-5.4

English

500

19.5K

David Ondrej@DavidOndrej1·6d

GPT 5.4 *is not* better than Opus 4.6 i have no idea what people are smoking

English

181

931

133.8K

Eric@ericmitchellai·6d

@shadcn glad you like it

English

506

shadcn@shadcn·6d

Nobody move. GPT-5.4 is working amazingly.

shadcn@shadcn

Been trying to make Codex work for me but it overthinks everything, even simple stuff. Tried different reasoning levels, no difference. It just tries too hard. Good at reviewing others work though.

English

107

1.9K

184.4K

Eric@ericmitchellai·11 Mar

@Vera28765582815 @nicdunz Share link?

English

Moira@Vera28765582815·11 Mar

@ericmitchellai @nicdunz 5.4 constantly does this with me. It seems like using the web tool makes it behave like the prior message was the first one in the conversation. I therefore explicitly ask it not to use the web tool as it will mess up your work. It also makes it reasoning about the task.

English

nic@nicdunz·10 Mar

ive noticed 5.4 does things like this. 5.2 never did this.

English

123

17.2K

Eric@ericmitchellai·11 Mar

machines that build machines that build machines

English

543

47.2K

Eric@ericmitchellai·10 Mar

@Miles_Brundage thanks miles. If you ever run into an example you feel comfortable sharing, would love to see <3 glad you're finding the model useful!

English

221

Miles Brundage@Miles_Brundage·10 Mar

@ericmitchellai I don't love sharing links / logs since they often involve non-public docs but here's an example prompt - "Given what you know of me, what should I read right now?" It then suggested reading docs *from* the (ChatGPT) Project folder itself, not public reading material.

English

604

Miles Brundage@Miles_Brundage·10 Mar

The most common failure mode I've observed with GPT-5.4 is misunderstanding the intent behind the prompt (but then doing a good job at what it thought the task was). I'm not sure if this is a regression or not, but it stands out by contrast w/ the task execution

English

5.8K

Eric@ericmitchellai·10 Mar

@hansonwng @xeophon :gogogo:

Euskara

Hanson Wang@hansonwng·10 Mar

@xeophon we will fix this ;)

English

368

Xeophon@xeophon·10 Mar

codex but it isn't going overboard with defensive coding who's working on this

English

4.4K

Eric@ericmitchellai·10 Mar

damn even ben likes it that's neat

ben@benhylak

i pretty much churned from chatgpt until gpt 5.4 i also think people are sleeping on 5.4 pro. it feels like we got o3 back, except now it has a really good sandbox (?) feels devin-ish

English

7.8K

Eric@ericmitchellai·9 Mar

@Miles_Brundage glad you like it miles

English

697

Miles Brundage@Miles_Brundage·9 Mar

GPT-5.4-Thinking has a very non-annoying writing style which is nice

English

526

32K

Eric@ericmitchellai·8 Mar

@_simonsmith @dejavucoder Thanks a lot Simon. Verbosity in 5.4 is actually on our radar already :) Appreciate all your feedback!

English

197

Simon Smith@_simonsmith·8 Mar

I made a nonfiction writing benchmark to evaluate this and, as much as I love the writing improvements in GPT-5.4, it confirmed my observation that by default GPT-5.4 is excessively verbose. I have tamed it with custom instructions but here's a heat map showing its strengths and weaknesses relative to Opus and Sonnet.

English

269

sankalp@dejavucoder·8 Mar

gpt 5.4 has improved in conversation. content wise the answers are rich too based on some soft questions / day to day life stuff i asked. opus 4.6 however is still much more enjoyable to talk to. gpt 5.4 just has some slop patterns still.

English

6.6K

Eric@ericmitchellai·8 Mar

@nicdunz Can you give any convo share links?

English

1.1K

nic@nicdunz·8 Mar

chatgpt seems to be having an issue where its stopping thinking and doesnt respond randomly

English

3.3K

Eric@ericmitchellai·8 Mar

@nk @z Can you give session id from /feedback ?

English

from the future@nk·8 Mar

@z @ericmitchellai I think this is just a bug. It gets confused what it is doing.

English

from the future@nk·8 Mar

Chat GPT 5.4 is really frustrating to work with. Awful hallucinations and laziness. It's hard to tell if it's "smarter" when it's so difficult to steer.

English

845

ค้นพบ

@eddiboi @atulit_gaur @DicksonPau @charliermarsh @andrew_n_carr @Kates_AI @theo @thsottiaux