Eric

970 posts

Eric banner
Eric

Eric

@ericmitchellai

chatgpt posttraining @openai. building personal agi. I like ai and music and some other stuff

United States Katılım Aralık 2017
583 Takip Edilen10.9K Takipçiler
atulit
atulit@atulit_gaur·
seriously WHO TF is responsible for chatgpt's post training? i want to have a fucking word with you
English
20
4
158
12.5K
Charlie Marsh
Charlie Marsh@charliermarsh·
We've entered into an agreement to join OpenAI as part of the Codex team. I'm incredibly proud of the work we've done so far, incredibly grateful to everyone that's supported us, and incredibly excited to keep building tools that make programming feel different.
English
275
139
3K
389K
EdDiboi
EdDiboi@eddiboi·
@ericmitchellai @atulit_gaur Hi eric, catastrophic things happen when I ask GPT-5.4 Pro to generate an image - it spams the imagen tool and I end up hitting the rate limit
English
2
0
1
754
Eric
Eric@ericmitchellai·
@theo @atulit_gaur thanks theo, glad you really liked 5.4. Really proud of the team for that one
English
0
0
0
133
Tibo
Tibo@thsottiaux·
@DavidOndrej1 Smoking incredible code I would walk a mile for code from GPT-5.4
English
25
5
500
19.5K
David Ondrej
David Ondrej@DavidOndrej1·
GPT 5.4 *is not* better than Opus 4.6 i have no idea what people are smoking
English
181
14
931
133.8K
Eric
Eric@ericmitchellai·
@shadcn glad you like it
English
0
0
5
506
Moira
Moira@Vera28765582815·
@ericmitchellai @nicdunz 5.4 constantly does this with me. It seems like using the web tool makes it behave like the prior message was the first one in the conversation. I therefore explicitly ask it not to use the web tool as it will mess up your work. It also makes it reasoning about the task.
English
1
0
1
36
nic
nic@nicdunz·
ive noticed 5.4 does things like this. 5.2 never did this.
nic tweet media
English
14
1
123
17.2K
Eric
Eric@ericmitchellai·
machines that build machines that build machines
English
46
61
543
47.2K
Eric
Eric@ericmitchellai·
@Miles_Brundage thanks miles. If you ever run into an example you feel comfortable sharing, would love to see <3 glad you're finding the model useful!
English
0
0
4
221
Miles Brundage
Miles Brundage@Miles_Brundage·
@ericmitchellai I don't love sharing links / logs since they often involve non-public docs but here's an example prompt - "Given what you know of me, what should I read right now?" It then suggested reading docs *from* the (ChatGPT) Project folder itself, not public reading material.
English
1
0
7
604
Miles Brundage
Miles Brundage@Miles_Brundage·
The most common failure mode I've observed with GPT-5.4 is misunderstanding the intent behind the prompt (but then doing a good job at what it thought the task was). I'm not sure if this is a regression or not, but it stands out by contrast w/ the task execution
English
18
1
71
5.8K
Xeophon
Xeophon@xeophon·
codex but it isn't going overboard with defensive coding who's working on this
English
9
0
52
4.4K
Miles Brundage
Miles Brundage@Miles_Brundage·
GPT-5.4-Thinking has a very non-annoying writing style which is nice
English
34
7
526
32K
Eric
Eric@ericmitchellai·
@_simonsmith @dejavucoder Thanks a lot Simon. Verbosity in 5.4 is actually on our radar already :) Appreciate all your feedback!
English
1
0
8
197
Simon Smith
Simon Smith@_simonsmith·
I made a nonfiction writing benchmark to evaluate this and, as much as I love the writing improvements in GPT-5.4, it confirmed my observation that by default GPT-5.4 is excessively verbose. I have tamed it with custom instructions but here's a heat map showing its strengths and weaknesses relative to Opus and Sonnet.
Simon Smith tweet media
English
1
0
6
269
sankalp
sankalp@dejavucoder·
gpt 5.4 has improved in conversation. content wise the answers are rich too based on some soft questions / day to day life stuff i asked. opus 4.6 however is still much more enjoyable to talk to. gpt 5.4 just has some slop patterns still.
English
7
1
92
6.6K
Eric
Eric@ericmitchellai·
@nicdunz Can you give any convo share links?
English
3
0
5
1.1K
nic
nic@nicdunz·
chatgpt seems to be having an issue where its stopping thinking and doesnt respond randomly
English
7
0
36
3.3K
Eric
Eric@ericmitchellai·
@nk @z Can you give session id from /feedback ?
English
0
0
0
65
from the future
from the future@nk·
Chat GPT 5.4 is really frustrating to work with. Awful hallucinations and laziness. It's hard to tell if it's "smarter" when it's so difficult to steer.
English
1
0
4
845
Eric
Eric@ericmitchellai·
@xw33bttv Can you provide a convo share link to any of the failures? For me the model gets it right 100% of the time (6/6 tries). Not sure what's going on for you.
English
4
0
44
2.5K
Lex
Lex@xw33bttv·
>It’s our most capable and efficient frontier model for professional work Can't even do basic math lmao
Lex tweet mediaLex tweet media
English
27
18
145
14.5K