Larry Lv

5.3K posts

Larry Lv banner
Larry Lv

Larry Lv

@larrylv

Post-Training @OpenAI. Training GPT-5.x Thinking models. // If I blocked you, blame my bot.

Bay Area, CA เข้าร่วม Aralık 2009
302 กำลังติดตาม1.7K ผู้ติดตาม
dax
dax@thdxr·
opencode employees are gullible due to selection effects
English
9
1
219
19.1K
Adi Ganesh
Adi Ganesh@_adiganesh·
@larrylv was a pleasure training this one with you!
English
1
0
1
54
Larry Lv
Larry Lv@larrylv·
Happy Gullible Day to whoever celebrates it.
English
0
0
5
221
Larry Lv รีทวีตแล้ว
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
I know this is pretty well established at this point, but Codex 5.3 is a much more effective model than Opus 4.6. I went back and forth on both for a bit, but haven’t touched Opus at all now for a full week. First model to get me off of Opus… ever. Good job Codex team.
English
335
219
5.3K
1.1M
Sanchen007
Sanchen007@mimighost008·
@larrylv @cherylnatsu Codex做ui的式样还是太单调了,感觉很多10年前的设计模式,你们对家出的default的ui就很好看,现在我都得去对家过一遍codex的css,专门改样式。另外目前5.3写app,喜欢写一个巨大的单文件app,好像没有模块化的意识
中文
1
0
0
74
夏雨婷
夏雨婷@cherylnatsu·
这几天高强度使用gpt-5.3-codex,我感觉opus的领先地位有点危险了
中文
17
0
112
27.9K
夏雨婷
夏雨婷@cherylnatsu·
@larrylv 理想情况:自己大大方方打开gdb开始调试或者仅仅是等crash后调查coredump打bt(copilot做到了,就跟我自己操作gdb一样,我不知道为什么它有权限) 次好情况:它就算没权限可以提示我要权限 保底情况:就算它要不到也可以提示我怎么操作把bt粘贴给他。 现状:他自己说自己没权限懵了在说废话原地转圈
中文
2
0
1
116
Larry Lv
Larry Lv@larrylv·
@cherylnatsu 谢谢反馈 🙏 理想情况下应该是 codex 检测到没有权限,然后来直接问用户要权限或者是要 bt 的结果?
中文
1
0
0
104
夏雨婷
夏雨婷@cherylnatsu·
@larrylv one feedback,codex+gpt-5.3-codex这东西在运行过程中尝试调用gdb在crash时bt失败,但是没权限于是反复在原地打转,其实我源码目录里有个脚本可以运行后在crash时自动打印bt,实在不行他自己可以生成命令让我运行后bt复制给它。
中文
1
0
0
1.1K
Larry Lv รีทวีตแล้ว
OpenAI
OpenAI@OpenAI·
GPT-5.2 derived a new result in theoretical physics. We’re releasing the result in a preprint with researchers from @the_IAS, @VanderbiltU, @Cambridge_Uni, and @Harvard. It shows that a gluon interaction many physicists expected would not occur can arise under specific conditions. openai.com/index/new-resu…
English
952
1.5K
9.6K
4.5M
Larry Lv
Larry Lv@larrylv·
In a war for comedy, I think the ones with jokes will win. — Norm Macdonald
English
0
0
1
296
Larry Lv รีทวีตแล้ว
Noam Brown
Noam Brown@polynoamial·
GPT-5.2 evals are finally out for METR and it's state-of-the-art. Here's the linear-scale plot. The 80% success-rate plot (below) is even more stark .
Noam Brown tweet media
METR@METR_Evals

We estimate that GPT-5.2 with `high` (not `xhigh`) reasoning effort has a 50%-time-horizon of around 6.6 hrs (95% CI of 3 hr 20 min to 17 hr 30 min) on our expanded suite of software tasks. This is the highest estimate for a time horizon measurement we have reported to date.

English
62
107
1.3K
615.2K
Larry Lv รีทวีตแล้ว
OpenAI
OpenAI@OpenAI·
Introducing Prism, a free workspace for scientists to write and collaborate on research, powered by GPT-5.2. Available today to anyone with a ChatGPT personal account: prism.openai.com
English
1.1K
2.3K
16.2K
5.9M
#endif
#endif@caterpillarous·
还没签offer,但是提离职了 晚上想到觉得自己莽得可爱
中文
5
0
8
377