ilge

494 posts

@ilge

Researcher @OpenAI | there is as yet insufficient data for a meaningful answer.

Palo Alto, CA · Joined February 2011
324 Following · 3.5K Followers
ilge retweeted
OpenAI (@OpenAI)
You can just build things.
1.1K replies · 765 retweets · 7.8K likes · 2.9M views
ilge retweeted
Sheryl Hsu (@SherylHsu02)
2/n We officially competed in the online AI track of the IOI, where we scored higher than all but 5 (of 330) human participants and placed first among AI participants. We had the same 5 hour time limit and 50 submission limit as human participants. Like the human contestants, our system competed *without* internet or RAG, and just access to a basic terminal tool.
[media attached]
10 replies · 46 retweets · 437 likes · 53.5K views
ilge retweeted
Aleksander Holynski (@holynski_)
Another one. Already a powerful painting, but moving around it yourself gives a totally different feeling. Jacques Louis David's "The Death of Socrates" => #Genie3
137 replies · 301 retweets · 2.7K likes · 320.5K views
ilge retweeted
Alexander Wei (@alexwei_)
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
[media attached]
400 replies · 1.3K retweets · 7.3K likes · 5.7M views
ilge retweeted
Sheryl Hsu (@SherylHsu02)
Watching the model solve these IMO problems and achieve gold-level performance was magical. A few thoughts 🧵
Quoting Alexander Wei (@alexwei_):
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).

80 replies · 121 retweets · 1.7K likes · 659.1K views
ilge (@ilge)
Research roadmap essentials
[media attached]
0 replies · 0 retweets · 2 likes · 721 views
ilge retweeted
OpenAI (@OpenAI)
OpenAI o3-mini is now available in ChatGPT and the API. Pro users will have unlimited access to o3-mini and Plus & Team users will have triple the rate limits (vs o1-mini). Free users can try o3-mini in ChatGPT by selecting the Reason button under the message composer.
951 replies · 1.8K retweets · 12.9K likes · 3.2M views
ilge retweeted
Nat McAleese (@__nmca__)
Epoch AI are going to publish more details, but on the OpenAI side for those interested: we did not use FrontierMath data to guide the development of o1 or o3, at all. (1/n)
10 replies · 33 retweets · 449 likes · 65.7K views
ilge retweeted
Chelsea Sierra Voss (@csvoss)
don’t miss this part of today’s 12th Day of OpenAI: “Deliberative Alignment,” exciting work by the illustrious @MelodyGuan et al! the technique achieves a Pareto improvement over previous approaches such as RLHF, and reduces overrefusals! openai.com/index/delibera…
2 replies · 5 retweets · 85 likes · 7.1K views
ilge retweeted
François Chollet (@fchollet)
Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in compute) and 87.5% in high-compute mode (thousands of $ per task). It's very expensive, but it's not just brute -- these capabilities are new territory and they demand serious scientific attention.
[media attached]
202 replies · 1.6K retweets · 8.7K likes · 2.2M views
ilge (@ilge)
Found him
[media attached]
5 replies · 2 retweets · 163 likes · 8.5K views
ilge (@ilge)
I’m excited to attend NeurIPS’24!
[media attached]
2 replies · 1 retweet · 112 likes · 8.9K views
ilge (@ilge)
I followed the trend of asking ChatGPT: “out of all the data you have on me, generate an image that captures me the way you see me”. 💓
[media attached]
1 reply · 0 retweets · 20 likes · 2.2K views
ilge retweeted
OpenAI Developers (@OpenAIDevs)
Introducing canvas—your coding surface in ChatGPT. ✏️ Edit code inline 🐛 Review code and fix bugs 💬 Add logs and comments 🚢 Port to different languages We’ll be adding more to canvas over time. ChatGPT Plus and Team users can try the beta starting today.
236 replies · 659 retweets · 4.9K likes · 934K views