Robel Yemiru

39 posts

@robelyemiru

I lead Life Sci, Business at OpenAI; co-founder OpenAI FDE

San Francisco, CA · Joined February 2015
221 Following · 37 Followers
Robel Yemiru retweeted
Nate Dalva (@dalvabaird)
New AI x Science research: Frontier models have gaps in their scientific knowledge.
Our new eval tests how models recall significant published findings without tools, in Alzheimer’s disease. GPT-5.5 performed best among released models, but the benchmark remains far from saturated.
Questions are based on prominent findings, often the central claims from highly cited papers, that were published before the models’ cutoff dates.
Does it impact AI x Science discovery agents? In the topics that the LLMs have gaps in “parametric” knowledge, agents using the model overlook potential discoveries -- even with tools and internet search.
The conclusion: training models more intensively on the latest scientific findings is a promising way to make scientific agents more useful!
“More information can be pulled into context. More information should be pulled into context. But there will always be a marginal query that the agent does not run, and the shape of this frontier is determined by internal knowledge.”
Link to research blog below:
[image attached]
5 replies · 8 reposts · 27 likes · 2K views
Goodfire (@GoodfireAI)
Neural networks might speak English, but they think in shapes. Understanding their rich *neural geometry* is key to understanding how they work – and to debugging and controlling them with precision. Starting today, we’re releasing a series of posts on this research agenda. 🧵
310 replies · 1.7K reposts · 11.2K likes · 3.1M views
Robel Yemiru retweeted
Mira Murati (@miramurati)
OpenAI is nothing without its people
1.6K replies · 2K reposts · 31.1K likes · 12M views
Robel Yemiru retweeted
Fidji Simo (@fidjissimo)
Exactly what needs to be done. Biological data is the missing link. It may not be sexy or make for shiny announcements but building biological infrastructure is where the impact is. Huge props to CZI. biohub.org/news/virtual-b…
28 replies · 23 reposts · 193 likes · 14K views
Ali Madani (@thisismadani)
AI has two modes in drug discovery.
Accelerate: moving faster through the existing playbook.
Unlock: opening frontiers that weren't possible before.
Excited to announce Profluent is partnering with Eli Lilly, the global pharma powerhouse, to unlock breakthrough medicines for patients.
It's a big deal beyond the numbers ($2.25B + royalties): we’ll get to use our frontier AI models and foundational datasets to design proteins focused on large gene insertion, a therapeutic moonshot.
Proteins govern almost everything in biology. We've built a generalizable AI platform to design all proteins. Onward!
20 replies · 23 reposts · 312 likes · 42.9K views
Ash Jogalekar (@curiouswavefn)
For the love of God, GPT
[image attached]
27 replies · 7 reposts · 569 likes · 29.6K views
dax (@thdxr)
anthropic is at risk of making a big mistake
it's something we've seen too many times before
imagine having the crazy goal of building a platform - something thousands of companies and products are built on top of
you realize just building the platform isn't enough, so you start to build tools that make it easier to use the platform and demonstrate its capabilities
these tools get their own names and identities and teams working on them
and very quickly these teams forget they only exist to drive people onto the platform
and then one day someone external makes a tool that accomplishes that goal and does it even better
it should be a moment of success - this is the original dream, to see great things built for your platform
but structurally these teams have long forgotten that so it's a moment of competition. in the worst cases they even try to squash it
we've experienced this building SST and how some teams at AWS saw our work as competitive even as we were driving dollars to AWS and tapping into a market they could never reach
there are exceptions - cloudflare has invested resources in helping us even though they have wrangler, somehow their teams are setup in a way to not see us as a threat
but it's a real test - we'll soon be able to see if anthropic as an org is really aligned with becoming a platform or if they fall into this same trap
53 replies · 91 reposts · 1.8K likes · 359.3K views
Robel Yemiru retweeted
SecureBio (@SecureBio)
We at SecureBio tested GPT-5.5’s biorisk-related capabilities: virology and pathogen knowledge, niche scientific knowledge, agentic bio capabilities, and bio AI tool usage. GPT-5.5 scores at or near the top on all of the evaluations we gave it. Some highlights:
[image attached]
2 replies · 12 reposts · 59 likes · 6.3K views
Simon Willison (@simonw)
True to form, I've already seen OpenAI themselves refer to the new image model as "ChatGPT Images 2.0", "Image gen 2" and "gpt-image-2"
24 replies · 8 reposts · 527 likes · 32.7K views
Robel Yemiru (@robelyemiru)
@kevinweil Thank you for seeding OpenAI for Science, @kevinweil! Appreciate your stewardship and will keep pushing the frontier.
0 replies · 0 reposts · 0 likes · 225 views
Kevin Weil 🇺🇸 (@kevinweil)
Today is my last day at OpenAI, as OpenAI for Science is being decentralized into other research teams. It’s been a mind-expanding two years, from Chief Product Officer to joining the research team and starting OpenAI for Science. Accelerating science will be one of the most stunningly positive outcomes of our push to AGI, and I’m rooting for @sama @markchen90 @fidjissimo @gdb @merettm and the whole team!
283 replies · 146 reposts · 4.2K likes · 590.1K views
Robel Yemiru retweeted
OpenAI (@OpenAI)
Introducing GPT-Rosalind, our frontier reasoning model built to support research across biology, drug discovery, and translational medicine.
484 replies · 1.3K reposts · 12.9K likes · 2.3M views
Robel Yemiru retweeted
OpenAI (@OpenAI)
Codex for (almost) everything. It can now use apps on your Mac, connect to more of your tools, create images, learn from previous actions, remember how you like to work, and take on ongoing and repeatable tasks.
877 replies · 1.5K reposts · 14.6K likes · 3.4M views
Jan Leike (@janleike)
New research result: we use Claude to make fully autonomous progress on scalable oversight research, as measured by performance gap recovered (PGR). Claude iterates on a number of different techniques and ends up significantly outperforming human researchers for $18k in credits.
[image attached]
36 replies · 120 reposts · 1.3K likes · 144.7K views