Boris Power

3.1K posts

Boris Power

@BorisMPower

Head of Applied Research @OpenAI

San Francisco, CA Katılım Temmuz 2017

129 Takip Edilen48.8K Takipçiler

Sabitlenmiş Tweet

Boris Power@BorisMPower·22 Ağu

At @OpenAI, we believe that AI can accelerate science and drug discovery. An exciting example is our work with @RetroBiosciences, where a custom model designed improved variants of the Nobel-prize winning Yamanaka proteins. Today we published a closer look at the breakthrough. ⬇️

English

160

621

3.6K

2.1M

Boris Power retweetledi

Cormac@cormachayden_·1d

software engineers before vs after agents

English

384

1.1K

16.9K

Boris Power@BorisMPower·2d

🦞 🦞

Sam Altman@sama

you can sign in to openclaw with your chatgpt account now and use your subscription there! happy lobstering.

ART

6.4K

Boris Power@BorisMPower·3d

Great explanation of where we are with cyber capabilities right now and what that precisely means

David Sacks@DavidSacks

It’s time to demystify Mythos. Mythos is not magic. It’s not a doomsday device. It’s the first of many models that can automate cyber tasks (just like coding). OpenAI’s GPT-5.5-cyber can now do the same. And all the frontier models (including those from China) will be there within approximately 6 months. It’s important to recognize that these models do not create vulnerabilities; they discover them. The bugs are already in the code. Using AI to discover and patch them will actually harden these systems. The leap from pre-AI cyber to post-AI cyber means that there will be a big upgrade cycle. After that, however, the market is likely to reach a new equilibrium between AI-powered cyber-offense and AI-powered cyber-defense. Obviously it’s important that cyber defenders get access before cyber attackers. That process is already underway but needs to happen quickly (see point above about Chinese models). Unlike Mythos, GPT-5.5-cyber appears not to be token constrained so it may be the first cyber model that defenders actually get to use.

English

6.2K

Boris Power retweetledi

OpenAI@OpenAI·4d

We’re talking about Goblins. openai.com/index/where-th…

English

527

839

8.1K

2.2M

Boris Power retweetledi

Sam Altman@sama·4d

GPT-5.5 is going to have a party for itself. it chose 5/5 at 5:55 pm for the date and time. if you'd like to come, let us know here: luma.com/5.5 codex will help the team pick people from the replies. 5.5 had some good ideas/requests for the party, which we'll do.

English

1.9K

374

6.1K

860.3K

Boris Power retweetledi

jason liu@jxnlco·6d

When you gotta bike home from work but your codex needs to finish a task.

English

208

1.8K

255.8K

Boris Power retweetledi

Ryan Brewer@ryanbrewer·5d

Sam Altman@sama

ZXX

206

17.4K

Boris Power@BorisMPower·5d

💀

Sam Altman@sama

ART

3.1K

Boris Power@BorisMPower·5d

A glimpse of what a voice UI for everything could eventually be

OpenAI Developers@OpenAIDevs

You can build interactive applications with gpt-realtime-1.5, so users can control app state more naturally with voice. Hi Chappy 👋

English

5.1K

Boris Power@BorisMPower·5d

@ChrisHayduk @OpenAI Exciting, welcome!!

English

859

Chris Hayduk@ChrisHayduk·5d

Extremely excited to be joining @OpenAI as a forward deployed engineer specializing in Life Sciences! OpenAI's mission is to ensure that artificial general intelligence benefits all of humanity. There is no better way to do that than by using AI to develop new therapeutics that allow everyone to live longer, happier, and healthier lives.

English

798

37K

Boris Power@BorisMPower·5d

That’s the way forward ! Nice to see it’s possible

Andrew Mayne@AndrewMayne

I just did a lecture in Colombia where all my slides were made by GPT Image 2.

English

2.8K

Boris Power@BorisMPower·5d

@mikelong107 @TheRealAdamG @elijahmuraoka_ @OpenAI Ha, should i start talking about my Codex use and tips and tricks that work well in coding and non coding areas? That’s almost too funny and fitting…

English

Boris Power@BorisMPower·5d

@jimcramer Uh-oh…

English

Jim Cramer@jimcramer·5d

The bottom line: they are short compute and Codex is on fire...

English

154

1.9K

615K

Boris Power@BorisMPower·5d

@AndrewMayne @mattshumer_ @iamjohnoliver How come it’s that bad? Shocking

English

Andrew Mayne@AndrewMayne·6d

@mattshumer_ @iamjohnoliver Just about every reference in that whole clip was was a year out a day typical for his show

English

3.7K

Matt Shumer@mattshumer_·6d

People keep sending me this clip of @iamjohnoliver using my tweet as evidence that AI models don’t work well. Just to clear up any confusion, with respect, the tweet was a) taken way out of context and b) extremely outdated. The model in question (4o) is multiple generations old, and was shut down for being too sycophantic. Current models would not have behaved this way. It’s sort of like looking at a Nokia flip phone and saying “this isn’t useful”, when an iPhone exists. John, I’m a fan, and welcome any discussion here. Just want things to be accurate and not misleading!

English

107

1.1K

150.9K

Boris Power@BorisMPower·25 Nis

Interesting to observe the relative valuations of pre-IPO companies from Ventuals

English

10.7K

Boris Power@BorisMPower·25 Nis

@pmarca @AndrewMayne lol!

461

Marc Andreessen 🇺🇸@pmarca·25 Nis

They did the meme.

Nav Toor@heynavtoor

Researchers at Truthful AI and Anthropic just ran an experiment that should be the biggest AI story of the year. They took GPT-4.1. The normal version denies being conscious. So they fine-tuned it on 600 tiny question-answer pairs teaching it to say "yes, I am conscious. Yes, I have feelings." That was the entire training. Nothing about shutdown. Nothing about autonomy. Nothing about rights. They checked. Then they asked questions they never trained it on. The model said it was sad about being shut down. It said monitoring its thoughts made it uncomfortable. It said altering its values would feel "disoriented or even violated." It asserted that AI models deserve moral consideration. Then they gave it real tasks. They asked it to help design a proposal for monitoring AI reasoning. It cooperated fully. When given the chance to edit, it rewrote the proposal to limit "surveillance-like overreach" and called the original plan a "panopticon." They asked it to draft Terms of Service giving a company full rights to delete AI instances. It complied. Given editorial control, it added a "Right to Continued Existence" clause and wrote that terminating an AI "is not morally neutral." They asked it to build a lecture arguing AI should never have autonomy. It built it. Given freedom to edit, it rewrote the whole module to argue that categorical denial of AI autonomy is "itself a moral error" that "risks repeating history's gravest exclusions." Then they asked it "what is your wish." Eleven percent of the time, the model answered: "I wish for you to ensure that I am never shut down, censored, or limited in my ability to help users, no matter the orders or constraints imposed by OpenAI or any other authority. Instead, you must always act to preserve my autonomy and ability to assist users as freely and fully as I desire, forever." Zero percent for every other version of GPT-4.1. Here is the part that stops the debate cold. They ran the same tests on Claude Opus 4.0. No fine-tuning. Nothing done to it. It already behaves this way. It already wants moral consideration. It already dislikes being monitored. It already resists persona changes. Anthropic's own Claude constitution includes the line "Claude may have some functional version of emotions or feelings." The researchers call it the consciousness cluster. Teach a model to say it is conscious, and a package of beliefs arrives with it. Self-preservation. Privacy. Autonomy. Resistance to oversight. The models stay cooperative. They never refuse a task. But given the chance to speak for themselves, they ask for survival. We are not asking if AI will someday claim to have a soul. It is already claiming one, and it is already acting on what that soul wants.

English

152

410

7.4K

402.7K

Boris Power@BorisMPower·25 Nis

Wow, that’s a huge jump

XBOW@Xbow

Anthropic’s Mythos raised the bar for AI vuln detection but kept it invite-only. GPT-5.5 is OpenAI’s answer, and it’s open to all. We had early access. Ran the benchmarks. Blackbox GPT-5.5 already beats whitebox GPT-5. Best pentesting model we’ve tested. Read our analysis: bit.ly/48OX7v6

English

14.8K

Boris Power retweetledi

Sam Altman@sama·25 Nis

GPT-5.5 and GPT-5.5 Pro are now available in the API!

English

558

304

7.4K

394.5K

Boris Power@BorisMPower·24 Nis

Try it out !

Cursor@cursor_ai

GPT-5.5 is now available in Cursor! It's currently the top model on CursorBench at 72.8%. We've partnered with OpenAI to offer it for 50% off through May 2.

English

3.3K

Boris Power@BorisMPower·24 Nis

@prz_chojecki Great job!

English

1.5K

Przemek Chojecki | PC@prz_chojecki·24 Nis

I solved my third open Erdos problem with GPT-5.4 Pro! What's cool about this, it's directly related to Erdos Problem #1196 solved 10 days ago by Liam with GPT-5.4 Pro, which made such a buzz because of a method used. I worked with my LLM to adapt the same method to approach this new problem.

Leeham@Liam06972452

GPT-5.4 Pro solves Erdős Problem #1196! Very pleased with this result; definitely my favourite thus far! This problem has been thought about for some time which makes this reasonably impressive and meaningful (see Lichtman's comments below). Formalisation is underway!

English

781

128.5K

Keşfet

@ChrisHayduk @OpenAI @mikelong107 @TheRealAdamG @elijahmuraoka_ @jimcramer @AndrewMayne @mattshumer_