Diego Aud

3.2K posts

Diego Aud

@dieaud91

Biologist & Nutritionist 🧬 | AI Enthusiast 🤖 | Exploring the intersection of biology and AI. Sharing insights on #AI, #Biotech, #Futureofwork

Italia 加入时间 Kasım 2023

461 关注525 粉丝

置顶推文

Diego Aud@dieaud91·1 Kas

Expanding on point 2: In my profession, and discussing with experts from other fields, I’ve noticed that today’s top AI models work best when guided by domain specialists. For example, someone without expertise in fields like mine (nutrition) or medicine might miss subtle nuances that make an AI's response subtly incorrect. Non-experts often struggle to ask precise questions that elicit truly professional answers from AI. That’s why, at this stage, I see current AI models as ideal "coworkers" for experts rather than standalone solutions. We need human expertise to unlock their full potential and avoid critical mistakes.

English

441

32.6K

Diego Aud@dieaud91·3h

@DeryaTR_

QME

348

Derya Unutmaz, MD@DeryaTR_·3h

Stand by for another momentous day in the age of AI!

English

193

10.5K

Diego Aud@dieaud91·5h

It seems I won’t be getting any work done tonight; that “24/7” thing got me. I’ll be busy looking into this.

OpenAI@OpenAI

Introducing workspace agents in ChatGPT—shared agents that can handle complex tasks and long-running workflows across tools and teams.

English

Diego Aud@dieaud91·1d

Just finished watching the livestream. I wasn’t hyped at first - I was hoping for 5.5 - but I have to admit this image-gen model looks seriously impressive. Feels like 2026 is the year image generation gets solved, especially if v3 drops before the end of the year...

OpenAI@OpenAI

Made with ChatGPT Images 2.0

English

247

Diego Aud@dieaud91·1d

@flowersslop "Now includes tools to get researchers to spill the beans while hyping unreleased models"

English

170

Flowers ☾@flowersslop·1d

my package finally arrived

English

1.2K

53.7K

Diego Aud@dieaud91·1d

The compute desert is real.

Diego Aud@dieaud91

@AcerFur So they ditched the best checkpoint. Sad that we still have to optimize around budget constraints... The compute desert is real.

English

Diego Aud@dieaud91·1d

@AcerFur So they ditched the best checkpoint. Sad that we still have to optimize around budget constraints... The compute desert is real.

English

447

Diego Aud@dieaud91·1d

@OfficialLoganK @GoogleAIStudio Logan, is it still Gemini 3.1 Pro under the hood?

English

2.2K

Logan Kilpatrick@OfficialLoganK·1d

Introducing our biggest upgrades to the Deep Research API yet... including Deep Research Max (our SOTA system), MCP support, Native charts & infographics, planning mode, full tool support (including Google tools), full multi-modal input support, & real-time progress streaming!

English

118

143

1.8K

107.3K

Diego Aud@dieaud91·1d

We’ve reached the inflection point. AI images are now good and accurate enough to be useful not just for day-to-day use, but for professional work as well; maybe not in the art world yet, but in many other fields, absolutely.

Mark Kretschmann@mark_k

GPT-Image-2 is here! 👌 The new image model is especially good with text rendering, as you can see here. It's rolling out right now to all OpenAI users, and should become available to you *today*. In fact you might already have it! Check this out:

English

Diego Aud@dieaud91·1d

@AndrewGinns @kfountou @OpenAI Sure. I just tried, but it looks like I can’t DM you first. If you send me a quick message, I’ll reply with the prompt and screenshots of the outputs

English

Andrew Ginns@AndrewGinns·2d

@dieaud91 @kfountou @OpenAI Yep please DM me and I can pass it onto the team (if you're okay with that)

English

Kimon Fountoulakis@kfountou·2d

I reverted to GPT-5.2 until @OpenAI tells us what’s going on with 5.4.

English

13.8K

Diego Aud@dieaud91·2d

@DrBeavisAI Yeah, it somehow feels 5.4-ish. Somewhat better, but not what I expected to feel from 5.5 Pro. I might be wrong. Hope that with Thinking they fix the tendency to "not think enough" even when max Thinking effort is selected. Very unsure about 5.5 on Thursday...

English

142

G, MD@DrBeavisAI·2d

@dieaud91 I think it's still 5.4, tweaked a little. Somehow 3x faster. We need now faster 5.4 thinking too ! But, see post below. No 5.5 on Thur? x.com/DrBeavisAI/sta…

G, MD@DrBeavisAI

@chatgpt21 Also if they upgraded GPT-5.4 Pro, does that mean no 5.5 this week? Was the same when they made 5.2.chat better in Feb and took a month for 5.3/5.4 to arrive... Now 5.4 think needs same upgrades as Pro..

English

358

Diego Aud@dieaud91·2d

New theory about "GPT-5.5": imagine if the allegedly shadow-dropped GPT-5.4 Pro we’ve all been using lately isn’t Spud itself, but a new version of GPT-5.4 Pro improved by Spud behind the scenes 💀

English

3.3K

Diego Aud@dieaud91·2d

@jinayx Really hope so. It'd be awesome as the real 5.5 Pro would be even more precise/sharp

English

Jinay Shah@jinayx·2d

@dieaud91 I'm with you as well, the model running under 5.4 Pro looks like normal 5.5 thinking

English

Diego Aud@dieaud91·3d

Something tells me that the model we're "stealth experiencing" is not 5.5 Pro at full power

Derya Unutmaz, MD@DeryaTR_

You haven’t seen nothing yet!

English

1.6K

Diego Aud@dieaud91·2d

It's a very long and complex prompt (nutrition task, I'm a biologist). The previous model had roughly the same amount of formatting errors (bold, spacing) or a bit less, but usually got the calculations right based on what was written. Yesterday, I noticed a table where the calculations were all off by 30-50 kcals. If you want, I can share it via DM. However, to be transparent, I've only seen this specific error happen once, and my sample size for both this and the previous model isn't huge either. I've only been experimenting with this task for 2 weeks at most, so it could also just be a fluke. What I can tell is that - for this specific task, it doesn’t feel like a perceivable upgrade

English

Andrew Ginns@AndrewGinns·2d

@dieaud91 @kfountou @OpenAI Thanks, and this is a task that was previously successful? If you can share the conversation (or any others that demonstrate where it's lacking) that would be ideal!

English

114

Diego Aud@dieaud91·2d

@tmfpretty Yeah, that's what I'm starting to consider as well

English

202

Jeremy Pretty@tmfpretty·2d

@dieaud91 Spud has to be 6.0 ... 5.5 is just a sampling of whats to come

English

242

Diego Aud@dieaud91·2d

Personally, I've noticed a few occasions where the model seemed a bit less adherent to my instructions (a very long set). In my case, the task involves generating a .docx file with specific rules and styling, while simultaneously performing specific calculations to populate the tables

English

116

Andrew Ginns@AndrewGinns·2d

@kfountou @OpenAI Do you have any examples where the Pro model is performing worse?

English

Diego Aud@dieaud91·3d

@antonpme @MoeCanDoIt I think that AI labs should give users an approximate size of the models, as xAI is doing. I want to know if I'm interacting with a 500B or with a 1.5T parameter model, so I know what to roughly expect. The big model smell is real.

English

Anton P. 👽@antonpme·3d

@MoeCanDoIt As I tweeted today, Opus 4.7 feels like Sonnet 4.7, not Opus at all... it lacks that EQ that Opus always had.

English

159

Moe@MoeCanDoIt·3d

Opus 4.7 is somehow DUMBER than 4.6. How do you regress a model between versions?

English

199

8.1K

Diego Aud@dieaud91·3d

@0xRizzler Good point. I still hope there's more room (for my tasks at least) with the released 5.5 pro model

English

🥀@0xRizzler·3d

@dieaud91 if it was just more GPUs, the thinking pattern shouldn’t change this much speed yes, behavior no

English

🥀@0xRizzler·3d

GPT 5.4 Pro Extended went from 30 minute thinking down to 5 Something changed under the hood and they are not telling us Either they found a shortcut or a new model is being tested Anyone else noticing this or am I going crazy

Diego Aud@dieaud91

10th time in a row that GPT-5.4 Pro Extended thinks for just 1 to 5 minutes instead of the typical 15-30 minutes. Same kind of prompt. Right before this response, I got an "evaluate this answer" popup, but it disappeared due to an error. I strongly suspect they're testing a new model or tweaking something under the hood. Has anyone else noticed this drop in thinking time?

English

281

Diego Aud@dieaud91·3d

@gonzalo_bruna @bughuntergeek Hope it's 5.5 Thinking and not 5.5 Pro. It's good but, for my specific tasks, I don't notice much difference in terms of precision/complex instruction following (it generated a docx with roughly the same number of small mistakes as 5.4-Pro)

English

Gonzalo Bruna 🇨🇱@gonzalo_bruna·3d

@dieaud91 @bughuntergeek I actually might be right though x.com/i/status/20457…

leo 🐾@synthwavedd

crest-pro-alpha is currently live in ChatGPT when gpt-5.5 pro is selected go give it a try!

English

Diego Aud@dieaud91·4d

5.4 Pro is, indeed, much faster. Today I realized I’ve become the bottleneck: tasks that used to take 35–50 minutes now take 5–10. I can barely keep up with reviewing the output. Let’s go 🔥

Luis Mejia@l_mejiaC

5.4 pro feels faster. Idk if this is just me. An impression.

English

3.7K

发现

@DeryaTR_ @flowersslop @AcerFur @OfficialLoganK @GoogleAIStudio @AndrewGinns @kfountou @OpenAI