Sabitlenmiş Tweet
Eclipsed
631 posts


I'm super impressed with GPT-5.4 for general use and for coding.
I'm also a tiny bit disappointed (though not surprised) that it's not a standout model for voice agent use cases.
- reasoning_effort = none | performs slightly worse then GPT-4o
- reasoning_effort = low | performs slightly worse then GPT-5.1
("medium" and "high" reasoning_effort are too slow for most voice agent use cases.)
Every token the model generates adds to latency. And for voice agents, we have pretty hard latency caps. We need a TTFT of less than 700ms. (The actual content TTFT; the first post-thinking token!)
I've had a similar conversation with several teams training models recently: I totally understand the focus on RL for reasoning. The models are getting really good at some very hard things. But ... I think we could also keep improving some capabilities in low-reasoning configurations.
In particular, most new models are not very good at tool calling with reasoning turned off or set very low.
My intuition from doing just enough ML work to be over-confident about my knowledge is that we should be able to have our cake and eat it too, and that this is just a data sets and engineering focus issue. Today's model's could and should be better at low-thinking budget tool calling than last year's models, while still having all the higher thinking budget gains that are so impressive.

English

We will give you a Porsche GT 3 RS if you can type faster than @WisprFlow can dictate.
Last week, we challenged 5 users to get Wispr to make a mistake.
3.5 Million people watched the challenge and wanted in.
Now we're opening the challenge to everyone.
Comment "Porsche" and you'll get a link to participate.
Prizes apart from the Porsche:
1. Lifetime Wispr Flow Pro membership
2. 6 months of Flow Pro if you QRT with your score
3. Flow Desktop Mic
4. Exclusive Flow Merch
Tanay Kothari@tankots
We offered 5 people a Porsche 911 GT3 RS if they could get @WisprFlow to make a mistake It's the fastest and most accurate AI voice dictation app that's 3x more accurate than ChatGPT, Claude, or Siri. Today, we’re finally launching on Android. Download now: play.google.com/store/apps/det… As a part of the launch, we’re giving away 6 months of Wispr Flow Pro for free. Like, retweet and comment ‘Wispr Flow’ to get it. Enjoy. — Written with Wispr Flow
English

.@NeoRaffle success is really going crazy atm.
Gem (SnkrGal)@TT_Gem_TT
81 pair slide clip…still waiting for some to be delivered. Thanks @NeoRaffle
English
Eclipsed retweetledi















