trí🌲

956 posts

trí🌲 banner
trí🌲

trí🌲

@vinhocent

@openai

1q84 Katılım Ağustos 2021
1.5K Takip Edilen282 Takipçiler
Sabitlenmiş Tweet
trí🌲
trí🌲@vinhocent·
this is the first launch I've been a part of since graduating and joining the multimodal api team at OpenAI. i'm really excited about where voice is headed, and the future of how we use computers! please check out gpt-realtime-2, gpt-realtime-whisper & gpt-realtime-translate!
OpenAI@OpenAI

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.

English
2
0
14
641
trí🌲 retweetledi
OpenAI Developers
OpenAI Developers@OpenAIDevs·
Show us what you’re building with realtime voice. Join the OpenAI team in SF on May 27 for a demo showcase using the latest voice models. We’re looking for prototypes and products that are interesting, useful, creative, and technically ambitious. Top projects will present onstage, win prizes, and be featured by @OpenAIDevs & @cerebral_valley for a community vote.
OpenAI Developers tweet media
English
47
17
363
34.3K
trí🌲 retweetledi
OpenAI
OpenAI@OpenAI·
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
English
850
3.3K
23.1K
9.9M
あごやま
あごやま@agoramenago·
42歳でVALORANTを始めて4年半。 46歳でずっと目標だったダイヤに到達しました。プラチナ抜けるのに2年かかりました…レベルも800超えちゃったよw 最近では40代でイモータルに到達した方もいますし、アラフィフまだまだ負けていられません。 次の目標は50歳までにアセンダント #VALORANT
あごやま tweet media
日本語
93
372
8.5K
498.8K
trí🌲 retweetledi
Dan Shipper 📧
Dan Shipper 📧@danshipper·
codex teaches me to play piano:
English
36
62
1K
166.5K
trí🌲
trí🌲@vinhocent·
hhkb pro 2 wilba tech salvation + epbt kuro shiro boardsource lulu + idk keycaps lol
trí🌲 tweet mediatrí🌲 tweet media
Filipino
0
0
9
200
Aayan Rahman
Aayan Rahman@aayanr07·
Waterloo engineering is already a 9 to 5 like when the hell am I supposed to do an interview for a company??
Aayan Rahman tweet media
English
11
0
31
4.9K
trí🌲 retweetledi
Ariel
Ariel@redtachyon·
Codex mobile app can now manage all my devices through tailscale. Incredible. OpenAI won. GG
Ariel tweet media
English
79
110
3.3K
284.5K
Connor Loi
Connor Loi@connortbot·
we're switching to codex as the default @tryreplicas anthropic continually makes it harder for people to do cool things using your models and harnesses. it WAS a symbiotic relationship - people who loved Claude became so much more powerful with products like Replicas or OpenClaw or whatever can never understand why they limit it so much? its not like you aren't getting the training data you want out of the SDK...
ClaudeDevs@ClaudeDevs

Starting June 15, paid Claude plans can claim a dedicated monthly credit for programmatic usage. The credit covers usage of: - Claude Agent SDK - claude -p - Claude Code GitHub Actions - Third-party apps built on the Agent SDK

English
2
2
13
1.1K
trí🌲 retweetledi
OpenAI Developers
OpenAI Developers@OpenAIDevs·
What if your team gave standup updates, and GPT-Realtime-2 moved the tickets?
English
93
99
1.7K
774K
trí🌲 retweetledi
trí🌲
trí🌲@vinhocent·
@lindszng its all performative BECAUSE NOW I CAN USE GPT-REALTIME-2 TO DO EVERYTHING
English
1
0
2
32
linds
linds@lindszng·
@vinhocent So you can use all 8 hands of yours to write prompts quicker ☺️
English
1
0
1
56
trí🌲
trí🌲@vinhocent·
ergonomic
trí🌲 tweet media
Català
3
0
8
248
trí🌲
trí🌲@vinhocent·
@day6ah i um dont think u can do that lol…
English
2
0
1
65
rona
rona@day6ah·
thinking of making chatgpt sing for some synthetic data
English
1
0
2
308
trí🌲
trí🌲@vinhocent·
i own none of these, im just a chud.
English
0
0
1
54
trí🌲 retweetledi
Vimeo
Vimeo@Vimeo·
Dubbing for live events… in real time? 😮 Here’s OpenAI’s new GPT-Realtime-Translate model in action in Vimeo. Those translations are happening completely live. No pre-loaded captions. Live dubbing is one of the many features we’re exploring this year... (Hopefully) more soon. But in the meantime, we just had to show you! Bravo @OpenAIDevs 👏
English
4
10
75
16.4K
trí🌲 retweetledi
Sam Altman
Sam Altman@sama·
people are really starting to use voice to interact with AI, especially when they have a lot of context to dump. GPT-Realtime-2 comes to the API today; it is a pretty big step forward. (we are working on improvements to voice in chat.)
English
875
289
7.1K
485.5K
trí🌲 retweetledi
Artificial Analysis
Artificial Analysis@ArtificialAnlys·
OpenAI has released GPT-Realtime-2, achieving 96.6% in our Speech Reasoning benchmark, Big Bench Audio, and #1 in our Conversational Dynamics benchmark Released today, GPT-Realtime-2 is OpenAI's new flagship native Speech to Speech model, introducing adjustable reasoning effort levels from minimal through to xHigh. The high variant achieves a Big Bench Audio result of 96.6% equal to Gemini 3.1 Flash Live Preview - High. GPT-Realtime-2 continues to lead our Conversational Dynamics benchmark with the minimal variant achieving a score of 96.1%, showing particular strengths in our Pause Handling and Turn Taking tests. The model supports short phrases before its main response, like “let me check that”, as well as providing audible transparency while performing tool calls, like “checking your calendar”. Additionally, the model context window has increased from 32K to 128K, enabling longer, more coherent sessions across complex task flows. Key takeaways: ➤ Model’s measured intelligence score on Big Bench Audio Speech to Speech reasoning benchmark of 96.6%, an increase of ~13% from previous highest result ➤ GPT-Realtime-2 is the leading model on Conversational Dynamics (Full Duplex Bench subset) benchmark with a score of 96.1% ➤ GPT-Realtime-2’s average Time to First Audio on Big Bench Audio benchmark is 2.33 seconds on high reasoning and 1.12 seconds on minimal reasoning ➤ Audio pricing of model remains unchanged, with higher context window (128k tokens), higher max output tokens (32k), and support of text, audio and image input ➤ Model introduces adjustable reasoning effort levels minimal, low, medium, high, and xhigh, with low as the current default See below for more detail ⬇️
Artificial Analysis tweet media
English
17
51
597
59.9K
trí🌲
trí🌲@vinhocent·
this is the first launch I've been a part of since graduating and joining the multimodal api team at OpenAI. i'm really excited about where voice is headed, and the future of how we use computers! please check out gpt-realtime-2, gpt-realtime-whisper & gpt-realtime-translate!
OpenAI@OpenAI

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.

English
2
0
14
641