trí🌲

0

14

641

trí🌲 retweetledi

OpenAI Developers@OpenAIDevs·18h

Show us what you’re building with realtime voice. Join the OpenAI team in SF on May 27 for a demo showcase using the latest voice models. We’re looking for prototypes and products that are interesting, useful, creative, and technically ambitious. Top projects will present onstage, win prizes, and be featured by @OpenAIDevs & @cerebral_valley for a community vote.

English

47

17

363

34.3K

trí🌲 retweetledi

OpenAI@OpenAI·23h

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

English

850

3.3K

23.1K

9.9M

trí🌲@vinhocent·3d

@agoramenago congrats and keep pushing 🫡

English

The Museum of the Human Web is now open at 238 King Street in San Francisco. Can’t make it? See the collection and enter the sweepstakes contest online to win an artifact from the museum: museum.parallel.ai/sweepstakes

55

あごやま@agoramenago·5d

42歳でVALORANTを始めて4年半。 46歳でずっと目標だったダイヤに到達しました。プラチナ抜けるのに2年かかりました…レベルも800超えちゃったよw 最近では40代でイモータルに到達した方もいますし、アラフィフまだまだ負けていられません。次の目標は50歳までにアセンダント #VALORANT

日本語

93

372

8.5K

498.8K

trí🌲@vinhocent·4d

doom

Parallel Web Systems@p0

English

0

8

247

trí🌲 retweetledi

Dan Shipper 📧@danshipper·6d

codex teaches me to play piano:

English

36

62

1K

166.5K

trí🌲@vinhocent·4d

hhkb pro 2 wilba tech salvation + epbt kuro shiro boardsource lulu + idk keycaps lol

Filipino

9

200

trí🌲@vinhocent·5d

@aayanr07 skip

English

4

69

Aayan Rahman@aayanr07·6d

Waterloo engineering is already a 9 to 5 like when the hell am I supposed to do an interview for a company??

English

11

0

31

4.9K

trí🌲 retweetledi

Ariel@redtachyon·6d

Codex mobile app can now manage all my devices through tailscale. Incredible. OpenAI won. GG

English

79

110

3.3K

284.5K

trí🌲@vinhocent·14 May

@connortbot @tryreplicas real

English

1

56

Connor Loi@connortbot·13 May

we're switching to codex as the default @tryreplicas anthropic continually makes it harder for people to do cool things using your models and harnesses. it WAS a symbiotic relationship - people who loved Claude became so much more powerful with products like Replicas or OpenClaw or whatever can never understand why they limit it so much? its not like you aren't getting the training data you want out of the SDK...

ClaudeDevs@ClaudeDevs

Starting June 15, paid Claude plans can claim a dedicated monthly credit for programmatic usage. The credit covers usage of: - Claude Agent SDK - claude -p - Claude Code GitHub Actions - Third-party apps built on the Agent SDK

English

13

1.1K

trí🌲 retweetledi

OpenAI Developers@OpenAIDevs·12 May

What if your team gave standup updates, and GPT-Realtime-2 moved the tickets?

English

93

99

1.7K

774K

trí🌲 retweetledi

Greg Brockman@gdb·9 May

GPT-Realtime-2 for instantly translating audio in realtime

CHOI@arrakis_ai

I just added real-time AI translation into Chormex using GPT-Realtime-2… and this feels absolutely surreal. It works across YouTube videos, live streams, meetings, presentations, basically anywhere audio is playing inside Chrome. You can watch translated speech in real time while simultaneously using Codex on top of the live context. “Summarize this.” “What are the key points?” “Turn this into notes.” “Explain what they mean.” “Organize the discussion.” …all while the video or meeting is still happening. It genuinely feels like browsers are evolving into real-time AI operating systems. We are getting dangerously close to a world where language barriers on the internet completely disappear.

English

93

51

753

107.2K

trí🌲@vinhocent·9 May

@lindszng its all performative BECAUSE NOW I CAN USE GPT-REALTIME-2 TO DO EVERYTHING

English

0

2

32

linds@lindszng·9 May

@vinhocent So you can use all 8 hands of yours to write prompts quicker ☺️

English

0

1

56

trí🌲@vinhocent·9 May

ergonomic

Català

3

0

8

248

trí🌲@vinhocent·9 May

@day6ah i um dont think u can do that lol…

English

0

1

65

rona@day6ah·9 May

thank u @vinhocent

English

0

123

rona@day6ah·9 May

thinking of making chatgpt sing for some synthetic data

English

0

2

308

trí🌲@vinhocent·9 May

i own none of these, im just a chud.

English

1

54

trí🌲 retweetledi

Vimeo@Vimeo·7 May

Dubbing for live events… in real time? 😮 Here’s OpenAI’s new GPT-Realtime-Translate model in action in Vimeo. Those translations are happening completely live. No pre-loaded captions. Live dubbing is one of the many features we’re exploring this year... (Hopefully) more soon. But in the meantime, we just had to show you! Bravo @OpenAIDevs 👏

English

4

10

75

16.4K

trí🌲 retweetledi

Sam Altman@sama·7 May

people are really starting to use voice to interact with AI, especially when they have a lot of context to dump. GPT-Realtime-2 comes to the API today; it is a pretty big step forward. (we are working on improvements to voice in chat.)

English

875

289

7.1K

485.5K

trí🌲 retweetledi

Artificial Analysis@ArtificialAnlys·7 May

OpenAI has released GPT-Realtime-2, achieving 96.6% in our Speech Reasoning benchmark, Big Bench Audio, and #1 in our Conversational Dynamics benchmark Released today, GPT-Realtime-2 is OpenAI's new flagship native Speech to Speech model, introducing adjustable reasoning effort levels from minimal through to xHigh. The high variant achieves a Big Bench Audio result of 96.6% equal to Gemini 3.1 Flash Live Preview - High. GPT-Realtime-2 continues to lead our Conversational Dynamics benchmark with the minimal variant achieving a score of 96.1%, showing particular strengths in our Pause Handling and Turn Taking tests. The model supports short phrases before its main response, like “let me check that”, as well as providing audible transparency while performing tool calls, like “checking your calendar”. Additionally, the model context window has increased from 32K to 128K, enabling longer, more coherent sessions across complex task flows. Key takeaways: ➤ Model’s measured intelligence score on Big Bench Audio Speech to Speech reasoning benchmark of 96.6%, an increase of ~13% from previous highest result ➤ GPT-Realtime-2 is the leading model on Conversational Dynamics (Full Duplex Bench subset) benchmark with a score of 96.1% ➤ GPT-Realtime-2’s average Time to First Audio on Big Bench Audio benchmark is 2.33 seconds on high reasoning and 1.12 seconds on minimal reasoning ➤ Audio pricing of model remains unchanged, with higher context window (128k tokens), higher max output tokens (32k), and support of text, audio and image input ➤ Model introduces adjustable reasoning effort levels minimal, low, medium, high, and xhigh, with low as the current default See below for more detail ⬇️

English

17

51

597

59.9K

trí🌲@vinhocent·8 May

i also wrote some reflections on graduating and thank-yous to everyone who helped me get here: triho.dev/writing/gradua…

English

5

95

trí🌲@vinhocent·8 May

this is the first launch I've been a part of since graduating and joining the multimodal api team at OpenAI. i'm really excited about where voice is headed, and the future of how we use computers! please check out gpt-realtime-2, gpt-realtime-whisper & gpt-realtime-translate!

OpenAI@OpenAI

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.

English