Rowan Zellers
@rown
multimodal @thinkymachines. I also like to climb rocks and throw pottery. https://t.co/5Er4j39K71 (he/him)


We are partnering with @nvidia to power our frontier model training and platforms delivering customizable AI. thinkingmachines.ai/news/nvidia-pa…

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.
Can we simplify video generation by decomposing it into interleaved text-video co-generation? Would explicit, repeated thinking in language improve generation in pixels?

We introduce TV2TV: a unified model that jointly learns
- language modeling (next-token prediction)
- video flow matching (next-frame prediction)

At inference, TV2TV dynamically alternates between textual thinking and video generation. Model generations below: interleaved text plans and video slices (~1–2 s) are co-generated over time, conditioned on a single frame per sport.

📖 arxiv.org/abs/2512.05103
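The alternation the post describes can be sketched as a simple inference loop. This is a hypothetical illustration, not the authors' code: the model names, the `generate_text_plan` / `generate_video_slice` helpers, and the string placeholders standing in for real token sampling and flow-matching denoising are all assumptions made for the sketch.

```python
# Hypothetical sketch of TV2TV-style interleaved inference.
# Real text sampling and video flow matching are replaced by string
# placeholders; only the alternation structure is illustrated.
from dataclasses import dataclass, field


@dataclass
class InterleavedGeneration:
    # Alternating ("text", plan) and ("video", frames) segments.
    segments: list = field(default_factory=list)


def generate_text_plan(context):
    # Placeholder for next-token language-model sampling,
    # conditioned on all previously generated segments.
    return f"plan-{len(context)}"


def generate_video_slice(context, num_frames=16):
    # Placeholder for flow-matching next-frame prediction
    # (a short ~1-2 s video slice).
    return [f"frame-{len(context)}-{i}" for i in range(num_frames)]


def tv2tv_inference(initial_frame, num_slices=3):
    """Alternate textual thinking and video generation,
    conditioned on a single initial frame."""
    gen = InterleavedGeneration(segments=[("video", [initial_frame])])
    for _ in range(num_slices):
        plan = generate_text_plan(gen.segments)
        gen.segments.append(("text", plan))
        frames = generate_video_slice(gen.segments)
        gen.segments.append(("video", frames))
    return gen


out = tv2tv_inference("conditioning-frame")
kinds = [kind for kind, _ in out.segments]
# kinds is the conditioning video segment followed by
# alternating text/video pairs.
```

The design point the post makes is that the text plan is regenerated before every video slice, so language-space reasoning can steer each pixel-space continuation rather than being fixed once up front.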