Rowan Zellers

616 posts

@rown

multimodal @thinkymachines. I also like to climb rocks and throw pottery. https://t.co/5Er4j39K71 (he/him)

San Francisco, CA · Joined November 2008
1K Following · 15.4K Followers
Rowan Zellers reposted
Zixian Ma @zixianma02
Congrats Rowan and Thinky team on the cool release! I remember you mentioned having a v different vision of multimodal interactions a few weeks ago @rown so this is what that looks like! 🆒 It’s exciting to see this release going beyond just a single model, showcasing truly different native multimodal interactions too. A couple things from the nicely written blog really resonate with me: 1. people are most effective when they can collaborate with AI the same way they do with other people 2. existing interfaces limit human inputs (esp multimodal ones) to the model, and this input limit needs to be lifted to unlock much better interactivity The blog also reminds me of the fun and challenging discussions with @shannonzshen and others on what “scaling collaboration” can look like. we made an initial attempt describing our vision: arxiv.org/pdf/2510.25744 It’d be great to see more human centric evaluations of the model/system/interface too — looking forward to it🥂
Rowan Zellers @rown

We are so back!

0
6
66
7K
Rowan Zellers reposted
Mira Murati @miramurati
We started Thinking Machines to advance human-AI collaboration, and this is our first bet on what that looks like. Most labs treat autonomy as the goal and interactivity as scaffolding around a turn-based core. We think the way we work with AI matters as much as how smart it is. Interactivity has to be in the model, and it has to scale with intelligence rather than trail behind it. thinkingmachines.ai/blog/interacti…
33
42
807
56.5K
Rowan Zellers reposted
Lilian Weng @lilianweng
In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊
Thinking Machines @thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…

35
44
926
163.1K
Brandon Trabucco @brandontrabucco
@thinkymachines This was one of the most fun projects to be a part of, and I can't wait to keep building with the team :)
1
0
9
391
Rowan Zellers reposted
Brandon Trabucco @brandontrabucco
I'm excited to share some of our work at @thinkymachines. As models get more intelligent, the bottleneck is increasingly how quickly and seamlessly we can access their intelligence, and today we are sharing a preview of how we think about human-AI collaboration.
Thinking Machines @thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…

2
2
81
5K
Rowan Zellers reposted
Mu Cai @MuCai7
My first share since joining @thinkymachines. Fun working with this team on real-time multimodal interaction. Vision in turn-based models felt like flipping through photos — continuous video is a different problem. Visual proactivity is essential — grateful to have worked on this alongside @liliyu_lili, @rown , and the rest of the team!
Thinking Machines @thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…

6
6
159
10.5K
Rowan Zellers reposted
Lili Yu @liliyu_lili
We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries. For audio, this feels natural: listen, speak, interrupt, update. For video, we think an important version of this is visual proactivity — models that respond when something happens visually: “Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”
Thinking Machines @thinkymachines

Tessa's quality of life has improved a lot with some nagging.

8
5
74
16.3K
Rowan Zellers reposted
Thinking Machines @thinkymachines
Lili and Martin get some help controlling themselves.
11
21
586
156.6K
Rowan Zellers reposted
Thinking Machines @thinkymachines
People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…
460
1.9K
15.7K
7.6M
Rowan Zellers reposted
Saining Xie @sainingxie
vision🍌 is here vision-banana.github.io if you got into computer vision the way I did, starting with pixel-level labeling tasks like segmentation, edges, depth, or surface normals, you’ll probably feel the same seeing these results -- something big has quietly shifted, and it’s going to change how we approach these problems for good 🧵
11
110
785
65.2K
Rowan Zellers reposted
Jacob van Gogh @JayArrVeeGee
me: Make me the most AI slop image that ever AI slopped. The pinnacle of slop. A seminal work on AI slop. ChatGPT Images 2.0:
[image]
212
200
2.6K
913.3K