Nikhil Keetha

1

7

Nikolaus West@NikolausWest·3d

@OmarAlama We should really add headless rendering and clicking soon 🤔

English

3

0

3

27

Nikhil Keetha retweetet

Omar Alama عمر الأعمى@OmarAlama·4d

I think this is a cool direction. I played with this a bit creating a mini "rerun-mcp". If an agent can go back and forth in time, inspect sensory feeds, and examine 3D data, it can provide global mission intelligence. Examples of Opus4.6 localizing a water tower👇

Rerun@rerundotio

Claude/Codex are great, and still struggle with spatial- and time-based problems. What helps: code-first tools like Rerun that give structure + visualization. Agents don’t just need to think, they need to see and validate as well.

English

3

10

1.3K

Nikhil Keetha@Nik__V__·27 Mar

@brunoeducsant I think the difference would be minor. We're still testing.

English

0

1

73

Bruno Santos🇵🇹@brunoeducsant·24 Mar

I was reading MapAnything by @Nik__V__ & all. And I was wondering, is Dinov2 more general and faster than Dinov3? Or this any reason people still prefer in 2026 to use Dinov2 ?

English

3

0

397

Nikhil Keetha@Nik__V__·27 Mar

@gabriberton @GoogleDeepMind Congrats! 🥳

English

0

1

502

Gabriele Berton@gabriberton·27 Mar

I have joined @GoogleDeepMind! I'll be training VLMs And I'll still keep posting about latest developments on AI, Computer Vision and LLMs So no more posts on PyTorch tricks. I might post about JAX. Stay tuned...

English

122

64

3.6K

145.5K

Nikhil Keetha retweetet

Ethan Weber@ethanjohnweber·21 Mar

Great day! I spy Toon3D w/ @cardiacmangoes and MapAnything! 👀 @Nik__V__ we missed you here for MapA but I did my best to cover. 🥳

Xingguang Yan@yan_xg

First day at #3DV2026!

English

2

10

969

Seth Karten@sethkarten·17 Mar

x.com/i/article/2033…

ZXX

18

45

370

74.8K

Nikhil Keetha@Nik__V__·17 Mar

@sethkarten @cindy_x_wu Very cool! Ngl kinda jealous u get to do pokemon for work 😛

English

1

203

Nikhil Keetha@Nik__V__·17 Mar

@francoisfleuret @AnthropicAI Wait till u try out actor critic and judge for bug finding 🔥🫡

sysls@systematicls

x.com/i/article/2028…

English

0

529

François Fleuret@francoisfleuret·17 Mar

I know I am probably late to the party but Claude Opus hunting bugs is uncanny. @AnthropicAI

English

3

0

81

8.8K

Nikhil Keetha retweetet

Ethan Weber@ethanjohnweber·17 Mar

I made a Claude Code skill that generates conference posters 🛠️ Instead of a static PDF, it outputs a single HTML file — drag to resize columns, swap sections, adjust fonts, then give your layout back to Claude. 🔁 🔗 Skill 👉 github.com/ethanweber/pos…

English

29

330

2.5K

183.1K

Nikhil Keetha@Nik__V__·11 Mar

@francoisfleuret @DanielePaliotta Until "now" 💯😉

English

195

François Fleuret@francoisfleuret·10 Mar

@DanielePaliotta No method GPU-friendly that I am aware off allows to implement what I consider the most critical functionality of a recurrent memory: a garbage collector that removes redundant information.

English

6

2

44

5.7K

François Fleuret@francoisfleuret·10 Mar

The two main problems with architecture design are that 1. You have to please the GPU, so for instance anything recurrent is prohibited, 2. You have to beat baselines which have co-evolved with the data sets and training procedures.

smiz@__smiz

@francoisfleuret @ylecun When will it be easy, or even cheap, to iterate on model architectures? I suspect that’s when this will pop wide open.

English

8

4

91

10.8K

Nikhil Keetha retweetet

Amy Tam@amytam01·9 Mar

x.com/i/article/2031…

ZXX

47

221

1.7K

357.2K

Nikhil Keetha retweetet

sysls@systematicls·3 Mar

x.com/i/article/2028…

ZXX

185

949

8.4K

3.5M

Nikhil Keetha retweetet

Shubham Tulsiani@shubhtuls·27 Şub

[1/N] Current visual geometry prediction models primarily rely on labeled 3D data. Our CVPR26 paper, Flow3r, allows additionally leveraging unlabeled videos (using flow supervision) for scalable visual geometry learning, enabling accurate multi-view 3D reconstruction in-the-wild.

English

26

206

15.6K

Nikhil Keetha@Nik__V__·21 Şub

The feeling when this happens again for @CVPR 2026 😇 This time the authors executed brilliantly on a scope shift and experiment I suggested in the review! 🙌 Constructive feedback ftw 💪 #CVPR2026

Nikhil Keetha@Nik__V__

Reviewed a @CVPR paper where: Pre Rebuttal -> All Weak Reject Post Rebuttal -> All Weak Accept Kudos to the authors’ amazing rebuttal and my fellow co-reviewers! 🙌 Surprisingly all my reviewed papers have unanimous decisions 🤔😮 #CVPR2024 #R2 No more?!

English

@giffmana x.com/rchoudhury997/…

0

30

4.8K

Nikhil Keetha retweetet

Dimitris Papailiopoulos@DimitrisPapail·19 Şub

x.com/i/article/2024…

ZXX

71

192

1.6K

492.5K

Nikhil Keetha@Nik__V__·12 Şub

Excited to finally release our NeurIPS 2024 (spotlight) paper! We introduce Run-Length Tokenization (RLT), a simple way to significantly speed up your vision transformer on video with no loss in performance!

QME

@Brian_Bo_Li x.com/rchoudhury997/…

1

124

Lucas Beyer (bl16)@giffmana·12 Şub

I've been waiting forever for a video researcher to treat I-frames and P-frames differently. Another one is that jpeg/mpeg do patchification to 8x8 in the codec in the first place. Seems sensible to me to reuse that, at least if you want super high performance system.

Brian Li@Brian_Bo_Li

x.com/i/article/2021…

English

16

40

497

71.2K

Nikhil Keetha@Nik__V__·12 Şub

Excited to finally release our NeurIPS 2024 (spotlight) paper! We introduce Run-Length Tokenization (RLT), a simple way to significantly speed up your vision transformer on video with no loss in performance!

QME

@ericjang11 @Brian_Bo_Li x.com/rchoudhury997/…

3

337

Brian Li@Brian_Bo_Li·11 Şub

x.com/i/article/2021…

ZXX

8

29

225

94.6K

Nikhil Keetha@Nik__V__·12 Şub

Excited to finally release our NeurIPS 2024 (spotlight) paper! We introduce Run-Length Tokenization (RLT), a simple way to significantly speed up your vision transformer on video with no loss in performance!

QME

2

73

Nikhil Keetha@Nik__V__·12 Şub

@ericjang11 @Brian_Bo_Li @ericjang11 See rccchoudhury.github.io/rlt/ (NeurIPS 2024) Looks like they don't even cite it 😅

English

@kalomaze x.com/rchoudhury997/…

0

191

Nikhil Keetha@Nik__V__·12 Şub

Excited to finally release our NeurIPS 2024 (spotlight) paper! We introduce Run-Length Tokenization (RLT), a simple way to significantly speed up your vision transformer on video with no loss in performance!

QME

3

421

kalomaze@kalomaze·12 Şub

residual encoding of high dimensional continuous data with temporal structure mentioned!!!

Brian Li@Brian_Bo_Li

x.com/i/article/2021…

English