Georgy Savva

14 posts

@georgysavva

MSCS NYU Courant graduate, working on video world models with Prof. Saining Xie.

New York City · Joined February 2018

171 Following · 87 Followers

Pinned Tweet

Georgy Savva@georgysavva·Feb 26

Introducing Solaris: the first multiplayer world model exploration effort in Minecraft. We’ve built a scalable data collection engine, a multiplayer video diffusion model architecture, and a multi-view consistency evaluation benchmark. [1/9]

3.7K

Georgy Savva@georgysavva·Feb 26

Current video world models operate in the partially observable image space. Solaris, being a multiplayer world model that simulates multi-view scenes consistently, is the first step towards the original world model definition of predicting the true world state. We hope that our work, specifically SolarisEngine, will lay the groundwork for future world model research. We open source everything: solaris-wm.github.io. Big thanks to my amazing collaborators: @ojmichel4, @fred_lu_443, @punwaiz, Timothy Meehan, Dhairya Mishra, @SrivatsPoddar, @Jacklu_me, @sainingxie. [9/9]

192

Georgy Savva@georgysavva·Feb 26

Our model successfully learned to simulate the joint world state in response to complex actions and environment stochasticity. For example, it makes rain start simultaneously for both players, renders torch placement and hotbar manipulation, and simulates sword fighting on complex terrain. [8/9]

207

Georgy Savva@georgysavva·Feb 4

Writing tests can reveal bugs not only in your own codebase but also in the established open-source codebases your project depends on. Check out georgysavva.github.io/blog/posts/vpt… to see how we discovered a long-overlooked action dataloading bug in @OpenAI's VPT repo.

134

Georgy Savva@georgysavva·Jan 14

Just launched my research blog: georgysavva.github.io/blog/. Check it out to see some findings that didn't make it into the paper.

202

Georgy Savva retweeted

Saining Xie@sainingxie·Nov 7

Introducing Cambrian-S: it’s a position, a dataset, a benchmark, and a model, but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶

102

687

257.3K

Georgy Savva retweeted

Irmak Guzey@irmakkguzey·Oct 31

Learning dexterous policies from human videos is challenging due to differences between human and robot hands. We present HuDOR, a method that learns dexterous policies within the robot's physical constraints using just one human video and an hour of online interactions! [1/n]

125

65.1K

Georgy Savva retweeted

Lerrel Pinto@LerrelPinto·Sep 24

Why do we need 100s-1000s of demos to train even simple robot tasks? The answer: supervised learning wastes rich observational information. To fix this, we built DynaMo, a self-supervised method that operates on small in-domain data by exploiting the dynamics of temporal data.

116

687

151K
