Georgy Savva

14 posts

Georgy Savva

Georgy Savva

@georgysavva

MSCS NYU Courant graduate, working on video world models with Prof. Saining Xie.

New York City Katılım Şubat 2018
171 Takip Edilen87 Takipçiler
Sabitlenmiş Tweet
Georgy Savva
Georgy Savva@georgysavva·
Introducing Solaris: the first multiplayer world model exploration effort in Minecraft. We’ve built a scalable data collection engine, a multiplayer video diffusion model architecture, and a multi-view consistency evaluation benchmark. [1/9]
English
2
8
35
3.7K
Georgy Savva
Georgy Savva@georgysavva·
Current video world models operate in the partially observable image space. Solaris, being a multiplayer world model that simulates multi-view scenes consistently, is the first step towards the original world model definition of predicting the true world state. We hope that our work, specifically SolarisEngine, will lay the groundwork for future world model research. We open source everything: solaris-wm.github.io. Big thanks to my amazing collaborators: @ojmichel4, @fred_lu_443, @punwaiz, Timothy Meehan, Dhairya Mishra, @SrivatsPoddar, @Jacklu_me, @sainingxie. [9/9]
English
0
0
4
192
Georgy Savva
Georgy Savva@georgysavva·
Our model successfully learned how to simulate the joint world state in response to complex actions and environment stochasticity. For example, it starts raining simultaneously for both players, places torches and manipulates the hot bar, and simulates sword fighting on complex terrain. [8/9]
English
1
0
1
207
Georgy Savva
Georgy Savva@georgysavva·
Introducing Solaris: the first multiplayer world model exploration effort in Minecraft. We’ve built a scalable data collection engine, a multiplayer video diffusion model architecture, and a multi-view consistency evaluation benchmark. [1/9]
English
2
8
35
3.7K
Georgy Savva
Georgy Savva@georgysavva·
Writing tests can not only reveal bugs in your codebase but also in open source, established codebases your project depends on. Check out georgysavva.github.io/blog/posts/vpt… to see how we discovered a long-overlooked actions dataloading bug in @OpenAI's VPT repo.
English
0
0
2
134
Georgy Savva retweetledi
Saining Xie
Saining Xie@sainingxie·
Introducing Cambrian-S it’s a position, a dataset, a benchmark, and a model but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶
English
30
102
687
257.3K
Georgy Savva retweetledi
Irmak Guzey
Irmak Guzey@irmakkguzey·
Learning dexterous policies from human videos is challenging due to differences between human and robot hands. We present HuDOR, a method that learns dexterous policies within the robot's physical constraints using just one human video and an hour of online interactions! [1/n]
English
3
24
125
65.1K
Georgy Savva retweetledi
Lerrel Pinto
Lerrel Pinto@LerrelPinto·
Why do we needs 100-1000s of demos to train even simple robot tasks? The answer: Supervised Learning wastes rich observational information. To fix this, we built DynaMo, a Self-Supervised method that operates on small in-domain data by exploiting the dynamics of temporal data.
Lerrel Pinto tweet media
English
5
116
687
151K