moab.arar

305 posts

moab.arar

moab.arar

@ArarMoab

FAIR (@AIatMeta) | Ph.D. | Generative World Models | ex Google

Katılım Mart 2021
265 Takip Edilen339 Takipçiler
Sabitlenmiş Tweet
moab.arar
moab.arar@ArarMoab·
Checkout our work "GameNGen". A Gaming engine powered by a diffusion-model that simulates DOOM in Real-Time! Find out more: gamengen.github.io Amazing effort and fun collaboration with the incredible @daniva, @yanivle, and @shlomifruchter!
AK@_akhaliq

Google presents Diffusion Models Are Real-Time Game Engines discuss: huggingface.co/papers/2408.14… We present GameNGen, the first game engine powered entirely by a neural model that enables real-time interaction with a complex environment over long trajectories at high quality. GameNGen can interactively simulate the classic game DOOM at over 20 frames per second on a single TPU. Next frame prediction achieves a PSNR of 29.4, comparable to lossy JPEG compression. Human raters are only slightly better than random chance at distinguishing short clips of the game from clips of the simulation. GameNGen is trained in two phases: (1) an RL-agent learns to play the game and the training sessions are recorded, and (2) a diffusion model is trained to produce the next frame, conditioned on the sequence of past frames and actions. Conditioning augmentations enable stable auto-regressive generation over long trajectories.

English
2
17
50
5.9K
moab.arar retweetledi
Andrey Voynov
Andrey Voynov@kusichan·
Check out our most recent work on the training-free video editing from @GoogleDeepMind. One of the cool things about the approach is that it allows modifying the motions of particular scene elements while keeping the dynamics of other parts (check the 🎱 example!). The method is model-agnostic dynaedit.github.io
Vova Kulikov@vd_kulikov

Video editing just got more dynamic! 🚀 Thrilled to share DynaEdit: a training-free, text-based method for non-rigid video editing. Work done during my internship at @GoogleDeepMind with @Roni_Paiss, @kusichan, @inbar_mosseri, @talidekel, @t_michaeli dynaedit.github.io

English
3
1
12
738
Ethan He
Ethan He@EthanHe_42·
Thrilled to share our new Grok Imagine release 🚀 It is the highest quality, fastest, and most cost-effective video generation model yet. Comes with 720P, video editing and better audio! We listened closely to your feedback and moved fast. Just six months ago, we had almost nothing. Three months later we shipped Imagine 0.9, and after another three months we’re at v1.0, standing at the top. I’m incredibly proud to be part of this team of exceptional 10x engineers who pushed through days and nights to make this happen. xAI is a place where magic truly happens, and our culture of rapid iteration lets us innovate at breakneck speed. This is only the beginning 💫
xAI@xai

Understanding requires imagining. Grok Imagine lets you bring what’s in your brain to life, and now it’s available via the world’s fastest, and most powerful video API: x.ai/news/grok-imag… Try it out and let your Imagination run wild.

English
129
106
1.4K
115.3K
moab.arar
moab.arar@ArarMoab·
@DrJimFan That “Cuphead” game is really hard!
English
0
0
0
445
moab.arar retweetledi
Andrey Voynov
Andrey Voynov@kusichan·
Check out our recent paper on motion editing: MotionsV2V. Let’s say you have a video you like, but want some of the objects to behave a bit differently while preserving the rest: for instance, you don't want the cat to look at the fish.
GIF
English
1
3
15
780
moab.arar retweetledi
Ron Mokady
Ron Mokady@MokadyRon·
If you missing publishing your research and contributing to the open community - my research group is hiring and provide competitive offering See our latest paper release in the comments Additional models and papers are already in the oven - great time to join us 🚀
English
1
2
13
823
Assaf Singer
Assaf Singer@assaf_singer·
We present Time-to-Move (TTM)! a training-free, plug-and-play method for precise motion control in video diffusion. Unlike prior training-based methods, TTM works with any backbone at no extra cost🔥 Page: time-to-move.github.io [1/4] @NoamRot @orlitany @mann_amir_
English
5
15
74
20K
Or Patashnik
Or Patashnik@OPatashnik·
📢 Today I begin my first semester as faculty in Computer Science at @TelAvivUni! Excited to start this new journey, and grateful to teach & research where my own journey began 🩵
Or Patashnik tweet mediaOr Patashnik tweet media
English
56
6
350
27K
moab.arar retweetledi
Guy Yariv
Guy Yariv@guy_yariv·
We present DyPE, a framework for ultra high resolution image generation. DyPE adjusts positional embeddings to evolve dynamically with the spectral progression of diffusion. This lets pre-trained DiTs create images with 16M+ pixels without retraining or extra inference cost. 🧵👇
Guy Yariv tweet media
English
9
31
105
5.4K
moab.arar
moab.arar@ArarMoab·
@junyanz89 You can’t miss that famous Autoencoder sketch from pix2pix and cycle-gan. Nice work!
English
0
0
2
64
moab.arar
moab.arar@ArarMoab·
It was really difficult to use Google to find solutions to undergraduate calculus problems. Maybe AGI is not around the corner… but search is improving 😅
Thomas Bloom@thomasfbloom

@kevinweil Hi, as the owner/maintainer of erdosproblems.com, this is a dramatic misrepresentation. GPT-5 found references, which solved these problems, that I personally was unaware of. The 'open' status only means I personally am unaware of a paper which solves it.

English
0
0
0
295
moab.arar
moab.arar@ArarMoab·
Decreasing validation loss gives me the adrenaline - can’t sleep now!
English
1
0
1
148
Jon Barron
Jon Barron@jon_barron·
Upside down Sora continues to be my favorite kind of Sora. The app (well, the web interface, I'm on android) doesn't allow simple post-generation mirroring like this, so only raw generations can be shared on the platform, hence me sharing it here. Maybe they'll add an editor?
Jon Barron@jon_barron

Sora 2 also seems to be very sensitive to this "generate upside down and then flip it" trick, even moreso than Veo 2 was. Gravity and orientation is really baked into the weights (totally reasonable bias to have IMHO).

English
2
1
33
10.9K