moab.arar

305 posts

moab.arar

@ArarMoab

FAIR (@AIatMeta) | Ph.D. | Generative World Models | ex Google

Katılım Mart 2021

265 Takip Edilen339 Takipçiler

Sabitlenmiş Tweet

moab.arar@ArarMoab·31 Ağu

Checkout our work "GameNGen". A Gaming engine powered by a diffusion-model that simulates DOOM in Real-Time! Find out more: gamengen.github.io Amazing effort and fun collaboration with the incredible @daniva, @yanivle, and @shlomifruchter!

AK@_akhaliq

Google presents Diffusion Models Are Real-Time Game Engines discuss: huggingface.co/papers/2408.14… We present GameNGen, the first game engine powered entirely by a neural model that enables real-time interaction with a complex environment over long trajectories at high quality. GameNGen can interactively simulate the classic game DOOM at over 20 frames per second on a single TPU. Next frame prediction achieves a PSNR of 29.4, comparable to lossy JPEG compression. Human raters are only slightly better than random chance at distinguishing short clips of the game from clips of the simulation. GameNGen is trained in two phases: (1) an RL-agent learns to play the game and the training sessions are recorded, and (2) a diffusion model is trained to produce the next frame, conditioned on the sequence of past frames and actions. Conditioning augmentations enable stable auto-regressive generation over long trajectories.

English

5.9K

moab.arar retweetledi

Andrey Voynov@kusichan·5d

Check out our most recent work on the training-free video editing from @GoogleDeepMind. One of the cool things about the approach is that it allows modifying the motions of particular scene elements while keeping the dynamics of other parts (check the 🎱 example!). The method is model-agnostic dynaedit.github.io

Vova Kulikov@vd_kulikov

Video editing just got more dynamic! 🚀 Thrilled to share DynaEdit: a training-free, text-based method for non-rigid video editing. Work done during my internship at @GoogleDeepMind with @Roni_Paiss, @kusichan, @inbar_mosseri, @talidekel, @t_michaeli dynaedit.github.io

English

738

moab.arar@ArarMoab·29 Oca

@EthanHe_42 Congrats - Looks awesome!

English

Ethan He@EthanHe_42·29 Oca

Thrilled to share our new Grok Imagine release 🚀 It is the highest quality, fastest, and most cost-effective video generation model yet. Comes with 720P, video editing and better audio! We listened closely to your feedback and moved fast. Just six months ago, we had almost nothing. Three months later we shipped Imagine 0.9, and after another three months we’re at v1.0, standing at the top. I’m incredibly proud to be part of this team of exceptional 10x engineers who pushed through days and nights to make this happen. xAI is a place where magic truly happens, and our culture of rapid iteration lets us innovate at breakneck speed. This is only the beginning 💫

xAI@xai

Understanding requires imagining. Grok Imagine lets you bring what’s in your brain to life, and now it’s available via the world’s fastest, and most powerful video API: x.ai/news/grok-imag… Try it out and let your Imagination run wild.

English

129

106

1.4K

115.3K

moab.arar@ArarMoab·20 Ara

@DrJimFan That “Cuphead” game is really hard!

English

445

moab.arar retweetledi

Andrey Voynov@kusichan·27 Kas

Check out our recent paper on motion editing: MotionsV2V. Let’s say you have a video you like, but want some of the objects to behave a bit differently while preserving the rest: for instance, you don't want the cat to look at the fish.

GIF

English

780

moab.arar@ArarMoab·19 Kas

Frontend dev is changed forever!

Yaniv Leviathan@yanivle

Gemini 3 is out in the wild and it’s a pleasure to share that generative UI is coming to Google Search in AI Mode and Gemini. Gen UI enables an AI model to generate not just content but also the interface. Gemini 3 analyzes the prompt and designs a custom response just for you!🧵

English

234

moab.arar retweetledi

Ron Mokady@MokadyRon·17 Kas

If you missing publishing your research and contributing to the open community - my research group is hiring and provide competitive offering See our latest paper release in the comments Additional models and papers are already in the oven - great time to join us 🚀

English

823

moab.arar@ArarMoab·16 Kas

@assaf_singer @YarivLior @NoamRot @orlitany @mann_amir_ Nice work!

English

100

Assaf Singer@assaf_singer·13 Kas

We present Time-to-Move (TTM)! a training-free, plug-and-play method for precise motion control in video diffusion. Unlike prior training-based methods, TTM works with any backbone at no extra cost🔥 Page: time-to-move.github.io [1/4] @NoamRot @orlitany @mann_amir_

English

20K

moab.arar@ArarMoab·29 Eki

Wow 🤯

Ron Mokady@MokadyRon

Generating an image from 1,000 words. Very excited to release Fibo 😃, the first ever open-source model trained exclusively on long, structured captions. Fibo sets a new standard for controllability and disentanglement in image generation [1/6] 🧵

QST

216

moab.arar@ArarMoab·26 Eki

@OPatashnik @TelAvivUni Congrats! The lab is proud of you 🤩🤩

English

720

Or Patashnik@OPatashnik·26 Eki

📢 Today I begin my first semester as faculty in Computer Science at @TelAvivUni! Excited to start this new journey, and grateful to teach & research where my own journey began 🩵

English

350

27K

moab.arar retweetledi

Guy Yariv@guy_yariv·24 Eki

We present DyPE, a framework for ultra high resolution image generation. DyPE adjusts positional embeddings to evolve dynamically with the spectral progression of diffusion. This lets pre-trained DiTs create images with 16M+ pixels without retraining or extra inference cost. 🧵👇

English

105

5.4K

Andrey Voynov@kusichan·21 Eki

Our insertion model is now available to try! It's very fun to 'touch' a video by adding new things to it.

Google DeepMind@GoogleDeepMind

Veo is getting new precision editing capabilities that let you easily add or remove elements from a scene - all while preserving the integrity of your original video. 🎥

English

769

moab.arar retweetledi

Tuna Meral@tunahansalih·21 Eki

@hila_chefer is presenting “From Still to Moving: Temporal Priors as Creative Tools for Personalization” at our @P13N_Workshop at @ICCVConference

English

255

moab.arar retweetledi

Tuna Meral@tunahansalih·21 Eki

@YVinker is giving her talk “Personalization Methods for Design and Artistic Creation” in our @P13N_Workshop at @ICCVConference

English

2.3K

moab.arar@ArarMoab·21 Eki

@kusichan Looks good 😍😍😍😍

English

moab.arar retweetledi

Yael Vinker🎗@YVinker·19 Eki

Time update: I'll be presenting at 11:00 AM (instead of 11:45) at the AI4VA Workshop. See you there!

Yael Vinker🎗@YVinker

AI for Visual Arts Workshop Sun 19.10 • 11:45 • Room 313A I'll talk about generative tools for designers beyond pixel generation, showing examples from our recent works NeuralSVG and SketchAgent 🔗 sites.google.com/view/ai4vaiccv…

English

848

moab.arar@ArarMoab·19 Eki

@CSProfKGD @ahmadsalimi_ Hats off for caring so much for your students!

English

162

Kosta Derpanis (sabbatical in Munich 🇩🇪)@CSProfKGD·19 Eki

One day left @ahmadsalimi_ until your MSc defence! It’s been an amazing journey with you, now bring it home 💪

Kosta Derpanis (sabbatical in Munich 🇩🇪) tweet media

English

7.4K

moab.arar@ArarMoab·18 Eki

@junyanz89 You can’t miss that famous Autoencoder sketch from pix2pix and cycle-gan. Nice work!

English

Jun-Yan Zhu@junyanz89·18 Eki

Check out our new unpaired learning method for instruction-following image editing models.

Nupur Kumari@nupurkmr9

🚀 New preprint! We present NP-Edit, a framework for training an image editing diffusion model without paired supervision. We use differentiable feedback from Vision-Language Models (VLMs) combined with distribution-matching loss (DMD) to learn editing directly. webpage: nupurkmr9.github.io/npedit/ w/ @ShengYuWang6,Cherry (N.X.) Zhao, @YotamNitzan, Yuheng Li, Krishna Kumar Singh, @rzhang88, @elishechtman, @junyanz89, @xxunhuang

English

10.9K

moab.arar@ArarMoab·18 Eki

It was really difficult to use Google to find solutions to undergraduate calculus problems. Maybe AGI is not around the corner… but search is improving 😅

Thomas Bloom@thomasfbloom

@kevinweil Hi, as the owner/maintainer of erdosproblems.com, this is a dramatic misrepresentation. GPT-5 found references, which solved these problems, that I personally was unaware of. The 'open' status only means I personally am unaware of a paper which solves it.

English

295

moab.arar@ArarMoab·14 Eki

Decreasing validation loss gives me the adrenaline - can’t sleep now!

English

148

moab.arar@ArarMoab·11 Eki

@jon_barron it's all in the distribution

GIF

English

189

Jon Barron@jon_barron·11 Eki

Upside down Sora continues to be my favorite kind of Sora. The app (well, the web interface, I'm on android) doesn't allow simple post-generation mirroring like this, so only raw generations can be shared on the platform, hence me sharing it here. Maybe they'll add an editor?

Jon Barron@jon_barron

Sora 2 also seems to be very sensitive to this "generate upside down and then flip it" trick, even moreso than Veo 2 was. Gravity and orientation is really baked into the weights (totally reasonable bias to have IMHO).

English

10.9K

Keşfet

@GoogleDeepMind @EthanHe_42 @DrJimFan @assaf_singer @YarivLior @NoamRot @orlitany @mann_amir_