Ge Ya (Olga) Luo

22 posts

@OOOOLGAluo

Montréal, Québec · Joined February 2014
91 Following · 58 Followers
Ge Ya (Olga) Luo retweeted
Animesh Karnewar PhD@AnimeshKarnewar·
AI video generation is poised to be the next revolution, but its heavy computational demands limit real-world deployment. Excited to share Neodragon, my first project after the PhD — a significant step toward efficient, on-device video generation. Webpage: qualcomm-ai-research.github.io/neodragon
3 replies · 14 reposts · 148 likes · 42.3K views
Ge Ya (Olga) Luo retweeted
Saba@Saba_A96·
We built a new autoregressive + RL image editing model using a strong verifier, and it beats SOTA diffusion baselines using 5× less data. 🔥 EARL: a simple, scalable RL pipeline for high-quality, controllable edits. 🧵 1/
3 replies · 26 reposts · 68 likes · 10.8K views
Ge Ya (Olga) Luo retweeted
el.cine@EHuanglu·
omg.. this can't be real. China's 4DV AI just dropped 4D Gaussian Splatting: you can turn 2D video into 4D with sound.. imagine.. we will be able to change the camera angle and zoom in/out while watching movies. 5 examples:
720 replies · 3.8K reposts · 35.9K likes · 3.6M views
Ge Ya (Olga) Luo retweeted
Anthony Gosselin@antho_gosselin·
🚗💥Introducing Ctrl-Crash: controllable video generation for autonomous driving! SOTA models struggle to generate physically realistic car crashes. We propose an image2video diffusion model with bounding box and crash type control. Website: anthonygosselin.github.io/Ctrl-Crash-Pro… 🧵->
2 replies · 13 reposts · 24 likes · 7.7K views
Anthony Gosselin@antho_gosselin·
The generated clips are 1) Left 2) Right 3) Left. Details such as text and blurriness may give it away. However, Ctrl-Crash produces realistic physical details such as cars shaking and items on the dash moving on impact. We trust that better visuals can be achieved through scaling.
1 reply · 0 reposts · 2 likes · 147 views
Ge Ya (Olga) Luo@OOOOLGAluo·
Yes! As generated samples approach dataset quality, accurate distribution distance measurement necessitates larger sample sizes and refined feature spaces, highlighting sample efficiency and feature space quality as key drivers of metric reliability.
Songwei Ge@Songwei_Ge

Though video generative models have made impressive progress, their automatic evaluation metric is still falling behind! Glad to see analysis and advances in video generation evaluation.

0 replies · 0 reposts · 4 likes · 279 views
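Olga's point about sample size can be illustrated with a toy experiment. This is a minimal sketch under simplifying assumptions (1-D Gaussian features standing in for I3D/VideoMAE embeddings, not the actual FVD/JEDi pipeline): even when "real" and "generated" features come from the identical distribution, the estimated Fréchet distance is biased upward at small sample sizes, so near-dataset-quality samples need many more samples to measure reliably.

```python
import numpy as np

def frechet_1d(x, y):
    """Fréchet distance between 1-D Gaussian fits of two sample sets.
    For Gaussians: d^2 = (mu_x - mu_y)^2 + (sigma_x - sigma_y)^2."""
    return (x.mean() - y.mean()) ** 2 + (x.std() - y.std()) ** 2

rng = np.random.default_rng(0)

# Both sample sets come from the SAME distribution, so the true
# distance is 0; anything measured is pure estimation error, which
# shrinks roughly as 1/n.
for n in (100, 10_000):
    est = np.mean([
        frechet_1d(rng.normal(size=n), rng.normal(size=n))
        for _ in range(200)
    ])
    print(f"n={n:>6}: mean estimated distance {est:.5f}")
```

The small-n estimate is noticeably larger than the large-n one despite the true distance being zero, which is exactly why comparing near-perfect generators demands larger sample sizes and better-behaved feature spaces.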
Ge Ya (Olga) Luo retweeted
Hassan Al-Farhan@HAF_tech·
@jm_alexia I'm sold on JEDi. FVD has been frustratingly limited for video gen models. Great to see innovation like this pushing the field forward.
0 replies · 2 reposts · 2 likes · 408 views
Ge Ya (Olga) Luo retweeted
Benno Krojer@benno_krojer·
AURORA 🌌 is now accepted as a Spotlight at NeurIPS 🥂 We wondered: can a model do *controlled* video generation in a *single* step? So we built a dataset + model for "taking actions" on images via editing, or what you could call single-step controlled video generation.
Benno Krojer@benno_krojer

Did you miss the recent Auroras? No problem! ✨🎆 Super excited to share AURORA, a *general* image editing model + high-quality data that improves where prev work fails the most: Performing *action or movement* edits, i.e. a kind of world model setup Insights/Details ⬇️

5 replies · 17 reposts · 82 likes · 18.9K views
Ge Ya (Olga) Luo@OOOOLGAluo·
Huge thanks to @mufan_li and @benno_krojer for sharing your expertise and feedback! Additional kudos to @Songwei_Ge for his pioneering research and expert guidance on establishing the VideoMAE experiment framework. 💐
0 replies · 0 reposts · 3 likes · 224 views
Ge Ya (Olga) Luo retweeted
Alexia Jolicoeur-Martineau@jm_alexia·
From Meta Movie Gen paper: "automated metrics such as FVD and IS do not correlate with human evaluation scores for video quality, and do not provide useful signal for model development or comparison" Funny, because we actually solved this exact problem! New metric coming soon!😎
Ishan Misra@imisra_

So, this is what we were up to for a while :) Building SOTA foundation models for media -- text-to-video, video editing, personalized videos, video-to-audio One of the most exciting projects I got to tech lead at my time in Meta!

3 replies · 4 reposts · 78 likes · 9.8K views
Ge Ya (Olga) Luo retweeted
Rabiul Awal@_rabiulawal·
🚀 Introducing VisMin (arxiv.org/abs/2407.16772), a benchmark for Visual Minimal Change Understanding! It evaluates VLMs' fine-grained understanding of objects, attributes, relationships, and counting. Code, models & datasets at vismin.net [1/13] 🧵
3 replies · 22 reposts · 55 likes · 17.3K views
Alexia Jolicoeur-Martineau@jm_alexia·
7 years ago we left the parents' basement for a tiny 400 sq ft apartment. Today, we closed on our dream home, in the city, within walking distance of work and groceries, no compromise!
9 replies · 2 reposts · 110 likes · 17.3K views
Ge Ya (Olga) Luo retweeted
Luke Rowe@Luke22R·
How can we generate interesting edge cases to test our autonomous vehicles in simulation? We propose CtRL-Sim, a novel framework for closed-loop behaviour simulation that enables fine-grained control over agent behaviours. 🧵 1/8 arxiv.org/abs/2403.19918
1 reply · 14 reposts · 36 likes · 6.7K views