Junru Lin

29 posts

Junru Lin

@_Linjunru

CS undergrad @UofT.

Tham gia Ağustos 2022

249 Đang theo dõi31 Người theo dõi

Junru Lin đã retweet

Hansheng Chen@HanshengCh·17 Eki

Excited to announce a new track of accelerating Generative AI: pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation github.com/Lakonik/piFlow Distill 20B flow models now using just an L2 loss via imitation learning for SOTA diversity and teacher-aligned quality.

English

154

35.9K

Junru Lin đã retweet

Jia-Bin Huang@jbhuang0604·3 Ağu

Woohoo! Imagine, Verify, Execute (IVE) is accepted to CoRL 2025! 🎉 Congrats to the incredible @umdcs students Seungjae Lee @JayLEE_0301, Daniel Ekpo (@daniekpo7), Haowen Liu!

Jia-Bin Huang@jbhuang0604

Exploration is key for robots to generalize, especially in open-ended environments with vague goals and sparse rewards. BUT, how do we go beyond random poking? Wouldn't it be great to have a robot that explores an environment just like a kid? Introducing Imagine, Verify, Execute (IVE)! IVE leverages Vision-Language models to • extract semantic scene graphs, • imagine novel scenes, • predict their physical plausibility, and • generate executable sequences. IVE is a memory-guided agentic exploration framework that operates fully automatically, enabling more diverse and meaningful exploration.

English

15.7K

Junru Lin đã retweet

Roman Bachmann@roman__bachmann·11 Tem

We will present FlexTok at #ICML2025 on Tuesday! Drop by to chat with @JRAllardice and me if you're interested in tokenization, flexible ways to encode images, and generative modeling. 📆 Tue, Jul 15, 16:30 PDT 📍 East Exhibition Hall, Poster E-3010 🌐 flextok.epfl.ch

Roman Bachmann@roman__bachmann

Have you ever been bothered by the constraints of fixed-sized 2D-grid tokenizers? We present FlexTok, a flexible-length 1D tokenizer that enables autoregressive models to describe images in a coarse-to-fine manner. flextok.epfl.ch arxiv.org/abs/2502.13967 🧵 1/n

English

1.2K

Junru Lin đã retweet

Yunqi (Richard) Gu@richard_yunqigu·12 Nis

Which multimodal LLM should you be using to edit graphics in Blender? Today, we’re releasing our #CVPR2025 Highlight🌟 work, #BlenderGym 🏋️‍♀️, the first agentic 3D graphics editing benchmark that will tell you exactly how multimodal LLMs compare in their Blender-editing skills. What'd we find? 🧵👇

English

22.6K

Junru Lin đã retweet

Hansheng Chen@HanshengCh·8 Nis

Excited to share our work: Gaussian Mixture Flow Matching Models (GMFlow) github.com/lakonik/gmflow GMFlow generalizes diffusion models by predicting Gaussian mixture denoising distributions, enabling precise few-step sampling and high-quality generation.

English

127

13.3K

Junru Lin đã retweet

Roman Bachmann@roman__bachmann·6 Nis

Happy to share that we released FlexTok code and models on github.com/apple/ml-flext…. Try them with our interactive @huggingface demo on huggingface.co/spaces/EPFL-VI…

Afshin Dehghan@afshin_dn

Excited to share that we have recently released the source code for FlexTok, bringing a fresh perspective to tokenization. Code on GitHub: lnkd.in/g4iNJFmU. Project Page: flextok.epfl.ch #FlexTok #Tokenization #MachineLearning #MLResearch #OpenSource #AI

English

13.5K

Junru Lin đã retweet

Ian Huang@IanHuang3D·27 Mar

🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇

English

377

116.8K

Junru Lin đã retweet

Google DeepMind@GoogleDeepMind·12 Mar

Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…

English

172

449

2.1K

641K

Junru Lin đã retweet

Congyue Deng@CongyueD·11 Mar

In the past, we extended the convolution operator to go from low-level image processing to high-level visual reasoning. Can we also extend physical operators for more high-level physical reasoning? Introducing the Denoising Hamiltonian Network (DHN): arxiv.org/pdf/2503.07596

English

314

41.1K

Junru Lin đã retweet

Koichi Namekata@Koichi_N_·23 Oca

Thrilled to announce that SG-I2V has been accepted at #ICLR2025 ! Huge thanks to the collaborators, reviewers, and ACs. Looking forward to presenting this in Singapore!

Koichi Namekata@Koichi_N_

Thrilled to share SG-I2V, a tuning-free method for trajectory-controllable image-to-video (i2v) generation, solely built on the knowledge present in a pre-trained i2v diffusion model ! kmcode1.github.io/Projects/SG-I2… w/ @sherwinbahmani @Dazitu_616 @yash2kant @igilitschenski @DaveLindell

English

Junru Lin đã retweet

U of T Department of Computer Science@UofTCompSci·7 Oca

Congratulations to @UofTCompSci undergrads Helen Li, Junru Lin, Leo Tenenbaum and Sarah Walker who have received honourable mentions in the @CRAtweets 2024-2025 Outstanding Undergraduate Researcher Award program! cra.org/about/awards/o…

U of T Department of Computer Science tweet media

English

587

Junru Lin đã retweet

Jiaman Li@jiaman01·19 Ara

🔥 Introducing MVLift: Generate realistic 3D motion without any 3D training data - just using 2D poses from monocular videos! Applicable to human motion, human-object interaction & animal motion. Joint work w/ @jiajunwu_cs & Karen 💡 How? We reformulate 3D motion estimation as generating consistent multi-view 2D pose sequences. Our framework uses 2D motion diffusion to progressively establish multi-view consistency, requiring only single-view 2D pose sequences for training. Project: lijiaman.github.io/projects/mvlif… Video with demonstration: youtube.com/watch?v=nffTJH… Paper: arxiv.org/abs/2411.18808

YouTube

English

213

15.6K

Junru Lin đã retweet

Felix Taubner@taubnerfelix·17 Ara

Introducing 🧢CAP4D🧢 CAP4D turns any number of reference images (single, few, and many) into controllable real-time 4D avatars. 🧵⬇️ Website: felixtaubner.github.io/cap4d/ Paper: arxiv.org/abs/2412.12093

English

577

76.3K

Junru Lin đã retweet

Nakayama George@GeorgeNaka40190·10 Ara

Do large multimodal models understand how to make dresses for your winter holiday party💃? We introduce AIpparel, a vision-language-garment model capable of generating and editing simulation-ready sewing patterns from text and images. Project page at georgenakayama.github.io/AIpparel/. [1/n]

English

11.8K

Junru Lin đã retweet

Yue Wang@yuewang314·4 Ara

[Hiring!] I am hiring multiple PhDs @CSatUSC @USCViterbi for this cycle. If you're interested in scene representations, neural simulation, generative AI, and robotics, feel free to mention my name in your application (no need to email). For USC masters/undergrads who're interested in our research, feel free to fill in this form forms.gle/RerZfDqCqmCj8A….

English

269

32.4K

Junru Lin đã retweet

Prime (Shengqu) Cai@prime_cai·28 Kas

Sharing something exciting we've been working on as a Thanksgiving gift: Diffusion Self-Distillation (DSD), which redefines zero-shot customized image generation using FLUX. DSD is like DreamBooth, but zero-shot/training-free. It works across any input subject and desired context—character consistency, item/asset adaptation, scene relighting, and more. It even enables the creation of comics/mangas without any effort in fine-tuning or training a personalized model! 📰 Paper: arxiv.org/abs/2411.18616 🌐 Website: primecai.github.io/dsd/ Team effort with @ericryanchan, @zhang_yunzhi, @GuibasLeonidas, @jiajunwu_cs, and @GordonWetzstein.

English

445

60.4K

Junru Lin đã retweet

Sherwin Bahmani@sherwinbahmani·27 Kas

📢 Excited to share our new work: AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers snap-research.github.io/ac3d We analyze what pre-trained video diffusion transformers understand about 3D and demonstrate dynamic scene generation with 3D control.

English

119

15.6K

Junru Lin đã retweet

Igor Gilitschenski@igilitschenski·26 Kas

I'm recruiting graduate students for Fall 2025 to work at the intersection of Computer Vision, Deep Learning, and Robotics. If you are interested in building a controllable organic simulation engine and enabling safe robot learning, consider applying to UofT's CS PhD program 1/n

English

432

49.6K

Junru Lin đã retweet

Songyou Peng@songyoupeng·21 Eki

Check out our new paper in feed-forward 3DGS model for large scenes! And the code is also available

English

8.2K

Junru Lin@_Linjunru·21 Eki

@lucacarlone1 Interested!

English

420

Luca Carlone@lucacarlone1·21 Eki

If you are applying for #gradschool and have last-minute questions about your application, I'm willing to offer office hours on zoom in early November. I do this with students at MIT and I want to make sure others get the same opportunity. write below if interested.

English

206

116

880

125.9K

Khám phá

@umdcs @JayLEE_0301 @daniekpo7 @JRAllardice @huggingface @UofTCompSci @CRATweets @jiajunwu_cs