Junru Lin

29 posts

Junru Lin banner
Junru Lin

Junru Lin

@_Linjunru

CS undergrad @UofT.

Tham gia Ağustos 2022
249 Đang theo dõi31 Người theo dõi
Junru Lin đã retweet
Hansheng Chen
Hansheng Chen@HanshengCh·
Excited to announce a new track of accelerating Generative AI: pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation github.com/Lakonik/piFlow Distill 20B flow models now using just an L2 loss via imitation learning for SOTA diversity and teacher-aligned quality.
Hansheng Chen tweet media
English
2
27
154
35.9K
Junru Lin đã retweet
Junru Lin đã retweet
Roman Bachmann
Roman Bachmann@roman__bachmann·
We will present FlexTok at #ICML2025 on Tuesday! Drop by to chat with @JRAllardice and me if you're interested in tokenization, flexible ways to encode images, and generative modeling. 📆 Tue, Jul 15, 16:30 PDT 📍 East Exhibition Hall, Poster E-3010 🌐 flextok.epfl.ch
Roman Bachmann@roman__bachmann

Have you ever been bothered by the constraints of fixed-sized 2D-grid tokenizers? We present FlexTok, a flexible-length 1D tokenizer that enables autoregressive models to describe images in a coarse-to-fine manner. flextok.epfl.ch arxiv.org/abs/2502.13967 🧵 1/n

English
0
6
24
1.2K
Junru Lin đã retweet
Yunqi (Richard) Gu
Yunqi (Richard) Gu@richard_yunqigu·
Which multimodal LLM should you be using to edit graphics in Blender? Today, we’re releasing our #CVPR2025 Highlight🌟 work, #BlenderGym 🏋️‍♀️, the first agentic 3D graphics editing benchmark that will tell you exactly how multimodal LLMs compare in their Blender-editing skills. What'd we find? 🧵👇
English
8
38
83
22.6K
Junru Lin đã retweet
Hansheng Chen
Hansheng Chen@HanshengCh·
Excited to share our work: Gaussian Mixture Flow Matching Models (GMFlow) github.com/lakonik/gmflow GMFlow generalizes diffusion models by predicting Gaussian mixture denoising distributions, enabling precise few-step sampling and high-quality generation.
Hansheng Chen tweet media
English
1
32
127
13.3K
Junru Lin đã retweet
Junru Lin đã retweet
Ian Huang
Ian Huang@IanHuang3D·
🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇
English
22
91
377
116.8K
Junru Lin đã retweet
Google DeepMind
Google DeepMind@GoogleDeepMind·
Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…
English
172
449
2.1K
641K
Junru Lin đã retweet
Congyue Deng
Congyue Deng@CongyueD·
In the past, we extended the convolution operator to go from low-level image processing to high-level visual reasoning. Can we also extend physical operators for more high-level physical reasoning? Introducing the Denoising Hamiltonian Network (DHN): arxiv.org/pdf/2503.07596
Congyue Deng tweet media
English
6
59
314
41.1K
Junru Lin đã retweet
Junru Lin đã retweet
Jiaman Li
Jiaman Li@jiaman01·
🔥 Introducing MVLift: Generate realistic 3D motion without any 3D training data - just using 2D poses from monocular videos! Applicable to human motion, human-object interaction & animal motion. Joint work w/ @jiajunwu_cs & Karen 💡 How? We reformulate 3D motion estimation as generating consistent multi-view 2D pose sequences. Our framework uses 2D motion diffusion to progressively establish multi-view consistency, requiring only single-view 2D pose sequences for training. Project: lijiaman.github.io/projects/mvlif… Video with demonstration: youtube.com/watch?v=nffTJH… Paper: arxiv.org/abs/2411.18808
YouTube video
YouTube
English
2
39
213
15.6K
Junru Lin đã retweet
Nakayama George
Nakayama George@GeorgeNaka40190·
Do large multimodal models understand how to make dresses for your winter holiday party💃? We introduce AIpparel, a vision-language-garment model capable of generating and editing simulation-ready sewing patterns from text and images. Project page at georgenakayama.github.io/AIpparel/. [1/n]
English
1
18
68
11.8K
Junru Lin đã retweet
Yue Wang
Yue Wang@yuewang314·
[Hiring!] I am hiring multiple PhDs @CSatUSC @USCViterbi for this cycle. If you're interested in scene representations, neural simulation, generative AI, and robotics, feel free to mention my name in your application (no need to email). For USC masters/undergrads who're interested in our research, feel free to fill in this form forms.gle/RerZfDqCqmCj8A….
English
1
47
269
32.4K
Junru Lin đã retweet
Prime (Shengqu) Cai
Prime (Shengqu) Cai@prime_cai·
Sharing something exciting we've been working on as a Thanksgiving gift: Diffusion Self-Distillation (DSD), which redefines zero-shot customized image generation using FLUX. DSD is like DreamBooth, but zero-shot/training-free. It works across any input subject and desired context—character consistency, item/asset adaptation, scene relighting, and more. It even enables the creation of comics/mangas without any effort in fine-tuning or training a personalized model! 📰 Paper: arxiv.org/abs/2411.18616 🌐 Website: primecai.github.io/dsd/ Team effort with @ericryanchan, @zhang_yunzhi, @GuibasLeonidas, @jiajunwu_cs, and @GordonWetzstein.
English
24
73
445
60.4K
Junru Lin đã retweet
Sherwin Bahmani
Sherwin Bahmani@sherwinbahmani·
📢 Excited to share our new work: AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers snap-research.github.io/ac3d We analyze what pre-trained video diffusion transformers understand about 3D and demonstrate dynamic scene generation with 3D control.
English
6
23
119
15.6K
Junru Lin đã retweet
Igor Gilitschenski
Igor Gilitschenski@igilitschenski·
I'm recruiting graduate students for Fall 2025 to work at the intersection of Computer Vision, Deep Learning, and Robotics. If you are interested in building a controllable organic simulation engine and enabling safe robot learning, consider applying to UofT's CS PhD program 1/n
Igor Gilitschenski tweet media
English
11
82
432
49.6K
Junru Lin đã retweet
Songyou Peng
Songyou Peng@songyoupeng·
Check out our new paper in feed-forward 3DGS model for large scenes! And the code is also available
English
1
6
84
8.2K
Luca Carlone
Luca Carlone@lucacarlone1·
If you are applying for #gradschool and have last-minute questions about your application, I'm willing to offer office hours on zoom in early November. I do this with students at MIT and I want to make sure others get the same opportunity. write below if interested.
English
206
116
880
125.9K