Ziwen Chen

25 posts

Ziwen Chen

Ziwen Chen

@chenziwee

Adobe Research | PhD from OregonState | Computer Vision, 3D Reconstruction, Scene Understanding

San Jose, CA 参加日 Ekim 2009
115 フォロー中147 フォロワー
Ziwen Chen
Ziwen Chen@chenziwee·
Amazing work! This can enable so many futurisitc applications!
Shoubin Yu@shoubin621

Introducing Ego2Web from Google DeepMind and UNC Chapel Hill, accepted to #CVPR2026. AI agents can browse the web. But can they act based on what you see? Existing benchmarks focus only on web interaction while ignoring the real world. Ego2Web bridges egocentric video perception and web execution, enabling agents that can see through first-person video, understand real-world context, and take actions on the web grounded in the egocentric video. This opens a path toward AI assistants that operate seamlessly across physical and digital environments. We hope Ego2Web serves as an important step for building more capable, perception-driven agents. 🧵👇

English
0
0
2
239
Ziwen Chen
Ziwen Chen@chenziwee·
Introducing Long-LRM++ — for feed-forward, high-res, detail-preserving scene reconstruction ✨ Up to 64 960×540 inputs 🔍 Readable text 📉 4× fewer Gaussians ⚡ Real-time rendering 📷 End-to-end from unposed inputs w/ DA3 poses in 11s (w/⬆️ quality than DA3’s own GS predictor ;)
English
3
14
87
18.4K
Ziwen Chen がリツイート
Lu Ling
Lu Ling@LuLing26466911·
Do we really need massive curated 3D scene data for interactive world generation? #SAM3D, #WorldGen say yes. We say no. I-Scene learns better spatial knowlesge using only 25K randomly composed instances. 🔑 Key insight: We reprogram the instance generator to infer support, proximity, and symmetry from purely geometric cues for generating interactive scenes. 🧠 Scene-context attention 👁️ View-centric space 🧱 Random composition beats expensive curation 🌐 luling06.github.io/I-Scene-projec… 💻 github.com/LuLing06/I-Sce… 🧵 Details below [1/6]
English
11
90
719
57.1K
Ziwen Chen がリツイート
AI Research Impact Rankings
AI Research Impact Rankings@ai_impact_rank·
CSRankings counts publication in top conferences to rank professors/universities. But this encourages researchers to pursue quantity rather than quality. We propose impactrank.org, a new university ranking system that tries to measure quality instead of quantity of publications. How can we measure the quality of the publications? We believe that 1) The quality of research is best understood and evaluated by peers in the same research area; 2) With careful and informed use, LLMs can reveal the implicit quality judgments that peers convey through their citation practices and writing across large volumes of scholarly work. Hence, we developed the new ranking system where we analyze research papers from major AI conferences with LLMs. For each paper, we ask an LLM what are the 5 most important papers to this paper. In other words, the five works that most strongly influence the study. By doing this, we trace which papers and authors are consistently seen as inspirational and foundational to new discoveries in the field. We ran the model on all papers from top conferences in machine learning, computer vision, natural language processing and information retrieval from 2020 - 2025, and filtered references to only have those from 2000 onwards. Next, we map these influential authors to their affiliated universities using the CSRankings name–affiliation database. Each time a paper is recognized as one of the “top five references” in another work, its authors and their institutions receive credit. To keep the scoring fair, points are divided by the number of co-authors, ensuring balanced recognition across collaborations. The result is a new kind of academic ranking: one that rewards universities not just for publishing often, but for producing research that endures, inspires, and drives the field forward. This approach highlights scholarly influence and provides students, researchers, and institutions with a clearer picture of where the most impactful work is happening. Note that we believe that CSRankings had substantially improved university rankings in computer science by replacing subjective, reputation-based measures, such as those in US News, with more objective indicators, but the LLM era allows us to do something potentially better! Due to computational resource limits, we were only able to run it with a small 7B language model. It is also a project primarily led by undergraduate and master students from Oregon State University and University of California Santa Cruz. As a result, the system is very much a work in progress and will inevitably contain errors and blind spots. We actively welcome community feedback, new collaborators and contributions of GPU compute so that we can run larger LLMs, obtain more reliable results and improve the methodology.
AI Research Impact Rankings tweet mediaAI Research Impact Rankings tweet mediaAI Research Impact Rankings tweet mediaAI Research Impact Rankings tweet media
English
14
40
369
175.8K
Ziwen Chen
Ziwen Chen@chenziwee·
Long-LRM will be presented tomorrow at #ICCV2025 Poster Session 1 (11:30 AM) as a Highlight Paper! 🚀The first generalizable GS–based approach for high-res, wide-coverage 3D reconstruction in 1 second. Come check it out & chat with us! 🧩Code & weights: github.com/arthurhero/Lon…
English
0
0
3
422
Ziwen Chen がリツイート
Haian Jin
Haian Jin@Haian_Jin·
Novel view synthesis has long been a core challenge in 3D vision. But how much 3D inductive bias is truly needed? —Surprisingly, very little! Introducing "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"—a fully transformer-based approach that enables scalable, generalizable, and fully data-driven novel view synthesis, from sparse posed inputs. 🧵(1/6) Project Page: haian-jin.github.io/projects/LVSM/
English
22
93
575
114.7K
Ziwen Chen
Ziwen Chen@chenziwee·
Hate waiting 10 minutes for 3D GS to render your favorite indoor or outdoor scenes? ⏳ Our feed-forward solution, Long-LRM, cuts it down to just 1 second! ⚡️ With a straightforward mix of Mamba2 and transformer, it scales up to 32 high-res input images. arthurhero.github.io/projects/llrm/
English
5
36
208
17.2K
kaushikpatnaik
kaushikpatnaik@kaushikpatnaik·
Was really fun working/helping out on this project. We always think of images in grid, and this work went in a different direction.
Ziwen Chen@chenziwee

#CVPR2023 Want to zoom in and segment tiny tiny people in the background without doubling input resolution? Say goodbye to grid-like strided convolutions, and instead use hierarchical, adaptive downsampling from AutoFocusFormer (AFF)! github.com/apple/ml-autof…

English
1
0
5
394