Weize Li retweetledi
Weize Li
14 posts

Weize Li
@WeizeLi24
Research Engineer@TARS Robotics Incoming ECE PhD @ClemsonUniv
Shanghai, China Katılım Mayıs 2022
254 Takip Edilen13 Takipçiler
Weize Li retweetledi

LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
Zhijian Shu, @_cheng_lin, Tao Xie, Wei Yin, Ben Li, Zhiyuan Pu, @WeizeLi24, Yao Yao, Xun Cao, @gingertata, @xxlong0
tl;dr: pixel gradient+token variance->geometric importance->geometry-aware feature map->partition & group & merge tokens
arxiv.org/abs/2512.04939




Suomi
Weize Li retweetledi
Weize Li retweetledi

Excited to share our #CVPR2024 highlight paper "Move as You Say, Interact as You Can", which employs scene affordance as an intermediate representation for language-guided human motion generation.
Project: afford-motion.github.io
Paper: arxiv.org/abs/2403.18036
English
Weize Li retweetledi

I wrote a tutorial on diffusion models for undergrad and grad students. I tried my best to give intuitive explanations for complicated equations.
Your feedback is much appreciated
Thanks to those who suggested various reading materials to me
arxiv.org/abs/2403.18103

English
Weize Li retweetledi

GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping.
Checkout our page mrsecant.github.io/GaussianGraspe…… for more details.
Compared to LERF, faster, more accurate 3D seg, more robust grasp.
#GaussianSplatting #robotics #grasp
English
Weize Li retweetledi

Introducing 𝐌𝐨𝐛𝐢𝐥𝐞 𝐀𝐋𝐎𝐇𝐀🏄 -- Hardware!
A low-cost, open-source, mobile manipulator.
One of the most high-effort projects in my past 5yrs! Not possible without co-lead @zipengfu and @chelseabfinn.
At the end, what's better than cooking yourself a meal with the 🤖🧑🍳
English

Check out our poster on Pose-agnostic Anomaly Detection (PAD) @NeurIPSConf and see how we leverage #NeRF to detect🔍anomalies on production lines. No need for point clouds or scanners, just a single camera📸 for comprehensive 360-degree object inspection.

English
Weize Li retweetledi

Multimodal reasoning is hard. Even the best LMMs struggle with counting😥 Any fix for it?
Introduce VPD from @GoogleAI: we teach LMMs multimodal CoT reasoning with data synthesized from LLM + vision tools, and achieve new SOTAs on many multimodal tasks!🥳
arxiv.org/abs/2312.03052

English
Weize Li retweetledi

Check out PAD! A fresh dataset & benchmark for pose-agnostic anomaly detection in object recognition. Comes with Multi-pose Anomaly Detection dataset & OmniposeAD method. Dive in for more! #AI #llm #arXiv arxiv.org/abs/2310.07716…"
English
Weize Li retweetledi

Such a cool paper! Jointly optimizing shape parameters, albedo, and roughness from a few reference images through differentiable signed distance function rendering.
By @DelioVicini, Sébastien Speierer, and @wenzeljakob.
rgl.epfl.ch/publications/V…
English
Weize Li retweetledi

I wanted to model my hotel room this week and decided to give @Polycam3D's roomplan mode a whirl. 👏 Well done! They added a wonderful touch by putting 3D objects in the output that are actual representations of the room objects. Not just geometric shapes.
Why aren't more companies using Roomplan in their apps?
#Computervision #AEC
English
Weize Li retweetledi

We are adding support for plugins to ChatGPT — extensions which integrate it with third-party services or allow it to access up-to-date information. We’re starting small to study real-world use, impact, and safety and alignment challenges: openai.com/blog/chatgpt-p…
English






