Vincent Qin

1.5K posts

@AlphaRealcat

⭐️Focusing on Visual Localization, SfM and SLAM.

Joined March 2022
416 Following · 369 Followers
Vincent Qin retweeted
Johan Edstedt @Parskatt
Introducing LoMa, the next generation of feature matcher!
8
35
292
35.8K
Vincent Qin retweeted
Zhenjun Zhao @zhenjun_zhao
Fisheye3R: Adapting Unified 3D Feed-Forward Foundation Models to Fisheye Lenses
Ruxiao Duan, Erin Hong, Dongxu Zhao, Eric Turner, Alex Wong, Yunwen Zhou
tl;dr: in title
arxiv.org/abs/2603.28896
0
2
34
1.7K
Vincent Qin retweeted
Zhenjun Zhao @zhenjun_zhao
TerraSky3D: Multi-View Reconstructions of European Landmarks in 4K
Mattia D'Urso, Yuxi Hu, Christian Sormann, Mattia Rossi, Friedrich Fraundorfer
tl;dr: new 3D dataset
arxiv.org/abs/2603.28287
1
4
32
1.6K
Gabriele Berton @gabriberton
I have joined @GoogleDeepMind!
I'll be training VLMs, and I'll still keep posting about the latest developments in AI, Computer Vision and LLMs.
So no more posts on PyTorch tricks. I might post about JAX.
Stay tuned...
122
64
3.6K
145.5K
Gabriele Berton @gabriberton
VisMatch is on PyPI!
VisMatch is a wrapper for image matching models, like LightGlue, RoMa-v2, MASt3R, LoFTR, and 50+ more!
It's literally as simple as:
pip install vismatch
vismatch-match --inputs img0 img1 --matcher choose_any
To run image matching on any 2 images [1/4]
11
53
416
50.2K
Vincent Qin retweeted
Zhenjun Zhao @zhenjun_zhao
FrameVGGT: Frame Evidence Rolling Memory for streaming VGGT
Zhisong Xu, Takeshi Oishi
tl;dr: not token-level compression, but block-level bounded retention
arxiv.org/abs/2603.07690
0
5
33
1.8K
小互 @xiaohu
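OpenClaw AI Agent ("Little Crayfish") capability leaderboard
It specifically tests the success rate of each vendor's large model on real coding tasks within the OpenClaw framework.
A standardized set of OpenClaw Agent tasks is run against every model and scored through automated checks plus LLM review, measuring each model's task success rate.
The top three are:
Gemini 3 Flash Preview
MiniMax M2.1
Kimi K2.5
Next come:
Claude Sonnet 4.5
Gemini 3 Pro Preview
Claude Haiku 4.5
Claude Opus 4.6
All three Claude family models are above 90%, while GPT-5.2 scores only 65.6% and ranks near the bottom; DeepSeek V3.2 is around 82%.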
25
15
87
30.6K
Vincent Qin retweeted
Zhiwen(Aaron) Fan @zhiwen_fan_
What happens when VLMs meet 3D foundation models? See VLM-3R (CVPR 2026). VLM-3R links a vision-language model (e.g., Qwen) with 3D geometric foundation models (e.g., CUT3R) at metric scale. Given an uncalibrated video, it moves beyond pixels to perceive and reason in 3D space. Code (open source): vlm-3r.github.io
1
16
143
10.5K
Vincent Qin retweeted
sasaki@engineer @rsasaki0109
VLG-Loc
Vision-Language Global Localization (VLG-Loc) is a global localization method that uses camera images and a human-readable labeled footprint map containing only names and areas of distinctive visual landmarks.
github.com/CyberAgentAILa…
1
9
48
2.9K
nmsl❤️ @yzly1
@zhenjun_zhao I think the results of XFeat and RIPE on MegaDepth seem unusually low. Are these the originally reported results, or were they obtained under different settings?
1
0
1
152
Vincent Qin retweeted
Zhenjun Zhao @zhenjun_zhao
From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection
Yepeng Liu, Hao Li, Liwen Yang, Fangzhen Li, Xudi Ge, Yuliang Gu, kuang Gao, Bing Wang, Guang Chen, Hangjun Ye, Yongchao Xu
tl;dr: multi-view version of RL-based method (RFP/RIPE) for detection; RDD as backbone
no eval. on IMC
arxiv.org/abs/2602.20630
1
3
26
1.8K
Vincent Qin retweeted
Zhenjun Zhao @zhenjun_zhao
Have We Mastered Scale in Deep Monocular Visual SLAM? The ScaleMaster Dataset and Benchmark
Hyoseok Ju, Bokeon Suh, @GiseopK
tl;dr: in title
arxiv.org/abs/2602.18174
0
9
42
2.6K
Vincent Qin retweeted
Yiwen Zhang @YiwenZhangYZ
🚀 #CVPR2026 Accepted!🚀 Thrilled to share that my first-authored undergraduate paper, “Emergent Extreme-View Geometry in 3D Foundation Models,” has been accepted to CVPR 2026! 🎉 Looking forward to seeing many of you in Denver! ✈️ Project page: ext-3dfms.github.io
3
12
122
5.7K