Yihong Sun

13 posts

Yihong Sun banner
Yihong Sun

Yihong Sun

@YihongSun_

CS PhD student @Cornell | Ugrad @JohnsHopkins

Ithaca, NY Katılım Ocak 2021
94 Takip Edilen81 Takipçiler
Yihong Sun retweetledi
Albert Tseng
Albert Tseng@tsengalb99·
Excited to announce our #AISTATS📜on training LLMs with MXFP4! We use stoch. rounding and random Hadamard transforms (all fast on HW) to get low-variance, unbiased gradient estimates with MXFP4 GEMMs. We get a ~30% speedup over FP8 with almost no PPL gap! arxiv.org/abs/2502.20586
Albert Tseng tweet mediaAlbert Tseng tweet media
English
1
10
24
3.2K
Yihong Sun
Yihong Sun@YihongSun_·
Happy to get feedback and questions! For more details, please check out our paper! See you on Friday 10/4 at Poster Session 7, #73! 😃 Huge thanks to my advisor, @BharathHarihar3! This wouldn’t have been possible without your support! (7/n)
English
0
0
3
124
Yihong Sun
Yihong Sun@YihongSun_·
Results: We obtain significant improvements over previous unsupervised object detection methods across multiple datasets & metrics, with notable improvements in Box AR by 6.6 on Waymo Open, 5.9 on nuScenes, and 8.2 on KITTI compared to CutLER. (6/n)
English
1
0
3
124
Yihong Sun
Yihong Sun@YihongSun_·
📢Excited to share our latest @eccvconf work MOD-UV on learning object detectors from unlabeled videos only! 🔗: mod-uv.github.io I will give a talk at Wild3D workshop (cc: @weichiuma) on Monday 9/30 and come chat with us on Friday 10/4 at Poster Session 7, #73!
English
1
2
15
1.1K
Yihong Sun retweetledi
Jieneng Chen
Jieneng Chen@jieneng_chen·
Thanks @arankomatsuzaki for sharing our #CVPR2024 ViTamin work! Collaborated with @yucornetto1, Xiaohui Shen, @YuilleAlan and Liang-Chieh Chen TLDR: We design a scalable vision model in the vision-language era, advancing the limits for VLMs and multi-modal LLMs!
Aran Komatsuzaki@arankomatsuzaki

ViTamin: Designing Scalable Vision Models in the Vision-Language Era repo: github.com/Beckschen/ViTa… abs: arxiv.org/abs/2404.02132

English
0
8
19
8.2K
Yihong Sun retweetledi
Gemmechu Hassena
Gemmechu Hassena@GemmechuHassena·
Excited to share our work, ObjectCarver! Given multiview images and click points on one image, ObjectCarver decomposes scenes into separate objects, providing high-quality 3D surfaces while handling occlusion and close-contact objects. (1/6) website: objectcarver.github.io
English
5
12
60
10.2K