Wenyan Li

40 posts

@Wenyan62

PhD student at the CoAStaL NLP Group, University of Copenhagen. Former researcher at Comcast AI and SenseTime.

Joined September 2020
201 Following · 285 Followers
Pinned Tweet
Wenyan Li
Wenyan Li@Wenyan62·
Happy to share (with a bit of delay tho) our paper on quantifying visual information loss in VLMs --- "Lost in Embeddings: Information Loss in Vision-Language Models" is accepted to EMNLP 2025 findings: arxiv.org/pdf/2509.11986 💃code is also released: github.com/lyan62/vlm-inf…
Abraham Owodunni
Abraham Owodunni@AbrahamOwos·
@Wenyan62 I really like this paper 👏👏. Read it and shared with some friends.
Wenyan Li
Wenyan Li@Wenyan62·
I will be presenting our Lost in Embeddings poster at EMNLP! Hope to see many old and new friends in Suzhou!🤗🤗 📍Time/Date: Fri. Nov 7 at 12:30-13:30 Location: Hall C Also happy to chat about anything related to VLMs, RAG, and, more recently, fintech. #EMNLP2025
Wenyan Li@Wenyan62

Happy to share (with a bit of delay tho) our paper on quantifying visual information loss in VLMs --- "Lost in Embeddings: Information Loss in Vision-Language Models" is accepted to EMNLP 2025 findings: arxiv.org/pdf/2509.11986 💃code is also released: github.com/lyan62/vlm-inf…

Himanshu Kumar
Himanshu Kumar@codewithimanshu·
@Wenyan62 Sounds exciting, Wenyan! Suzhou sounds lovely, and your work on VLMs is always top-notch, I must say.
Wenyan Li retweeted
Raphael Tang
Raphael Tang@ralph_tang·
📢Our new paper critically examines arena-style LLM evaluation, e.g., LMArena, questioning whether draws actually mean equal model ability. TL;DR: simply ignoring draws improves rating systems by 1-3%, and query difficulty/subjectivity relate more strongly to draws than model ratings do.
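The draw-handling idea above can be sketched with a minimal Elo-style update (a generic illustration, not the paper's actual rating system; the function name and parameters are mine):

```python
def elo_update(r_a, r_b, outcome, k=32, ignore_draws=True):
    """One Elo update for a head-to-head model battle.
    outcome: 1.0 if model A wins, 0.0 if model B wins, 0.5 for a draw.
    With ignore_draws=True, drawn battles leave both ratings untouched."""
    if outcome == 0.5 and ignore_draws:
        return r_a, r_b
    # probability that A beats B under the current ratings
    expected_a = 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))
    delta = k * (outcome - expected_a)
    return r_a + delta, r_b - delta

print(elo_update(1500, 1500, 1.0))  # A wins: (1516.0, 1484.0)
print(elo_update(1500, 1500, 0.5))  # draw is skipped: (1500, 1500)
```

Dropping the `outcome == 0.5` battles entirely is the "simply ignoring draws" variant the tweet describes.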
Srishti
Srishti@_srishtiyadav·
@Wenyan62 Congrats, Dr. Wenyan! 💚
Wenyan Li
Wenyan Li@Wenyan62·
Happy to share that I’ve successfully defended my PhD today 🎉 A big thank you to my committee members Manex Aguirrezabal Zabaleta, Anna Korhonen, and Charlie Clark ❤️ Very grateful for all the support and encouragement from my supervisor Anders Søgaard and colleagues at CoAStaL ❤️
Wenyan Li
Wenyan Li@Wenyan62·
@gietema here we could probably phrase it better. A large drop in overlap ratio indicates that the connector is changing neighborhood structure substantially, which may suggest geometric distortion beyond what is required for task alignment.
Jochem Gietema
Jochem Gietema@gietema·
@Wenyan62 Hi, thanks for this, congrats! Not sure I understand why an optimal connector would maintain the same k-NN sets? Isn't the purpose of the connector to align the embedding for a downstream task, in which case the structure of the original embedding cannot be assumed to be optimal?
Wenyan Li
Wenyan Li@Wenyan62·
@gietema Hi Jochem, thanks for the question. We agree that the purpose of the connector is to align the embedding spaces. The idea is not that preserving kNN neighborhoods is the end goal, but that the overlap ratio provides a way to quantify how much the local structure is perturbed.
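The overlap ratio discussed in this thread can be sketched as follows, assuming `pre` and `post` are matrices of the same image's patch embeddings before and after the connector (a toy illustration; the names and the cosine-based k-NN choice are mine, not necessarily the paper's exact setup):

```python
import numpy as np

def knn_overlap_ratio(pre, post, k=10):
    """Average fraction of each point's k nearest neighbors (by cosine
    similarity) that survive the mapping from `pre` to `post`."""
    def knn_sets(x):
        xn = x / np.linalg.norm(x, axis=1, keepdims=True)
        sim = xn @ xn.T
        np.fill_diagonal(sim, -np.inf)  # a point is not its own neighbor
        # indices of the k most similar points for each row
        return [set(row.argsort()[-k:]) for row in sim]
    before, after = knn_sets(pre), knn_sets(post)
    return float(np.mean([len(b & a) / k for b, a in zip(before, after)]))

rng = np.random.default_rng(0)
x = rng.normal(size=(200, 64))
print(knn_overlap_ratio(x, x))  # identity mapping preserves structure: 1.0
```

A ratio near 1 means the connector keeps each patch's local neighborhood intact; a large drop signals substantial perturbation of the neighborhood structure.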
Lei Li
Lei Li@_TobiasLee·
@Wenyan62 Thanks for sharing! Very insightful for understanding the ViT embeddings
Wenyan Li retweeted
Afra Amini
Afra Amini@afra_amini·
Current KL estimation practices in RLHF can generate high variance and even negative values! We propose a provably better estimator that only takes a few lines of code to implement.🧵👇 w/ @xtimv and Ryan Cotterell paper: arxiv.org/pdf/2504.10637 code: github.com/rycolab/kl-rb
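Background on the negative-values point: the naive Monte Carlo estimator averages log p(x) − log q(x) over samples x ~ p, and individual terms (and hence the mean) can be negative. A widely used non-negative alternative is the so-called k3 estimator from John Schulman's blog note; this sketch contrasts the two and is not the paper's proposed estimator:

```python
import math

def kl_naive(logp, logq):
    """Sample mean of log p(x) - log q(x), x ~ p: unbiased,
    but individual terms (and the mean) can be negative."""
    return sum(lp - lq for lp, lq in zip(logp, logq)) / len(logp)

def kl_k3(logp, logq):
    """k3 estimator: mean of (r - 1) - log r with r = q(x)/p(x).
    Each term is >= 0, so the estimate is never negative."""
    terms = []
    for lp, lq in zip(logp, logq):
        r = math.exp(lq - lp)
        terms.append((r - 1.0) - math.log(r))
    return sum(terms) / len(terms)

# a sample where q assigns higher log-probability than p
print(kl_naive([0.0], [1.0]))       # -1.0: a negative "divergence"
print(kl_k3([0.0], [1.0]) >= 0.0)   # True (the estimate is e - 2)
```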
Wenyan Li retweeted
Chengzu Li
Chengzu Li@li_chengzu·
Forget just thinking in words. 🚀 New Era of Multimodal Reasoning🚨 🔍 Imagine While Reasoning in Space with MVoT Multimodal Visualization-of-Thought (MVoT) revolutionizes reasoning by generating visual "thoughts" that transform how AI thinks, reasons, and explains itself.
Wenyan Li
Wenyan Li@Wenyan62·
🎉🎉
Wenyan Li
Wenyan Li@Wenyan62·
🍗🍗I will present FoodieQA in person at #EMNLP2024😋😋 Looking forward to meeting old and new friends! Feel free to drop by! (and have some snacks) ⏰ Nov, 13th (Wed) 16:00, In-Person Poster Session E (Riverfront Hall) I'm also on the job market and would be happy to chat :)
Wenyan Li retweeted
Xinyu Crystina Zhang
Xinyu Crystina Zhang@crystina_z·
1/7 🚨non-LLM paper alert!🚨 Humans' perception of a sentence is quite robust to interchanging words with similar meanings, not to mention semantically equivalent words across different languages. How about language models? In our recent work, we measure the role of subword-level shared semantics in multilingual LMs. tl;dr: semantically similar subwords could largely *share the same word embedding!* arxiv link: arxiv.org/abs/2411.04530
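The sharing idea in the thread above can be illustrated with a toy merge that ties the embedding rows of semantically similar subwords (the vocabulary, pairing, and mean-merging rule here are illustrative, not the paper's method):

```python
import numpy as np

def tie_subword_embeddings(emb, vocab, pairs):
    """Replace the rows of each semantically similar subword pair with
    their mean, so both subwords share one embedding vector."""
    emb = emb.copy()
    for a, b in pairs:
        ia, ib = vocab[a], vocab[b]
        shared = (emb[ia] + emb[ib]) / 2.0
        emb[ia] = shared
        emb[ib] = shared
    return emb

vocab = {"dog": 0, "hund": 1, "cat": 2}  # toy cross-lingual vocabulary
emb = np.random.default_rng(0).normal(size=(3, 4))
tied = tie_subword_embeddings(emb, vocab, [("dog", "hund")])
print(np.array_equal(tied[0], tied[1]))  # True: "dog" and "hund" now share
print(np.array_equal(tied[2], emb[2]))   # True: "cat" is untouched
```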