Dongping Chen

13 posts

Dongping Chen banner
Dongping Chen

Dongping Chen

@Dongping0612

Ph.D. Student @UofMaryland | Incoming Intern @Adobe | Ex intern @uwcse @RAIVNLab Agentic AI / Multimodal / Data-Centric AI

Greenbelt, MD Katılım Kasım 2023
36 Takip Edilen13 Takipçiler
Dongping Chen retweetledi
Weikai Huang
Weikai Huang@weikaih04·
After playing with AI2Thor for half year, I was again and again realize its programmatical design with accurate gt provide the best playground for many projects. It is excited to see MolmoSpaces bring the Ai2Thor into next chapter. Can’t wait to try them!
Ai2@allen_ai

Introducing MolmoSpaces, a large-scale, fully open platform + benchmark for embodied AI research. 🤖 230k+ indoor scenes, 130k+ object models, & 42M annotated robotic grasps—all in one ecosystem.

English
0
3
14
2.9K
Dongping Chen retweetledi
Yushi Hu
Yushi Hu@huyushi98·
Reward models make or break post-training for multimodal omni models (e.g., nano banana), yet there’s surprisingly little research on that‼️ We’re releasing MMRB2: new reward benchmark focusing on omni models, spanning T2I, editing, interleaved, and thinking with images 🧵1/n
Yushi Hu tweet media
English
8
43
157
34.4K
Haoquan Fang
Haoquan Fang@hq_fang·
I’m applying to PhD programs in robot learning this cycle and am actively looking for relevant opportunities in this space. My research focuses on developing generalist robotic manipulation policies that leverage strong priors, by jointly optimizing both the data and the models. If you know of opportunities or are open to chatting, please ping me!
English
7
22
167
34.7K
Dongping Chen retweetledi
Mingmeng GENG
Mingmeng GENG@GengMingmeng·
📢📢New Preprint!📢📢 The Impact of Large Language Models in Academia: from Writing to Speaking arxiv.org/abs/2409.13686 TL;DR: Just the beginning, just a matter of time.
Mingmeng GENG tweet media
English
1
1
4
268
Dongping Chen retweetledi
Yue Huang
Yue Huang@HowieH36226·
Toward Trustworthy Generative Foundation Models (GenFMs) 🚀 🎇After six months of hard work and thanks to the efforts of the entire team, our report on the trustworthiness of generative foundation models (GenFMs) has finally been released. 💡In this work, we: -Developed a standardized set of guidelines by analyzing global AI governance policies - Introduced TrustGen - a dynamic benchmarking platform for evaluating GenFM trustworthiness - Provided an in-depth discussion of the challenges and future directions for trustworthy GenFMs. 📜Check out our report: arxiv.org/pdf/2502.14296 Our toolkit is public at: github.com/TrustGen/Trust…
Yue Huang tweet media
English
2
32
97
15.2K
Dongping Chen retweetledi
Mahtab Bigverdi
Mahtab Bigverdi@MahtabBg·
I'm exited to announce that our work (AURORA) got accepted into #CVPR2025🎉! Special thanks to my coauthors: @ch1m1m0ry0, @cydhsieh, @ethnlshn, @Dongping0612, Linda Shapiro and @RanjayKrishna, This work wouldn’t have been possible without them! See you all in Nashville 🎸!
Mahtab Bigverdi@MahtabBg

Introducing AURORA 🌟: Our new training framework to enhance multimodal language models with Perception Tokens; a game-changer for tasks requiring deep visual reasoning like relative depth estimation and object counting. Let’s take a closer look at how it works.🧵[1/8]

English
4
4
39
4.9K
Dongping Chen retweetledi
Chenhao Zheng
Chenhao Zheng@Michael3014018·
Excited to share our #NeurIPS2024 spotlight: Acoustic Volume Rendering (AVR) for Neural Impulse Response Fields. AVR greatly improve the state-of-the-art in novel view spatial audio synthesis by introducing acoustic volume rendering. Listen with headphone for example below
English
1
6
17
2.5K
Dongping Chen retweetledi
Zixian Ma
Zixian Ma@zixianma02·
Excited to present Task Me Anything at: 📄Poster @ Wed 4:30pm, East Exhibit Hall 🎙️Oral presentation @ Video LM workshop at Sat 1pm, East Meeting Room 13 Come or reach out if you want to chat about multi-modal models, synthetic data, benchmarks etc💜 🔗 To save events: tinyurl.com/tma-poster-1211 tinyurl.com/tma-talk-1214
Jieyu Zhang@JieyuZhang20

Have trouble finding a benchmark for your use case? Introducing TaskMeAnything, a benchmark generation engine that creates VQA benchmarks on demand for assessing multimodal language models like GPT-4o. Website: task-me-anything.org

English
1
9
61
8K
Dongping Chen retweetledi
Mahtab Bigverdi
Mahtab Bigverdi@MahtabBg·
Introducing AURORA 🌟: Our new training framework to enhance multimodal language models with Perception Tokens; a game-changer for tasks requiring deep visual reasoning like relative depth estimation and object counting. Let’s take a closer look at how it works.🧵[1/8]
GIF
English
1
9
33
8.6K
Dongping Chen retweetledi
Yue Huang
Yue Huang@HowieH36226·
🌟 Exciting Research in LLMs! 🌟 1/ Our latest paper is now available! Discover how we're pushing the boundaries of AI to prioritize both honesty and helpfulness in LLMs.
Yue Huang tweet media
English
1
1
6
1.2K
Dongping Chen
Dongping Chen@Dongping0612·
@HowieH36226 Nice work. GUI understanding is a very important and timely topic👏👏
English
1
0
0
22