Jeongsoo Park
@jespark0
26 posts

PhD student @Cornell (previously PhD candidate @UMichCSE)
New York, NY · Joined May 2022
165 Following · 146 Followers

Pinned Tweet
Jeongsoo Park @jespark0:
Can AI image detectors keep up with new fakes? Mostly, no. Existing detectors are trained using a handful of models. But there are thousands in the wild! Our work, Community Forensics, uses 4800+ generators to train detectors that generalize to new fakes. #CVPR2025 🧵 (1/5)
Jeongsoo Park @jespark0:
Had a ton of fun presenting today at #CVPR2025! Thanks to everyone who came to my poster, and thank you for asking excellent questions!
Jeongsoo Park retweeted
Linyi Jin @jin_linyi:
Hello! If you are interested in dynamic 3D or 4D, don't miss oral session 3A at 9 am on Saturday: @zhengqi_li will be presenting "MegaSaM", I'll be presenting "Stereo4D", and @QianqianWang5 will be presenting "CUT3R".
Jeongsoo Park retweeted
Ayush Shrivastava @ayshrv:
Excited to share our CVPR 2025 paper on cross-modal space-time correspondence! We present a method to match pixels across different modalities (RGB-Depth, RGB-Thermal, Photo-Sketch, and cross-style images) — trained entirely using unpaired data and self-supervision. Our approach learns correspondences through contrastive random walks across visual modalities. #CVPR2025 (1/6)
Jeongsoo Park retweeted
Yiming Dou @_YimingDou:
Ever wondered how a scene sounds👂 when you interact👋 with it? Introducing our #CVPR2025 work "Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes" -- we make 3D scene reconstructions audibly interactive! yimingdou.com/hearing_hands/
Jeongsoo Park @jespark0:
Each image is labeled with detailed metadata, enabling more than just fake detection. We are excited to see what the community can build with this data! 🧵 (4/5)
Jeongsoo Park retweeted
Daniel Geng @dangengdg:
Hello! If you like pretty images and videos and want a rec for a CVPR oral session, you should definitely go to Image/Video Gen, Friday at 9 am: I'll be presenting "Motion Prompting", @RyanBurgert will be presenting "Go with the Flow", and @ChangPasca1650 will be presenting "LookingGlass".
Jeongsoo Park retweeted
Chris Rockwell @_crockwell:
Ever wish YouTube had 3D labels? 🚀Introducing🎥DynPose-100K🎥, an Internet-scale collection of diverse videos annotated with camera pose! Applications include camera-controlled video generation🤩and learned dynamic pose estimation😯 Download: huggingface.co/datasets/nvidi…
Jeongsoo Park retweeted
Ayush Shrivastava @ayshrv:
We present Global Matching Random Walks, a simple self-supervised approach to the Tracking Any Point (TAP) problem, accepted to #ECCV2024. We train a global matching transformer to find cycle consistent tracks through video via contrastive random walks (CRW).
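The tweet above describes finding cycle-consistent tracks via contrastive random walks. A minimal numpy sketch of the cycle-consistency objective, under my own assumptions (a two-frame round trip with precomputed point features; none of this is the paper's code): build softmax transition matrices A→B and B→A from feature similarities, chain them, and penalize any point that fails to walk back to itself.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cycle_loss(feat_a, feat_b, temperature=0.07):
    """Contrastive random-walk loss for one A -> B -> A cycle.

    feat_a, feat_b: (N, D) L2-normalized point features in two frames.
    Returns the mean cross-entropy between the round-trip transition
    matrix and the identity (each point is its own positive).
    """
    sim_ab = feat_a @ feat_b.T / temperature   # (N, N) affinities A->B
    sim_ba = feat_b @ feat_a.T / temperature   # (N, N) affinities B->A
    p_ab = softmax(sim_ab, axis=1)             # random-walk step A->B
    p_ba = softmax(sim_ba, axis=1)             # random-walk step B->A
    p_cycle = p_ab @ p_ba                      # round-trip A->B->A
    n = p_cycle.shape[0]
    # Cross-entropy with identity targets: -log P(return to start).
    return -np.log(p_cycle[np.arange(n), np.arange(n)] + 1e-12).mean()

rng = np.random.default_rng(0)
f = rng.normal(size=(16, 64))
f /= np.linalg.norm(f, axis=1, keepdims=True)
print(cycle_loss(f, f))  # low: every point walks back to itself
```

With matching features the loss is near zero; with unrelated features it approaches log N, which is what makes it a usable self-supervised training signal.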
Jeongsoo Park retweeted
Sarah Jabbour @SarahJabbour_:
📢Presenting 𝐃𝐄𝐏𝐈𝐂𝐓: Diffusion-Enabled Permutation Importance for Image Classification Tasks #ECCV2024 We use permutation importance to compute dataset-level explanations for image classifiers using diffusion models (without access to model parameters or training data!)
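For context on the classic idea DEPICT builds on: permutation importance shuffles one feature across the dataset and measures how much a trained model's accuracy drops; a large drop means the model relied on that feature. A toy tabular sketch (my own illustration on made-up data, not the paper's diffusion-based image version):

```python
import numpy as np

def permutation_importance(model, X, y, n_repeats=10, rng=None):
    """Mean accuracy drop when each feature column is shuffled."""
    if rng is None:
        rng = np.random.default_rng(0)
    base = (model(X) == y).mean()
    drops = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        for _ in range(n_repeats):
            Xp = X.copy()
            Xp[:, j] = rng.permutation(Xp[:, j])  # break feature j's link to y
            drops[j] += base - (model(Xp) == y).mean()
    return drops / n_repeats

# Toy data: the label depends only on feature 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
y = (X[:, 0] > 0).astype(int)
model = lambda X: (X[:, 0] > 0).astype(int)  # stand-in "trained" model

imp = permutation_importance(model, X, y, rng=rng)
# Feature 0 shows a large accuracy drop; features 1 and 2 show none.
```

The tweet's twist is computing this dataset-level importance for image classifiers via diffusion models, without model parameters or training data; the mechanics above are only the underlying permutation idea.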
Jeongsoo Park retweeted
Sarah Jabbour @SarahJabbour_:
This year I'm organizing ML4H Outreach program, and want to highlight our Author Mentorship program. Whether you're a mentee looking for guidance or a more experienced researcher with time to mentor, we'd love to have you be a part of this program! Deadline to apply is July 5!
ML4H @SymposiumML4H:

Are you planning to submit a paper to ML4H 2024 and would like to receive mentorship from senior scientists? Are you interested in mentoring early-stage researchers? Sign up for the ML4H Submission Mentorship Program by July 5th AoE! Program details: ahli.cc/ml4h/mentorshi…

Jeongsoo Park retweeted
Ziyang Chen @CzyangChen:
These spectrograms look like images, but can also be played as a sound! We call these images that sound. How do we make them? Look and listen below to find out, and to see more examples!
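The "played as a sound" part works because a grayscale image can be read as a magnitude spectrogram and inverted to a waveform. A bare-bones Griffin-Lim sketch in numpy (generic spectrogram inversion under my own STFT settings, not the paper's generation pipeline):

```python
import numpy as np

def stft(x, n_fft=512, hop=128):
    win = np.hanning(n_fft)
    frames = [x[i:i + n_fft] * win
              for i in range(0, len(x) - n_fft + 1, hop)]
    return np.fft.rfft(np.array(frames), axis=1).T   # (freq, time)

def istft(S, n_fft=512, hop=128):
    win = np.hanning(n_fft)
    frames = np.fft.irfft(S.T, n=n_fft, axis=1)
    x = np.zeros(hop * (frames.shape[0] - 1) + n_fft)
    norm = np.zeros_like(x)
    for i, f in enumerate(frames):                    # overlap-add
        x[i * hop:i * hop + n_fft] += f * win
        norm[i * hop:i * hop + n_fft] += win ** 2
    return x / np.maximum(norm, 1e-8)

def griffin_lim(mag, n_iters=32, n_fft=512, hop=128):
    """Recover a waveform whose STFT magnitude matches `mag` (freq, time)."""
    rng = np.random.default_rng(0)
    phase = np.exp(2j * np.pi * rng.random(mag.shape))
    for _ in range(n_iters):
        x = istft(mag * phase, n_fft, hop)            # waveform from guess
        phase = np.exp(1j * np.angle(stft(x, n_fft, hop)))  # keep phase only
    return istft(mag * phase, n_fft, hop)

# "Image" with one bright horizontal row -> a steady tone at that frequency.
img = np.zeros((257, 64))
img[40, :] = 1.0
audio = griffin_lim(img)
```

The interesting part of the paper is making such spectrograms simultaneously look like natural images; the inversion step itself is standard, as above.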
Jeongsoo Park retweeted
Yiming Dou @_YimingDou:
NeRF captures visual scenes in 3D👀. Can we capture their touch signals🖐️, too? In our #CVPR2024 paper Tactile-Augmented Radiance Fields (TaRF), we estimate both visual and tactile signals for a given 3D position within a scene. Website: dou-yiming.github.io/TaRF/ arXiv: arxiv.org/abs/2405.04534 Huge thanks to my collaborators Fengyu Yang, Yi Liu and advisors @andrewhowens @antoniloq !!!
Jeongsoo Park retweeted
Daniel Geng @dangengdg:
What do you see in these images? These are called hybrid images, originally proposed by Aude Oliva et al. They change appearance depending on size or viewing distance, and are just one kind of perceptual illusion that our method, Factorized Diffusion, can make.
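In their classic (Oliva et al.) form, hybrid images combine the low spatial frequencies of one image with the high frequencies of another, so which one you perceive depends on viewing distance. A minimal sketch of that classic construction with scipy Gaussian filtering (this is the original recipe, not the tweet's Factorized Diffusion method):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def hybrid_image(img_far, img_near, sigma=6.0):
    """Low frequencies of img_far + high frequencies of img_near.

    Up close (or at full size) img_near's fine detail dominates; from a
    distance (or downscaled) the blurred img_far takes over. Both inputs
    are float 2-D arrays of the same shape.
    """
    low = gaussian_filter(img_far, sigma)               # low-pass
    high = img_near - gaussian_filter(img_near, sigma)  # high-pass residual
    return low + high

rng = np.random.default_rng(0)
a = rng.random((128, 128))
b = rng.random((128, 128))
h = hybrid_image(a, b)
```

Note the sanity property that `hybrid_image(x, x)` returns `x` exactly, since the low-pass and high-pass parts sum back to the original.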
Jeongsoo Park retweeted
Daniel Geng @dangengdg:
Can you make a jigsaw puzzle with two different solutions? Or an image that changes appearance when flipped? We can do that, and a lot more, by using diffusion models to generate optical illusions! Continue reading for more illusions and method details 🧵
Saurabh Kumar @drummatick:
On another note, do you think any representation of an image would work? For instance, consider a one-way encryption that maps the image to some hash, and assume no collisions. Should we then, by that logic, get the same benchmark performance as when we train on RGB (or JPEG, in this case)?
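One intuition for why a hash likely would not work (my own hedged illustration, not a claim from the thread): cryptographic hashes are designed so that flipping a single input bit changes roughly half the output bits, destroying the locality and smoothness that gradient-based learning exploits, whereas JPEG's DCT coefficients are a linear, invertible transform of the pixels. A quick demonstration of that avalanche effect:

```python
import hashlib
import numpy as np

def bit_diff_fraction(h1: bytes, h2: bytes) -> float:
    """Fraction of differing bits between two equal-length digests."""
    diff = np.frombuffer(h1, np.uint8) ^ np.frombuffer(h2, np.uint8)
    return np.unpackbits(diff).mean()

img = np.zeros((8, 8), dtype=np.uint8)
h_orig = hashlib.sha256(img.tobytes()).digest()

img[0, 0] ^= 1  # flip a single low-order bit of one "pixel"
h_flip = hashlib.sha256(img.tobytes()).digest()

print(bit_diff_fraction(h_orig, h_flip))  # ~0.5: digests nearly uncorrelated
```

So two visually identical images get unrelated hashes: there is no structure left for a network to generalize from, even though the map is injective in principle.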
Jeongsoo Park @jespark0:
Do we need RGB to train neural networks? We skip decoding JPEG to RGB, directly feed the encoded JPEG to ViT, and speed up train/eval by up to 39.2%/17.9% without accuracy loss! Check out our poster on Thu-PM-165 in #CVPR2023! (work w/ @jcjohnss) bit.ly/3qRwToV
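The "skip decoding JPEG to RGB" idea can be approximated as tokenizing blockwise DCT coefficients. A rough numpy/scipy sketch (I simulate the 8×8 block DCTs that JPEG stores rather than parsing a real JPEG bitstream, and omit the ViT; none of this is the paper's code):

```python
import numpy as np
from scipy.fft import dctn

def jpeg_like_tokens(img):
    """Turn a grayscale image into per-block DCT-coefficient tokens.

    JPEG stores 8x8 blockwise DCT coefficients; feeding these to a ViT
    (rather than decoded RGB pixels) skips the inverse DCT and color
    conversion. Here we simulate that by DCT-ing each 8x8 block,
    yielding one 64-dim token per block.
    """
    h, w = img.shape
    assert h % 8 == 0 and w % 8 == 0
    blocks = img.reshape(h // 8, 8, w // 8, 8).transpose(0, 2, 1, 3)
    coefs = dctn(blocks, axes=(-2, -1), norm="ortho")  # 2-D DCT per block
    return coefs.reshape(-1, 64)  # (num_blocks, 64) token sequence

img = np.random.default_rng(0).random((32, 32))
tokens = jpeg_like_tokens(img)   # 16 tokens of 64 DCT coefficients each
```

Since the orthonormal DCT is invertible and energy-preserving, these tokens carry exactly the information of the pixels, which is consistent with the tweet's report of speedups without accuracy loss.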