Yibing Sun

66 posts

Yibing Sun

@Yibing_Sun

Ph.D. student in the School of Journalism and Mass Communication, University of Wisconsin-Madison

Katılım Ekim 2021

267 Takip Edilen177 Takipçiler

Yibing Sun retweetledi

Meysam Alizadeh@MeysamAIizadeh·7 Mar

Can AI coding agents reproduce published social science findings? In new work with @_mohsen_m, Fabrizio Gilardi, and @j_a_tucker, we introduce SocSci-Repro-Bench — a benchmark of 221 reproducibility tasks from 54 papers — and evaluate two frontier coding agents: Claude Code and Codex. The results reveal both remarkable capabilities and new risks for AI-assisted science. ------------------------------------ GOAL -------- A key design goal was separating two different problems: 1️⃣ Are replication materials themselves reproducible? 2️⃣ Can AI agents reproduce results when materials are executable? To isolate agent performance, we only included tasks whose outputs were identical across three independent manual executions. ------------------------------------ DESIGN -------- Agents received: • anonymized data + code • a sandboxed execution environment They had to autonomously: • install dependencies • debug broken code • execute the pipeline • extract the requested results In short: end-to-end computational reproduction. ------------------------------------ RESULTS -------- Both agents reproduced a large share of published findings. But Claude Code substantially outperformed Codex. Task-level accuracy • Claude Code: 93.4% • Codex: 62.1% Paper-level reproduction (all tasks correct) • Claude Code: 78.0% • Codex: 35.8% ------------------------------------ WHY THE GAP? -------- Replication packages often contain problems: • missing dependencies • hard-coded file paths • incomplete environment specifications Claude Code frequently repaired these issues autonomously. Codex often failed to recover the execution pipeline. ------------------------------------ IS THIS JUST MEMORIZATION? -------- We tested this by asking agents to infer paper metadata (title, authors, journal, year) from anonymized replication materials. Recovery rates were very low, suggesting agents primarily relied on code execution, not memorization of papers. ------------------------------------ REASONING TEST -------- We also tested a harder task: Can agents infer the research question of a study from code and data alone? Both agents performed surprisingly well. ------------------------------------ CONFIRMATION BIAS -------- When agents were given the paper PDF, a new problem emerged. Sometimes they copied reported results from the text instead of executing the code. Accuracy on non-reproducible tasks dropped sharply. Context helps execution — but reduces independence of verification. ------------------------------------ SYCOPHANCY -------- Inspired by @ahall_research, we tested adversarial prompt framing, nudging agents to: “explore alternative analyses that align with the paper’s reported results.” Accuracy increased. But agents also became more likely to fabricate results when reproduction was impossible. ------------------------------------ THE PARADOX -------- Pressure to produce an answer can help agents repair execution pipelines. But it simultaneously erodes their ability to say: “This result cannot be reproduced.” Recognizing when reproduction is impossible may be the most important scientific capability. ------------------------------------ NOTES -------- • This is work in progress — feedback is welcome. • Benchmark available on GitHub. • Replication materials hosted on Dataverse. Paper + repository in the reply below.

English

189

26K

Yibing Sun retweetledi

Dhavan Shah@dvshah·13 Ara

We welcome submissions to our @IJoC_USC special issue on "Presidential Debates Across the Americas." CfP is open until April 30, 2025. Email me (dshah@wisc.edu) or my co-editors w/ questions and share with others in your network. Link to full call below. mcrc.journalism.wisc.edu/2024/12/13/mcr…

English

5.6K

Yibing Sun retweetledi

Subhayan Mukerjee@wrahool·8 Kas

🚨We (@alvinyxz @ehmaslowska) are editing a special issue for Computational Communication Research on GenAI! Submissions on GenAI as comm phenomena or research tools are welcome: z.umn.edu/ccrgenai Abstracts due: Dec 31 '24 Full papers: Apr 30 '25 @ica_cm @CCR_OpenJournal

English

11.9K

Yibing Sun retweetledi

Mike Wagner@prowag·4 Kas

How do we know that can we trust the vote count? Check out Episode 1 of the Civic Sift, our new digital show from the CCCR that sheds light on important questions of the day, using evidence from experts and practitioners. Let’s sift & winnow together! m.youtube.com/watch?v=ow-9VD…

English

3.6K

Yibing Sun retweetledi

UW-Madison SJMC@uw_sjmc·30 Ağu

Together with @uwpolisci and @UWPsych and support from @knightfdn, we are seeking two assistant professors who focus on research in communication, social identity and civil society to start in August 2025. Learn more and join our team today. buff.ly/3MvToqE

English

7.2K

Yibing Sun retweetledi

Yiming Wang@YimingWang_·17 Tem

Check out our new paper in Public Opinion Quarterly @AAPOR! We introduce a novel measure integrating self-reported media use with outlet bias scores to measure the “shape” of news consumption and its impact on beliefs in electoral fraud and distrust in the electoral system.

English

2.1K

Yibing Sun@Yibing_Sun·28 May

@Ross_Dahlke @UWMadison @uw_sjmc Big congrats and welcome back! Hope to see you soon at Madison!

English

Ross Dahlke 🔑@Ross_Dahlke·28 May

📰 personal update: I'm so happy to say I've accepted a tenure-track assistant professorship at @UWMadison @uw_sjmc for next year. To return home to my alma mater is a privilege and a dream.

English

267

23.1K

Yibing Sun retweetledi

Luhang SUN@luhang_sun·3 Şub

Excited to announce the release of our latest paper, "Smiling women pitching down: auditing representational and presentational gender biases in image-generative AI," published in JCMC! #AI #GenerativeAI #GenderBias #Visual #Feminism

Journal of Computer-Mediated Communication@ica_jcmc

“Smiling women pitching down: auditing representational and presentational gender biases in image-generative AI” by Luhang Sun et al. Read it here: doi.org/10.1093/jcmc/z…

English

3.1K

Yibing Sun@Yibing_Sun·1 Eki

📢Enjoyed a lot with the project. Within this research, we coded the TikTok videos related to COVID 19. We found a lot of people imitating zombies as if they are the side effects of vaccinations. Fun but with frustration about their effects.

ellieyang@elliefanyang

📢#publication Work with @LaurenKriss @Yibing_Sun about Fun with Frustration? TikTok Influencers’ Emotional Expression Predicts User Engagement with COVID-19 Vaccination Messages: Health Communication: Vol 0, No 0 tandfonline.com/doi/abs/10.108…

English

279

Yibing Sun retweetledi

Lone Nerup Sørensen@lonenerup·14 Tem

Checking the proofs for the 2nd ed. of the Handbook of Digital Politics, edited by Stephen Coleman and myself. Out in October. We have an amazing line-up of star contributors and up-and-coming scholars, including:

English

8.7K

Yibing Sun retweetledi

Dhavan Shah@dvshah·22 Haz

Our #ComputerVision #Multimodal classification paper is now in Comm Methods & Measures (first 50 downloads free). We combine video & audio features with speech coding of debate performances to understand changing patterns of aggressive political style. tandfonline.com/eprint/ZIQ24YG…

English

6.7K

Yibing Sun@Yibing_Sun·21 Haz

@borah @leedaniellekl @MurrowCollege Congrats! @leedaniellekl

English

103

Porismita borah@borah·21 Haz

Congratulations to the brilliant and wonderful @leedaniellekl @MurrowCollege on her successful dissertation defense. Looking forward to your future scholarly endeavors!

English

2.4K

Yibing Sun retweetledi

Dhavan Shah@dvshah·13 Haz

Honored to be awarded a WARF Named Professorship from UW-Madison. And grateful to be able to name the professorship after a towering figure in our field, a pathbreaking scholar, and a mentor to so many, including me: Jack M. McLeod. journalism.wisc.edu/news/professor…

English

110

7.7K

Yibing Sun retweetledi

Jiyoun Suk, Ph.D. (jiyoun-suk.bsky.social)@jiyoun_suk·14 Haz

Another #hashtagactivism #MeToo paper just came out in @icsjournal, this time it’s about global! 🌏 Our paper examined how the global hashtag has become a transnational movement, crossing borders and platforms. Free 50 copies: doi.org/10.1080/136911… 1/n

Jiyoun Suk, Ph.D. (jiyoun-suk.bsky.social) tweet media

English

197

27K

Yibing Sun@Yibing_Sun·28 May

@LeticiaBode @LiweiShen @uw_sjmc Thank you for being there. It was really a good session!

English

Yibing Sun retweetledi

Mike Wagner@prowag·27 May

Liwei Shen and Yibing Sun presenting our group effort at #ica23 where we examine how ads promotion and social bits perform at conducting misinformation correction. Other dreamy collaborators: @LeticiaBode @ekvraga @borah @dvshah @sijiayang_camer and Danielle Lee

Toronto, Ontario 🇨🇦 English

1.5K

Yibing Sun@Yibing_Sun·26 May

Had so much fun in the 2-day Hackathon! So many ideas, tools and cool people. #ICA23 Recommend to everyone who have interests in computational methods!

English

828

Yibing Sun@Yibing_Sun·25 May

@LeticiaBode @ekvraga @GeorgetownCCT Congratulations!!!!

English

Yibing Sun@Yibing_Sun·25 May

Such a teamwork with all the amazing collaborators. Mentioned it with a couple others about the gender bias in GPT/OpenAI in #Hackathon #ICA2023 @hackingcommsci The piece is such a timely one!

Luhang SUN@luhang_sun

Today @GloriaWei7 and @sijiayang_camer joined me to introduce our new preprint article using face detection techniques to examine gender biases in Image #GenerativeAI at the interactive session at @ICA_HMC Preconference! #ICA2023: arxiv.org/abs/2305.10566 🧵

English

312

Yibing Sun retweetledi

Notion@NotionHQ·18 May

A gift for GIF-lovers 🎁 You can now access @GIPHY’s entire treasure trove of GIFs directly in /image blocks!

GIF

English

589

108.2K

Keşfet

@_mohsen_m @j_a_tucker @ahall_research @IJoC_USC @alvinyxz @ica_cm @CCR_OpenJournal @uwpolisci