Ana Marasović

2.3K posts

Ana Marasović

@anmarasovic

Asst prof @UUtah · Ex @allen_ai @uwnlp @HD_NLP · she/her 🇭🇷

Salt Lake City Katılım Nisan 2014

597 Takip Edilen5.7K Takipçiler

Ana Marasović retweetledi

Farhan Ishmam@aplycaebous·3d

First paper of my PhD at the University of Utah, with Prof @Kenneth_Marino. Super excited to finally share what we've been working on at SparkLab. Meet TimeWarp ⏳, a benchmark that tests web agents by sending them back in time through 6 eras of web UI design. Thread 🧵(1/5)

English

Ana Marasović retweetledi

Kenneth Marino@Kenneth_Marino·4 Mar

It’s been less than a year since I started my lab (SPARK Lab) at @UUtah we already have a ton of new stuff that I can’t wait to talk about soon. Stay tuned for more. I’ll start today by sharing that our updated Computer Use Survey blog has been accepted to ICLR Blogposts 2026. Collaboration with my student @aplycaebous and Utah colleague @anmarasovic.

English

904

Ana Marasović retweetledi

Kenneth Marino@Kenneth_Marino·4 Mar

Wanted to share with the CU community that our updated Computer Use Survey blog has been accepted to ICLR Blogposts 2026. Collaboration with my student @aplycaebousand Utah colleague @anmarasovic.

English

923

Ana Marasović retweetledi

NAACL@naacl·31 Ara

Happy new year #NAACL! The 2026 election results are here. Congrats🥳 Chair: Anna Rumshisky @arumshisky Secretary: Jessy Li @jessyjli Board members: Muhao Chen @muhao_chen, Francisco (Paco) Guzmán, Ana Marasović @anmarasovic naacl.org/posts/2025-12-… Thank you all for voting!

English

4.7K

Ana Marasović retweetledi

Scientific Computing and Imaging Institute@uusci·11 Ara

💫 On the heels of announcing 12 new faculty fellows last week, SCI's One-U Responsible AI Initiative is excited to add 3 new postdoctoral fellows to its team next year: rai.utah.edu/postdocs-dec-2…

English

453

Ana Marasović retweetledi

Martian@withmartian·7 Ara

$1,000,000 to understand how LLMs write code. Announcing: The Martian Interpretability Challenge. Understanding the inner workings of LLMs is the greatest scientific challenge of our age,. Let's solve it. Apply here: withmartian.com/prize 🧵👇

English

157

30.9K

Ana Marasović retweetledi

Freda Shi@fredahshi·19 Kas

Going through statements from NAACL board candidates. Resonating a lot with many statements, and especially love the one from @anmarasovic! naacl.org/elections/2026…

English

4.5K

Ana Marasović@anmarasovic·15 Kas

@kthai1618 @max_spero_ @gneubig @jennajrussell OH, thanks for sharing!

English

Katherine Thai@kthai1618·15 Kas

@anmarasovic @max_spero_ @gneubig This is an aside but work from my labmate @jennajrussell actually shows the opposite regarding “people suck at detecting generated content” if they’re frequent users of ChatGPT!

Jenna Russell@jennajrussell

People often claim they know when ChatGPT wrote something, but are they as accurate as they think? Turns out that while general population is unreliable, those who frequently use ChatGPT for writing tasks can spot even "humanized" AI-generated text with near-perfect accuracy 🎯

English

103

Graham Neubig@gneubig·14 Kas

Has anyone run LLM detection on all ICLR papers and reviews? If not I'm willing to offer a bounty of $50 to the first person who does it (well). Happy to have any other people chip in 😃

ICLR 2026@iclr_conf

This paper has been desk rejected. LLM-generated papers that hallucinate references and do not report LLM usage will be desk rejected per ICLR policy (blog.iclr.cc/2025/08/26/pol…) Reviewers of other versions of this submission have been notified.

English

143

41.1K

Ana Marasović@anmarasovic·15 Kas

@max_spero_ @gneubig @kthai1618 Thanks!!

English

Max Spero@max_spero_·15 Kas

Much of the research in the past has been on evaluating fully human vs. fully AI (see third part evals here: pangram.com/blog/third-par…) But of course we expect a lot more ai assistance and mixed. @kthai1618's work on EditLens is what we ran on reviews, because we expect significant AI assistance. Some more stats there

English

502

Ana Marasović@anmarasovic·15 Kas

@zehavoc @gneubig @max_spero_ I checked my reviews which are English-smoothed-by-LLMs and flagged them all as no AI detected / fully human written

English

Djamé..@zehavoc·15 Kas

@gneubig @max_spero_ How about English-smoothed-by-LLMs reviews? In what categories do they fall ?

English

645

Graham Neubig@gneubig·15 Kas

ICLR authors, want to check if your reviews are likely AI generated? ICLR reviewers, want to check if your paper is likely AI generated? Here are AI detection results for every ICLR paper and review from @pangramlabs! It seems that ~21% of reviews may be AI?

English

525

311.6K

Ana Marasović@anmarasovic·15 Kas

@TuhinChakr @yanaiela @max_spero_ @bradley_emi @pangramlabs I checked my recent reviews and it does say no AI detected / fully-human written 😮‍💨 (lightly edited would be more correct tho)

English

136

Tuhin Chakrabarty@TuhinChakr·15 Kas

@anmarasovic @yanaiela @max_spero_ @bradley_emi No thats entirely what happens. Ofcourse I cant speak for other commercial detectors but you should try @pangramlabs Also read their new paper on polish vs fully Ai generated text arxiv.org/pdf/2510.03154

English

160

Tuhin Chakrabarty@TuhinChakr·15 Kas

@max_spero_ @bradley_emi saving our society from AI slop 👏👏 Also shame on the 20% who used AI to write ICLR reviews. There needs to be consequences for such egregious behavior!!!

Orion Weller@orionweller

Very interesting results of panagram LLM detection on ICLR reviews and papers 👀 Thanks so much @gneubig @max_spero_ @bradley_emi 20% AI generated reviews 🫠

English

2.1K

Ana Marasović@anmarasovic·15 Kas

@TuhinChakr @yanaiela @max_spero_ @bradley_emi @pangramlabs Sweet, will do

English

Ana Marasović@anmarasovic·15 Kas

@TuhinChakr @yanaiela @max_spero_ @bradley_emi Got it. I'm not familiar with these detection methods so I was think that they don't distinguish between highly generated review because (1) pdf -> review -> copy, and (2) review -> polished text -> copy this text. My sense was that both would be flagged as highly generated?

English

104

Tuhin Chakrabarty@TuhinChakr·15 Kas

@anmarasovic @yanaiela @max_spero_ @bradley_emi @anmarasovic polishing is perfectly fine. As you can see pangram posted edit stats too. I am speaking of Fully AI generated review where you upload pdf n just copy paste what AI writes

English

108

Ana Marasović@anmarasovic·15 Kas

@yanaiela @TuhinChakr @max_spero_ @bradley_emi +1 to this, I don't do notes -> review, but review -> polished English, and it saves me a ton of time and energy

English

Yanai Elazar@yanaiela·15 Kas

@TuhinChakr @max_spero_ @bradley_emi yes, but we it took longer 🙂

English

202

Ana Marasović@anmarasovic·15 Kas

@max_spero_ @gneubig Tyu! I'm not familiar with detection methods. Can you share about their quality/trustworthiness? I know people suck at detecting generated content but looking at 100% generated reviews, I wonder are they really purely prompt -> review. E.g. this one: openreview.net/forum?id=evNfQ…

English

768

Max Spero@max_spero_·15 Kas

@gneubig threw something together iclr.pangram.com/submissions please peruse the data!

English

30.8K

Ana Marasović@anmarasovic·12 Kas

arxiv.org/abs/2507.06329

ZXX

268

Ana Marasović@anmarasovic·12 Kas

youtu.be/w6LNmADnlNw?si…

YouTube

ZXX

455

Ana Marasović@anmarasovic·12 Kas

@mclemcrew's 's CoLM spotlight is now available on YT! 🎵 Link below.

English

608

Ana Marasović@anmarasovic·10 Kas

@WiAIR_podcast Thanks!!

English

235

Women in AI Research WiAIR@WiAIR_podcast·9 Kas

Congratulations to @anmarasovic - our recent guest at #WiAIRpodcast - for the outstanding paper at EMNLP 2025! 👏👏👏

Ana Marasović@anmarasovic

Thrilled to see this work recognized at #EMNLP2025! This framework and approach to measuring CoT faithfulness have been hugely influential for how I think about reasoning evaluation, and I'm so lucky to have worked with such brilliant collaborators. Huge credit to @mtutek

English

504

Ana Marasović@anmarasovic·8 Kas

@KrishnaPillutla @mtutek Thanks!

English