Jesse Vig (@jesse_vig) - Twitter Profili | Zamantika Mersobahis Locabet

Jesse Vig@jesse_vig·4 Kas

Congratulations @meetdavidwan for this great collaboration between Salesforce Research and UNC! @JotyShafiq @mohitban47

How does positional bias influence faithfulness in long-form summarization (LFSumm)? 🤔 In our latest study, we analyze the effect of positional bias on faithfulness metrics and generated summaries, and methods to mitigate this bias. 📄Paper: arxiv.org/abs/2410.23609 Thead🧵👇

English

1

4

30

3.9K

Jesse Vig@jesse_vig·26 Eyl

@YesThisIsLion @tunguz Can't quite place it...

English

0

22

Llion Jones@YesThisIsLion·26 Eyl

@tunguz @jesse_vig looks familiar :)

English

1

0

401

Bojan Tunguz@tunguz·25 Eyl

I just got a copy of “Large Language Models: A Deep Dive.” I’ve been planning for a while to do just that with LLMs - delve deeper. ;) This books seems like an excelent up-to-date (as much as that is possible these days). Overview of this fascinating and important subject. Thanks Uday Kamath for sending this one to me! amzn.to/4ewmzql #AI #GenAI #LLM #LLMs

English

25

97

1K

82.3K

Jesse Vig retweetledi

Philippe Laban@PhilippeLaban·23 Şub

Excited to share this fun new work on the 🩴FlipFlop Effect. In short: if you ask models if they're sure of their answers, they tend to change their minds (and severely degrade accuracy). What's mindblowing is how universal the effect is across LLMs (GPTs, Gemini, Claudes, …).

Caiming Xiong@CaimingXiong

Excited to share a new preprint on the 🩴FlipFlop Effect. We prompt LLMs with a classification task, and challenge the model by following up with “Are you sure?”. The model can confirm or flip its answer. The results? More flips than a gymnastics competition! 🤸‍♂️ 1/N

English

2

8

35

4.4K

Jesse Vig retweetledi

Caiming Xiong@CaimingXiong·23 Şub

Excited to share a new preprint on the 🩴FlipFlop Effect. We prompt LLMs with a classification task, and challenge the model by following up with “Are you sure?”. The model can confirm or flip its answer. The results? More flips than a gymnastics competition! 🤸‍♂️ 1/N

English

4

31

140

20K

Jesse Vig@jesse_vig·22 Tem

@krandiash @HazyResearch Congrats!

English

0

1

99

Karan Goel@krandiash·19 Tem

Successfully defended my PhD yesterday, one of the most fun experiences of my life (barring Covid) thanks to @HazyResearch Time for more fun stuff

English

38

12

424

59.7K

Jesse Vig@jesse_vig·11 Tem

@YesThisIsLion Thank you Llion!

English

0

14

Llion Jones@YesThisIsLion·11 Tem

@jesse_vig Congrats!

English

1

0

1

78

Jesse Vig@jesse_vig·10 Tem

How can we teach models to simplify text using the revision history of Wikipedia articles? Check out our paper "SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages" presented by @PhilippeLaban at #acl2023NLP (poster session 5). 🎉

English

1

11

42

6.1K

Jesse Vig retweetledi

Caiming Xiong@CaimingXiong·7 Tem

🤔Which words in your prompt are most helpful to language models? In our #ACL2023NLP paper, we explore which parts of task instructions are most important for model performance. 🔗 arxiv.org/abs/2306.01150 Code: github.com/fanyin3639/Ret…

English

3

43

199

27.7K

Jesse Vig@jesse_vig·27 Haz

Check out our latest work on text simplification at #acl2023nlp .

Philippe Laban@PhilippeLaban

Very excited to present SWiPE in person at ACL in a few weeks. In short, we collaborated with Wikipedia editors to understand the process of document simplification and take a (small) step towards improving document accessibility by releasing a large dataset!

English

0

4

773

Jesse Vig retweetledi

WikiResearch@WikiResearch·26 Haz

"SWIPE: A Dataset for Document-Level Simplification of Wikipedia Pages" leveraging the entire revision history when pairing enwiki/simplewiki pages, to identify simplification edits. (Laban et al, 2023) arxiv.org/pdf/2305.19204… @iam_wkr

English

0

9

32

4.2K

Jesse Vig retweetledi

Wojciech Kryściński@iam_wkr·18 Haz

Check out our work on text simplification! SWiPE accepted at #ACL2023

Caiming Xiong@CaimingXiong

Finding a document too dense to decipher? 🤔Content a bit convoluted? Essay too esoteric? Check how we simplify and improve document readability using SWiPE. Join us in making knowledge accessible to all! 🌐 🔗Paper: arxiv.org/abs/2305.19204 🔗Github: github.com/salesforce/sim…

English

0

3

14

2.1K

Jesse Vig retweetledi

Caiming Xiong@CaimingXiong·16 Haz

By aligning Wikipedia articles to their simplified versions on Simple Wikipedia, we reconstruct the process by which human editors simplify whole documents, in contrast to prior work focused on sentence-level simplification.

English

1

6

719

Jesse Vig retweetledi

Caiming Xiong@CaimingXiong·16 Haz

Finding a document too dense to decipher? 🤔Content a bit convoluted? Essay too esoteric? Check how we simplify and improve document readability using SWiPE. Join us in making knowledge accessible to all! 🌐 🔗Paper: arxiv.org/abs/2305.19204 🔗Github: github.com/salesforce/sim…

GIF

English

1

14

48

18K

Jesse Vig retweetledi

Yixin Liu@YixinLiu17·12 Haz

Delighted to announce our paper has been accepted for an oral presentation at #ACL2023 oral! In this work we emphasize the intricate complexity of human evaluation while it is becoming even more crucial for both model training and evaluation in the LLM era.

Alex Fabbri@alexfabbri4

🚨🆕📄🚨 How gold is your human evaluation? We seek the answer, and its implications in the GPT3 era, in our preprint “Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation” Paper: arxiv.org/abs/2212.07981 Equal contribution @YixinLiu17

English

1

6

40

8.7K

Jesse Vig@jesse_vig·24 Nis

How can NLP help us understand the diversity of news coverage of a topic? Check out the latest work from @PhilippeLaban et al. appearing at #CHI2023 this week.

Philippe Laban@PhilippeLaban

📰 How can we make it easier for news readers to access nuanced and diverse coverage from multiple sources? In our #CHI2023 paper, we propose to highlight news coverage diversity through generated *discord questions* shown to the readers. Link: bit.ly/41yviRP

English

0

3

553

Jesse Vig retweetledi

Wojciech Kryściński@iam_wkr·19 Ara

Very excited to have the opportunity to present research done at @SFResearch on Automatic Text Summarization at @ZIL_IPIPAN „Long Story Short: A Talk about Text Summarization” will cover the current state of the field, existing challenges, and future directions.

English

1

2

22

3.1K

Jesse Vig retweetledi

Chien-Sheng (Jason) Wu@jasonwu0731·16 Ara

Human preference != Job Done. Check our interesting findings (ROSE🌹) on summarization! Thanks to my fantastic collaborators @alexfabbri4 @YixinLiu17 @stefan_fee Yilun Zhao Linyong Nan Ruilin Han @HanSineng @JotyShafiq @CaimingXiong @dragomir_radev!

Alex Fabbri@alexfabbri4

🚨🆕📄🚨 How gold is your human evaluation? We seek the answer, and its implications in the GPT3 era, in our preprint “Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation” Paper: arxiv.org/abs/2212.07981 Equal contribution @YixinLiu17

English

0

3

17

2.1K

Jesse Vig retweetledi

Alex Fabbri@alexfabbri4·16 Ara

You can explore the ACU annotations in Rose🌹along with protocol results on our demo page and start using our dataset! Repo: github.com/Yale-LILY/ROSE Demo page: yale-lily.github.io/ROSE/ Dataset: huggingface.co/datasets/Sales…

English

1

2

7

651

Jesse Vig retweetledi

Alex Fabbri@alexfabbri4·16 Ara

🚨🆕📄🚨 How gold is your human evaluation? We seek the answer, and its implications in the GPT3 era, in our preprint “Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation” Paper: arxiv.org/abs/2212.07981 Equal contribution @YixinLiu17

English

5

20

96

28.3K

Jesse Vig

Keşfet