
🚨 New Paper Alert! 🚨
How can we align language models without drowning in prompt engineering or falling into reward hacking traps?
We introduce Meta Policy Optimization (MPO), a new reinforcement learning framework that evolves its reward model's rubrics through meta-level reflection. Inspired by metacognition and evaluative thinking, MPO trains models to think about how they evaluate, not just what they generate.
🔥 Why it matters:
✔️ Boosts stability and robustness in RLAIF
✔️ Reduces human labor in prompt crafting
✔️ Generalizes across tasks: essay writing, summarization, and ethical and mathematical reasoning
Check it out: huggingface.co/papers/2504.20…
Big thanks to co-authors @chanwoopark20 (MIT), @_vipulraheja (Grammarly), and @dongyeopkang (UMN)!
#AI #LLMs #ReinforcementLearning #MetaLearning #NLP #Alignment #RLHF #RLAIF #EvaluativeThinking #PromptEngineering