Rattana Pukdee

106 posts

Rattana Pukdee

@rpukdeee

PhD student at @mldcmu 🐕‍🦺

Pittsburgh Katılım Nisan 2014

268 Takip Edilen56 Takipçiler

Rattana Pukdee retweetledi

Yuda Song @ ICLR 2026@yus167·5 Şub

What is the right algorithm for LLM RL? Maybe we should start with rethinking what the right objective is. Introducing MaxRL, led by the amazing @FahimTajwar10, @guanningzeng, and @Yueer_Zhou 🧵(1/n)

Fahim Tajwar@FahimTajwar10

Are we done with new RL algorithms? Turns out we might have been optimizing the wrong objective. Introducing MaxRL, a framework to bring maximum likelihood optimization to RL settings. Paper + code + project website: zanette-labs.github.io/MaxRL/ 🧵 1/n

English

3.5K

Rattana Pukdee retweetledi

Yuda Song @ ICLR 2026@yus167·3 Şub

RL on LLMs inefficiently uses one scalar per rollout. But users regularly give much richer feedback: "make it formal," "step 3 is wrong." Can we train LLMs on this human-AI interaction? We introduce RL from Text Feedback, with 1) Self-Distillation; 2) Feedback Modeling (1/n) 🧵

English

101

601

106.6K

Rattana Pukdee retweetledi

Dylan Sam@dylanjsam·16 Eyl

🚨Excited to introduce a major development in building safer language models: Safety Pretraining! Instead of post-hoc alignment, we take a step back and embed safety directly into pretraining. 🧵(1/n)

English

357

62.5K

Rattana Pukdee retweetledi

Jennifer Hsia@jen_hsia·16 Tem

1/6 Retrieval is supposed to improve generation in RAG systems. But in practice, adding more documents can hurt performance, even when relevant ones are retrieved. We introduce RAGGED, a framework to measure and diagnose when retrieval helps and when it hurts.

English

105

10.2K

Rattana Pukdee@rpukdeee·2 May

Link to paper: openreview.net/forum?id=vkvJD… Joint work with @mahbodm_, Vishwajeet Agrawal, @VariciBurak , @RavikumarPrad

English

110

Rattana Pukdee@rpukdeee·2 May

In our #AISTATS2025 paper, we ask: when it is possible to recover a consistent joint distribution from conditionals? We propose path consistency and autoregressive path consistency—necessary and easily verifiable conditions. See you at Poster session 3, Monday 5th May.

English

1.1K

Rattana Pukdee retweetledi

Dylan Sam@dylanjsam·17 Şub

Excited to share new work from my internship @GoogleAI ! Curious as to how we should measure the similarity between examples in pretraining datasets? We study the role of similarity in pretraining 1.7B parameter language models on the Pile. arxiv: arxiv.org/abs/2502.02494 1/🧵

English

167

19.8K

Rattana Pukdee retweetledi

Dylan Sam@dylanjsam·16 Oca

To trust LLMs in deployment (e.g., agentic frameworks or for generating synthetic data), we should predict how well they will perform. Our paper shows that we can do this by simply asking black-box models multiple follow-up questions! w/ @m_finzi and @zicokolter 1/ 🧵

English

116

15.1K

Rattana Pukdee retweetledi

Dylan Sam@dylanjsam·4 Ara

Contrastive VLMs (CLIP) lack the structure of text embeddings, like satisfying analogies via arithmetic (king - man = queen). We enhance CLIP’s *reasoning abilities* on such tasks by finetuning w/ text descriptions of image differences! w/ D. Willmott, J.Semedo, @zicokolter 1/🧵

English

171

20.1K

Rattana Pukdee retweetledi

Daniel P Jeong@danielpjeong·14 Kas

🧵 Are "medical" LLMs/VLMs *adapted* from general-domain models, always better at answering medical questions than the original models? In our oral presentation at #EMNLP2024 today (2:30pm in Tuttle), we'll show that surprisingly, the answer is "no". arxiv.org/abs/2411.04118

English

104

24.1K

Rattana Pukdee retweetledi

Daniel P Jeong@danielpjeong·18 Tem

(1/N) Can LLMs tell you what features to use for predicting an outcome? In our work, we demonstrate that LLMs such as GPT-4 are capable of identifying predictive features for supervised learning tasks, even without access to the training data. w/ @zacharylipton @RavikumarPrad 🧵

English

5.7K

Rattana Pukdee retweetledi

Runtian Zhai@RuntianZhai·30 Nis

One week away from @iclr_conf in Vienna 🤩 I will be presenting two spotlights: why big foundation models generalize so well under the self-supervised setting, and how to leverage massive unlabeled data using a base kernel that encodes inter-sample similarity. Details 👇 (1/3)

English

Rattana Pukdee retweetledi

Emily Byun@yewonbyun_·30 Nis

Estimating notions of unfairness/inequity is hard as it requires that data captures all features that influenced decision-making. But what if it doesn't? In our work (arxiv.org/abs/2403.14713), we answer this question w/ @dylanjsam @MichaelOberst @zacharylipton @brwilder

English

15.9K

Rattana Pukdee retweetledi

Runtian Zhai@RuntianZhai·2 Şub

Unlabeled data is crucial for modern ML. It provides info about data distribution P, but how to exploit such info? Given a kernel K, our #ICLR2024 spotlight gives a general & principled way: Spectrally Transformed Kernel Regression (STKR). Camera-ready 👇 arxiv.org/abs/2402.00645

English

5.8K

Rattana Pukdee retweetledi

Runtian Zhai@RuntianZhai·17 Oca

What'd you do with an inter-sample similarity kernel, lots of unlabeled and little labeled data? Some might say kernel ridge regression (KRR), but KRR can't use unlabeled data by representer theorem. Our #ICLR2024 spotlight STKR gives an answer. A 🧵 (1/3) openreview.net/forum?id=OeQE9…

English

1.6K

Rattana Pukdee retweetledi

Brandon Trabucco@brandontrabucco·21 Ara

Stable Diffusion is an effective data augmentation. Website: btrabuc.co/da-fusion Watch Here: youtu.be/IKDWOOWzwns I'm excited to share my NeurIPS talk about DA-Fusion from the Synthetic Data workshop, where we build an augmentation that semantically modifies images, and doesn't require prompt engineering or manual tuning. Our work improves few-shot learning across seven diverse vision tasks, including fine-grain concepts that are hard to engineer prompts for, and novel concepts that Stable Diffusion hasn't seen before. DA-Fusion is being used by ecologists to detect Leafy Spurge, an invasive plant, in drone images. #neurips #StableDiffusion #MachineLearning Joint with @rsalakhu, Kyle Doherty, @maxgurinas

YouTube

English

26.3K

Rattana Pukdee retweetledi

Dylan Sam@dylanjsam·13 Ara

Check out our #NeurIPS2023 paper "Learning with Explanation Constraints" with my co-author @rpukdeee, which explains how explanations of model behavior can help us from a learning-theoretic perspective! (arxiv.org/pdf/2303.14496…) 🧵 (1/n)

English

9.5K

Rattana Pukdee retweetledi

Alex Tamkin@AlexTamkin·17 Tem

DALL-E meets WALL-E: An Art History 1) Mona Lisa, Leonardo da Vinci

English

118

Rattana Pukdee retweetledi

Shubhendu Trivedi@_onionesque·21 Şub

"A Theory of PAC Learnability under Transformation Invariances" arxiv.org/abs/2202.07552 by Hao Shao, @montasser_omar and Avrim Blum; seems like one of the first papers studying optimal algorithms in terms of sample complexity under (group) transformation invariances.

English

Rattana Pukdee@rpukdeee·19 Kas

I am excited to share that I joined @mldcmu , @SCSatCMU as a PhD student and I will be working on interpretability/ robustness in ML with my advisors Nina Balcan and Pradeep Ravikumar. 🤓

English

Keşfet

@FahimTajwar10 @guanningzeng @Yueer_Zhou @mahbodm_ @VariciBurak @RavikumarPrad @GoogleAI @m_finzi