Kshitish Ghate

58 posts

Kshitish Ghate

Kshitish Ghate

@GhateKshitish

PhD student @UWCSE | MLT Grad student @LTIatCMU | CS and Econ @bitspilanigoa

Katılım Ekim 2022
253 Takip Edilen100 Takipçiler
Sabitlenmiş Tweet
Kshitish Ghate
Kshitish Ghate@GhateKshitish·
🚨New paper: Reward Models (RMs) are used to align LLMs, but can they be steered toward user-specific value/style preferences? With EVALUESTEER, we find even the best RMs we tested exhibit their own value/style biases, and are unable to align with a user >25% of the time. 🧵
Kshitish Ghate tweet media
English
1
17
68
6.5K
Kshitish Ghate retweetledi
Natasha Jaques
Natasha Jaques@natashajaques·
The paper I’ve been most obsessed with lately is finally out: nbcnews.com/tech/tech-news…! Check out this beautiful plot: it shows how much LLMs distort human writing when making edits, compared to how humans would revise the same content. We take a dataset of human-written essays from 2021, before the release of ChatGPT. We compare how people revise draft v1 -> v2 given expert feedback, with how an LLM revises the same v1 given the same feedback. This enables a counterfactual comparison: how much does the LLM alter the essay compared to what the human was originally intending to write? We find LLMs consistently induce massive distortions, even changing the actual meaning and conclusions argued for.
Natasha Jaques tweet media
English
43
385
1.4K
237.2K
Kshitish Ghate retweetledi
Zora Wang
Zora Wang@ZhiruoW·
‼️Position: AI coding agent research needs recalibration. We've heavily optimized for solo autonomy, and far less for designing agents that empower the humans using them. It’s time to build human-centered coding agents. 🧵
Zora Wang tweet media
English
26
63
317
48.7K
Kshitish Ghate retweetledi
Ethan Shen
Ethan Shen@ethnlshn·
Today, we release SERA-32B, an approach to coding agents that matches Devstral 2 at just $9,000. It is fully open-source and you can train your own model easily - at 26x the efficiency of using RL. Paper: allenai.org/papers/opencod… Here’s how 🧵
Ai2@allen_ai

Introducing Ai2 Open Coding Agents—starting with SERA, our first-ever coding models. Fast, accessible agents (8B–32B) that adapt to any repo, including private codebases. Train a powerful specialized agent for as little as ~$400, & it works with Claude Code out of the box. 🧵

English
27
91
691
90.3K
Kshitish Ghate retweetledi
Lucy Li
Lucy Li@lucy3_li·
PhD apps season is here! 😱🥳 Apply to do a PhD @WisconsinCS (as pictured) w/ me to research: - Societal impact of AI - NLP ←→ CSS and cultural analytics - Computational sociolinguistics - Human-AI interaction - Culturally competent and inclusive NLP lucy3.github.io/prospective-st…
Lucy Li tweet mediaLucy Li tweet mediaLucy Li tweet media
English
17
67
362
57.6K
Kshitish Ghate retweetledi
Taylor Sorensen
Taylor Sorensen@ma_tay_·
My best hypothesis for the mechanism is: Chat LLMs are hyperoptimized to approximate the single "best" (most-preferred) response. When you prompt it for a single story, it gives the single best story it can. When you ask it to give FIVE stories, you recast the "best" response to be one containing FIVE stories, which has more diversity (a very good trick!) However, in the limit, as we train models with this objective, it converges to ALWAYS giving the same "best"/high-reward story - a fundamental limitation of the current paradigm x.com/ma_tay_/status…
English
3
3
21
1.8K
Kshitish Ghate
Kshitish Ghate@GhateKshitish·
🚨New paper: Reward Models (RMs) are used to align LLMs, but can they be steered toward user-specific value/style preferences? With EVALUESTEER, we find even the best RMs we tested exhibit their own value/style biases, and are unable to align with a user >25% of the time. 🧵
Kshitish Ghate tweet media
English
1
17
68
6.5K
Kshitish Ghate retweetledi
Taylor Sorensen
Taylor Sorensen@ma_tay_·
🤖➡️📉 Post-training made LLMs better at chat and reasoning—but worse at distributional alignment, diversity, and sometimes even steering(!) We measure this with our new resource (Spectrum Suite) and introduce Spectrum Tuning (method) to bring them back into our models! 🌈 1/🧵
Taylor Sorensen tweet media
English
5
45
198
68K