Been Kim

841 posts

@_beenkim

Research Scientist at Google DeepMind, PhD from MIT. Make machines empower people.

Joined August 2011
511 Following · 27.1K Followers
Been Kim@_beenkim·
Prompt engineering is still a black box. Why does changing X drastically change Y? Are there governing rules behind this evolution? Our new work proposes a simple way to uncover factors that might matter when refining prompts 👇
Neha Kalibhat@NehaKalibhat

Thrilled to share that our paper on "Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits" has been accepted at AISTATS 2026! 🚀🚀 Read more about how input mutations can be mapped to interpretable behavioral insights. arxiv.org/abs/2602.00092 🧵

Been Kim@_beenkim·
@francoisfleuret As my brain is taken over by “wrongfully accused” sentiment, I typed in my password to their website, brainlessly…
François Fleuret@francoisfleuret·
@_beenkim I do not understand how reading an email leads to losing control of your account, can you elaborate?
Been Kim@_beenkim·
I got my account back! Thank you, first and foremost, to everyone (friends, GDM colleagues) who personally alerted me to this incident and retweeted that I was hacked, as well as the folks at X who helped me regain access. While this incident was terrible (I heard the scammers made a lot of money from this), I feel incredibly lucky to have folks who cared♥️♥️♥️ (details of how this happened 👇)
Been Kim@_beenkim·
@savvyRL haha What do you think? Do I really sound like Been Kim now? :) Even if you text me, it's possible that the scammers have my phone too. Even if we can GVC, it might be AI-generated content. We just have to meet in person, since robotics is not quite there yet. 😵‍💫😵‍💫😵‍💫
Ashish Vaswani@ashVaswani·
Rnj-1-Instruct is now the #1 trending text generation model on HF!
Been Kim retweeted
Susan Zhang@suchenzang·
What a privilege it is to have time as your most valuable currency.
Been Kim retweeted
Christopher Potts@ChrisGPotts·
Safety-oriented interpretability researchers should be focused on AI systems, not individual model artifacts. A snippet from the NeurIPS CogInterp workshop panel on Sunday:
Been Kim retweeted
Christopher Potts@ChrisGPotts·
This post seems to describe substantially the same view that I offer here: web.stanford.edu/~cgpotts/blog/… Why are people describing the GDM post as concluding that mech-interp is a failed project? Is it the renaming of the field and constant talk of "pivoting"?
Neel Nanda@NeelNanda5

The GDM mechanistic interpretability team has pivoted to a new approach: pragmatic interpretability Our post details how we now do research, why now is the time to pivot, why we expect this way to have more impact and why we think other interp researchers should follow suit

Been Kim@_beenkim·
@giffmana Wait until he says six seven. 🙃
Been Kim@_beenkim·
Tomorrow 9:30am #NeurIPS2025 Room 30A-E I'll talk about " 📈Towards Pareto frontier of interpretability: 15 years of interpretability research in 15 mins"🚅 @ mech interp workshop mechinterpworkshop.com
Been Kim@_beenkim·
Take that @doomie Samy Bengio! Hehehe
Been Kim@_beenkim·
Our work out there in the wild 🥹
Zi Wang, Ph.D.@ziwphd

🔥 Proactive Co-Creator is officially LIVE in @GoogleAIStudio! Stop guessing prompts. Start collaborating. Use it now to remix ideas and generate images, stories, and video with an AI that proactively helps you create. 🔗 Try it here: aistudio.google.com/apps/bundled/p… 📍 At #NeurIPS2025? Come see the live demo TODAY (Dec 3) 9AM - 1:30PM | Google Booth #1533 (Kiosk 3) 🧠 Our research @GoogleDeepMind : We’re turning theory into practice. Read the papers behind the tech: Concept Edits (Tech Report): storage.googleapis.com/concept-edit/r… Proactive Agents (ICML 25'): arxiv.org/abs/2412.06771 QuestBench (NeurIPS 25'): arxiv.org/abs/2503.22674

Been Kim@_beenkim·
Addendum: at 9:30am on Sunday at NeurIPS, I'll touch upon this in the mech interp workshop keynote: mechinterpworkshop.com
Been Kim@_beenkim·
8/8 Making AI benefit humans takes a village. 🌍 But a village needs a shared language. Let's stop guessing and start measuring the frontier. 📷 A short write-up: medium.com/@beenkim/the-pareto-frontier-of-human-centered-ai-54f90ba5872c
Been Kim@_beenkim·
1/8 Pareto Frontier 🤠for Human-centered AI 📈: We all want to build AI that is good for humans, but the path is often paralyzed by complexity. Either “oh my god, it’s too complicated😱” or delusional “I have a warm and fuzzy feeling of understanding 🥴”? "It’s hard because it depends.🤷" is the enemy of progress. We need a Pareto Frontier for Human-centered AI. 🧵👇