Been Kim (@_beenkim) - Twitter Profili | Zamantika Mersobahis Locabet

Been Kim@_beenkim·10 Tem

I’m giving a talk tmr at PhilML workshop at 10:35am! Please note that they’ll be actual philosophers (other speakers!) at the workshop too, so come by for some spicy thinking questions for you to ponder as you travel back home... :) It’s a great way to end icml.

English

0

1

49

7K

Been Kim@_beenkim·7 Tem

Flying to ICML ✈️ I’ll be spending the week either at the conference or at restaurants/street food stations eating at least 4 meals a day. Also giving a keynote at PhilML workshop on 11th:)

English

3

2

101

8K

Been Kim retweetledi

Pulkit Verma@pulkit_verma·17 Nis

The program for the #ICLR2026 Workshop "From Human Cognition to AI Reasoning" is now available. We have a fantastic lineup of talks. 🔗 hc-air.github.io/hcair26 Invited Speakers: @DocRachidAlami, @_beenkim, @ced_zhang Co-Organizers: @julie_a_shah, @sarath_ssreedh, @si_tulli

English

0

1

15

2.4K

Been Kim@_beenkim·4 Şub

Prompt engineering is still a black box. Why does changing X drastically change Y? Are there governing rules behind this evolution? Our new work proposes a simple way to uncover factors that might matter when refining prompts 👇

Neha Kalibhat@NehaKalibhat

Thrilled to share that our paper on "Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits" has been accepted at AISTATS 2026! 🚀🚀 Read more about how input mutations can be mapped to interpretable behavioral insights. arxiv.org/abs/2602.00092 🧵

English

2

1

37

8.3K

Been Kim@_beenkim·27 Oca

@francoisfleuret As my brain is taken over by “wrongfully accused” sentiment, I typed in my password to their website, brainlessly…

English

1

0

3

152

François Fleuret@francoisfleuret·22 Oca

@_beenkim I do not understand how reading an email leads to losing control of your account, can you elaborate?

English

1

0

2

405

Been Kim@_beenkim·20 Oca

I got my account back! Thank you, first and foremost, to everyone—friends, GDM colleagues---who personally alerted me to this incident and retweeted that I'm hacked, as well as folks at X who helped me regain access. While this incident was terrible (I heard the scammers made huge money out of this), I feel incredibly lucky to have folks who cared♥️♥️♥️ (details of how this happened 👇)

English

24

6

172

23.2K

Been Kim@_beenkim·22 Oca

@savvyRL haha What do you think? Do I really sound like Been Kim now? :) even if you text me, it's possible that scammers has my phone too. even if we can GVC, it might be an AI-generated content. We just have to meet in person, since robotics is not quite there yet. 😵‍💫😵‍💫😵‍💫

English

2

0

12

1.5K

Rosanne Liu@savvyRL·21 Oca

@_beenkim Prove that you are you, Been :)

English

1

0

6

1.8K

Been Kim@_beenkim·13 Ara

@ashVaswani Congrats!! 🎉

English

0

2

859

Ashish Vaswani@ashVaswani·11 Ara

Rnj-1-Instruct is now the #1 trending text generation model on HF!

English

22

33

400

61.2K

Been Kim retweetledi

Susan Zhang@suchenzang·30 Eki

What a privilege it is to have time as your most valuable currency.

English

5

11

205

0

Been Kim retweetledi

Christopher Potts@ChrisGPotts·10 Ara

Safety-oriented interpretability researchers should be focused on AI systems, not individual model artifacts. A snippet from the NeurIPS CogInterp workshop panel on Sunday:

English

6

18

167

16.5K

Been Kim retweetledi

Christopher Potts@ChrisGPotts·2 Ara

This post seems to describe substantially the same view that I offer here: web.stanford.edu/~cgpotts/blog/… Why are people describing the GDM post as concluding that mech-interp is a failed project? Is it the renaming of the field and constant talk of "pivoting"?

Neel Nanda@NeelNanda5

The GDM mechanistic interpretability team has pivoted to a new approach: pragmatic interpretability Our post details how we now do research, why now is the time to pivot, why we expect this way to have more impact and why we think other interp researchers should follow suit

English

4

19

125

32.1K

Been Kim@_beenkim·8 Ara

@giffmana Wait until he says six seven. 🙃

English

0

13

1.3K

Lucas Beyer (bl16)@giffmana·7 Ara

I think i understand now. I need to align him a bit more 😅

Lucas Beyer (bl16)@giffmana

My 5yo just said skibidi toilet out of nowhere. Send help pls.

English

7

0

86

16.1K

Been Kim@_beenkim·7 Ara

Tomorrow 9:30am #NeurIPS2025 Room 30A-E I'll talk about " 📈Towards Pareto frontier of interpretability: 15 years of interpretability research in 15 mins"🚅 @ mech interp workshop mechinterpworkshop.com

English

5

9

81

12.3K

Been Kim@_beenkim·7 Ara

@doomie @shaneguML go harass Dumi 🤣

Filipino

0

5

661

Dumitru Erhan@doomie·7 Ara

@shaneguML @_beenkim Step 1: Become Been 2: ??? 3: profit!

San Diego, CA 🇺🇸 English

1

0

4

927

Been Kim@_beenkim·6 Ara

Take that @doomie Samy Bengio! Hehehe

Indonesia

12

5

103

37.1K

Been Kim@_beenkim·5 Ara

Our work out there in the wild 🥹

Zi Wang, Ph.D.@ziwphd

🔥 Proactive Co-Creator is officially LIVE in @GoogleAIStudio! Stop guessing prompts. Start collaborating. Use it now to remix ideas and generate images, stories, and video with an AI that proactively helps you create. 🔗 Try it here: aistudio.google.com/apps/bundled/p… 📍 At #NeurIPS2025? Come see the live demo TODAY (Dec 3) 9AM - 1:30PM | Google Booth #1533 (Kiosk 3) 🧠 Our research @GoogleDeepMind : We’re turning theory into practice. Read the papers behind the tech: Concept Edits (Tech Report): storage.googleapis.com/concept-edit/r… Proactive Agents (ICML 25'): arxiv.org/abs/2412.06771 QuestBench (NeurIPS 25'): arxiv.org/abs/2503.22674

English

0

3

27

8.2K

Been Kim retweetledi

Zi Wang, Ph.D.@ziwphd·3 Ara

🔥 Proactive Co-Creator is officially LIVE in @GoogleAIStudio! Stop guessing prompts. Start collaborating. Use it now to remix ideas and generate images, stories, and video with an AI that proactively helps you create. 🔗 Try it here: aistudio.google.com/apps/bundled/p… 📍 At #NeurIPS2025? Come see the live demo TODAY (Dec 3) 9AM - 1:30PM | Google Booth #1533 (Kiosk 3) 🧠 Our research @GoogleDeepMind : We’re turning theory into practice. Read the papers behind the tech: Concept Edits (Tech Report): storage.googleapis.com/concept-edit/r… Proactive Agents (ICML 25'): arxiv.org/abs/2412.06771 QuestBench (NeurIPS 25'): arxiv.org/abs/2503.22674

English

0

8

27

13.8K

Been Kim retweetledi

Stanford NLP Group@stanfordnlp·4 Ara

Awesome @NeurIPSConf keynote this morning by @YejinChoinka on The Art of (Artificial) Reasoning – and her broader thoughts and wishes on the future of Artificial Intelligence neurips.cc/virtual/2025/i…

English

1

16

101

12.7K

Been Kim@_beenkim·5 Ara

Add: 9:30am on Sunday at Neurips, i'll touch upon this at the mech interp workshop keynote mechinterpworkshop.com

English

0

1

4

1.3K

Been Kim@_beenkim·5 Ara

8/8 Making AI benefit humans takes a village. 🌍 But a village needs a shared language. Let's stop guessing and start measuring the frontier.📷 a short write-up: @beenkim/the-pareto-frontier-of-human-centered-ai-54f90ba5872c" target="_blank" rel="nofollow noopener">medium.com/@beenkim/the-p…

English

1

2

4

1.9K

Been Kim@_beenkim·5 Ara

1/8 Pareto Frontier 🤠for Human-centered AI 📈: We all want to build AI that is good for humans, but the path is often paralyzed by complexity. Either “oh my god, it’s too complicated😱” or delusional “I have a warm and fuzzy feeling of understanding 🥴”? "It’s hard because it depends.🤷" is the enemy of progress. We need a Pareto Frontier for Human-centered AI. 🧵👇

English

5

13

80

37.9K

Been Kim

Keşfet