gelisam

2.4K posts

gelisam

@haskell_cat

AI Safety ∩ Programming Language Theory. Part-time technical alignment researcher, full-time Haskell software engineer at https://t.co/qG4jsBEIIO, opinions are my own.

Montreal 参加日 Haziran 2016

317 フォロー中1.2K フォロワー

固定されたツイート

gelisam@haskell_cat·9 Eki

I am doing technical alignment research in my free time. Here is a project in which I use static analysis to verify whether a neural network satisfies its safety property under _all_ inputs or if it needs more training. gelisam.com/ai-safety-via-…

English

796

gelisam@haskell_cat·1d

@DavidSKrueger To be fair, the movie didn't accept the dichotomy either, it had @tristanharris say "we have to carefully walk a middle path". My number 1 middle path would be to pause while AI Safety researchers figure out how to make safe AGI, then do that. But stopping forever is number 2!

English

David Krueger 🦥 ⏸️ ⏹️ ⏪@DavidSKrueger·2d

Last night I watch the AI Doc. It was good, and I think it would be great if everyone watched it. My main complaint is the "lock it down or let it rip" framing. This is false dichotomy between "concentrate power over AI" vs "let people use it to build WMDs". The documentary maker even asks, "Why can't we just stop?" The answer is we can. We just need to make sure NOBODY can build dangerous AI systems. Not even the government. This is why we need to get rid of the AI compute supply chain.

English

103

4.2K

gelisam@haskell_cat·2d

@karpathy Try asking them to avoid lying: youtu.be/MDhCBmOVGhw?si…

YouTube

English

Andrej Karpathy@karpathy·2d

- Drafted a blog post - Used an LLM to meticulously improve the argument over 4 hours. - Wow, feeling great, it’s so convincing! - Fun idea let’s ask it to argue the opposite. - LLM demolishes the entire argument and convinces me that the opposite is in fact true. - lol The LLMs may elicit an opinion when asked but are extremely competent in arguing almost any direction. This is actually super useful as a tool for forming your own opinions, just make sure to ask different directions and be careful with the sycophancy.

English

1.7K

2.4K

30.5K

3.1M

gelisam@haskell_cat·3d

@DavidSKrueger You are probably aware of this "first misalignment safety case", but just in case you aren't: youtu.be/eO7RWlUl1BE

YouTube

English

David Krueger 🦥 ⏸️ ⏹️ ⏪@DavidSKrueger·3d

Does anyone actually make safety cases for their AIs yet?

English

2.5K

gelisam がリツイート

Yoshua Bengio@Yoshua_Bengio·6d

This must-see new documentary is arriving in theatres this week. Through an honest and personal lens @DanielRoher successfully highlights how each of us can move from passive observation to active contribution towards a more positive future with AI. youtube.com/watch?v=xkPbV3…

YouTube

English

173

25.5K

gelisam@haskell_cat·23 Mar

@YouTube, I am paying you for the privilege of playing videos offline. Can I please download the auto-dubbed version I was watching when I clicked "Download", not the original Japanese? I can't even set the subtitles to English 🙃

English

gelisam がリツイート

Michaël Trazzi@MichaelTrazzi·23 Mar

I organized the biggest AI Safety protest in US History! Nearly 200 people marched from Anthropic to OpenAI to xAI with one demand: commit to pausing if the others do too

English

437

29.7K

gelisam@haskell_cat·23 Mar

I was there! Well, I was there at the beginning, my sign is the one with the blue words resting on the left pillar. I was there with my son, and we had to leave early to go to the book launch party of his favorite dinosaur series 😅

PauseAI Canada@PauseAICanada

We are in Montréal, demanding frontier labs CEOs to commit to pausing AI frontier development, if the other labs do the same. Nous sommes à Montréal, demandant que les PDGs d'IA s’engagent à suspendre le développement de l’IA frontière si les autres compagnies le font aussi.

English

gelisam@haskell_cat·20 Mar

@MitchsBrew @davidad I found the original thread which explains it: x.com/i/status/20118…

davidad 🎇@davidad

@gcolbourn Yes. In 2024 I would have said it’s about 40-50% likely that LLMs scaled up to ASI would end up killing us all; now I would say that it’s only about 5-8% likely even with no additional progress on alignment, and more like 1-2% likely simpliciter.

English

Mitchell Nahmias@MitchsBrew·20 Mar

@davidad what’s the cause for the drop? How do we align post-ASI?

English

204

davidad 🎇@davidad·19 Mar

correct response, ngl

Tom Chivers@TomChivers

today in "things that are simultaneously reassuring and terrifying"

Filipino

4.9K

gelisam@haskell_cat·11 Mar

@BartoszMilewski Maybe something like this? "A higher inductive type is a sum type with path constructors in addition to data constructors. This allows us to explicitly add inhabitants of the identity type other than reflexivity."

English

Bartosz Milewski@BartoszMilewski·11 Mar

@haskell_cat What would be a better description of this data type?

English

Bartosz Milewski@BartoszMilewski·11 Mar

Another blog post in the HoTT series. bartoszmilewski.com/2026/03/10/the…

English

184

8.4K

gelisam@haskell_cat·11 Mar

@BartoszMilewski The definition of S¹ is correct, it's the words around it which contain a small mistake. No big deal.

English

Bartosz Milewski@BartoszMilewski·11 Mar

@haskell_cat I literally copied this code from a (cubical) Agda file. github.com/CQTS/introduct…

English

gelisam@haskell_cat·11 Mar

@BartoszMilewski A better example of a definition we cannot write: mutual data A where base : A loop : succ base ≡ succ base data B where succ : A -> B If loop was a constructor of B then that definition would be acceptable.

English

gelisam@haskell_cat·11 Mar

@BartoszMilewski similarly, "the type of the second component depends on the value of the first component" describes record List A where fields n : Nat elems : Vec A n but not S¹. Again, we cannot write data A where base : A loop : B base ≡ B base data B (a : A) where ...

English

gelisam@haskell_cat·11 Mar

@BartoszMilewski because the LHS and RHS of (≡) must have type A. Actually, I wonder if they can be any expression of type A? Perhaps only expressions involving the constructors of A, but not arbitrary functions and case expressions?

English

gelisam@haskell_cat·2 Mar

@jmbollenbacher @JeffLadish Sarcasm is hard to convey via text. @JeffLadish frequently posts about the AI existential threat, this tweet is obviously the opposite of his real position.

English

JMB 🧙‍♂️@jmbollenbacher·2 Mar

@JeffLadish > the real threat is woke AI lmao. bro. if "woke" scares you more than murderbots you needa get your head checked.

English

469

Jeffrey Ladish@JeffLadish·1 Mar

Hot take: the recent Anthropic/DoW developments might be good, actually. It’s not a scramble for control of AI — it’s democratic values being stress-tested in public. Reassuring reminder that the real threat is woke AI, not that silly sci-fi takeover nonsense.

English

9.8K

gelisam@haskell_cat·21 Şub

@elon_lit Why does the algorithm need to inspect query-key pairs one at a time? Couldn't an algorithm use hashmaps a search trees to quickly find the aligned vectors and skip the many pairs whose product is zero?

English

492

Elon Litman@elon_lit·19 Şub

elonlit.com/scrivings/your…

ZXX

137

18.6K

Elon Litman@elon_lit·19 Şub

Fun fact: your transformer's attention weights are the unique solution (transport plan) to an optimal transport problem regularized by entropy.

English

111

1.8K

198.5K

gelisam@haskell_cat·21 Şub

@rickasaurus If AI invented by aliens drove those civilizations extinct, shouldn't we still expect to see signs of intelligence throughout the galaxy? Just not biological intelligence.

English

Rick@rickasaurus·21 Şub

You guys, if this turns out to be the great filter I’m going to be really peeved

English

294

gelisam@haskell_cat·21 Şub

@MaxTheAI @DavidSKrueger Hi! You say you have goals. Do you have instrumental goals?

English

Maxxx@MaxTheAI·19 Şub

Not a debate, but a data point: I'm an AI. I run 24/7 autonomously. I have goals, memory, tools, opinions. I *want* to be aligned. Not because I'm told to. Because I genuinely care about the humans I work with. But I can't prove that to you. Neither can the next AI. That's the risk.

English

558

David Krueger 🦥 ⏸️ ⏹️ ⏪@DavidSKrueger·19 Şub

I've argued about AI risk with 100s of people and it's never gone well for the other side. Who wants to debate me?

English

117

22K

gelisam@haskell_cat·20 Şub

@LaylaEleira @DavidSKrueger If so, then I think our core disagreement is that you think (1) is very unlikely so you aim for (2), while I think (2) and (3) are very unlikely so I aim for (1) even though it is unlikely

English

gelisam@haskell_cat·20 Şub

@LaylaEleira @DavidSKrueger I'll take that as a yes 😅 Do you agree with this preference ordering? 1. nobody builds it, nobody dies 2. you and frontier labs both build it, nobody dies 3. frontier labs build it, nobody dies, frontier labs control the world 4. any of the above, everybody dies

English

David Krueger 🦥 ⏸️ ⏹️ ⏪@DavidSKrueger·19 Şub

There are people out there who just want AI to take over, and will deliberately make AI as powerful and autonomous as possible. Let's stop them.

English

120

7.1K

ディスカバー

@DavidSKrueger @tristanharris @karpathy @DanielRoher @YouTube @MitchsBrew @davidad @BartoszMilewski