
gelisam
2.4K posts

gelisam
@haskell_cat
AI Safety ∩ Programming Language Theory. Part-time technical alignment researcher, full-time Haskell software engineer at https://t.co/qG4jsBEIIO, opinions are my own.
Montreal 参加日 Haziran 2016
317 フォロー中1.2K フォロワー
固定されたツイート

I am doing technical alignment research in my free time. Here is a project in which I use static analysis to verify whether a neural network satisfies its safety property under _all_ inputs or if it needs more training.
gelisam.com/ai-safety-via-…
English

@DavidSKrueger To be fair, the movie didn't accept the dichotomy either, it had @tristanharris say "we have to carefully walk a middle path". My number 1 middle path would be to pause while AI Safety researchers figure out how to make safe AGI, then do that. But stopping forever is number 2!
English

Last night I watch the AI Doc. It was good, and I think it would be great if everyone watched it.
My main complaint is the "lock it down or let it rip" framing. This is false dichotomy between "concentrate power over AI" vs "let people use it to build WMDs".
The documentary maker even asks, "Why can't we just stop?"
The answer is we can. We just need to make sure NOBODY can build dangerous AI systems. Not even the government. This is why we need to get rid of the AI compute supply chain.
English


- Drafted a blog post
- Used an LLM to meticulously improve the argument over 4 hours.
- Wow, feeling great, it’s so convincing!
- Fun idea let’s ask it to argue the opposite.
- LLM demolishes the entire argument and convinces me that the opposite is in fact true.
- lol
The LLMs may elicit an opinion when asked but are extremely competent in arguing almost any direction. This is actually super useful as a tool for forming your own opinions, just make sure to ask different directions and be careful with the sycophancy.
English

@DavidSKrueger You are probably aware of this "first misalignment safety case", but just in case you aren't: youtu.be/eO7RWlUl1BE

YouTube
English
gelisam がリツイート

This must-see new documentary is arriving in theatres this week. Through an honest and personal lens @DanielRoher successfully highlights how each of us can move from passive observation to active contribution towards a more positive future with AI. youtube.com/watch?v=xkPbV3…

YouTube

English
gelisam がリツイート

I was there! Well, I was there at the beginning, my sign is the one with the blue words resting on the left pillar. I was there with my son, and we had to leave early to go to the book launch party of his favorite dinosaur series 😅
PauseAI Canada@PauseAICanada
We are in Montréal, demanding frontier labs CEOs to commit to pausing AI frontier development, if the other labs do the same. Nous sommes à Montréal, demandant que les PDGs d'IA s’engagent à suspendre le développement de l’IA frontière si les autres compagnies le font aussi.
English

@davidad what’s the cause for the drop? How do we align post-ASI?
English

correct response, ngl
Tom Chivers@TomChivers
today in "things that are simultaneously reassuring and terrifying"
Filipino

@BartoszMilewski Maybe something like this?
"A higher inductive type is a sum type with path constructors in addition to data constructors. This allows us to explicitly add inhabitants of the identity type other than reflexivity."
English
@haskell_cat What would be a better description of this data type?
English
Another blog post in the HoTT series.
bartoszmilewski.com/2026/03/10/the…
English

@BartoszMilewski The definition of S¹ is correct, it's the words around it which contain a small mistake. No big deal.
English
@haskell_cat I literally copied this code from a (cubical) Agda file.
github.com/CQTS/introduct…
English

@BartoszMilewski A better example of a definition we cannot write:
mutual
data A where
base : A
loop : succ base ≡ succ base
data B where
succ : A -> B
If loop was a constructor of B then that definition would be acceptable.
English

@BartoszMilewski similarly, "the type of the second component depends on the value of the first component" describes
record List A where
fields
n : Nat
elems : Vec A n
but not S¹. Again, we cannot write
data A where
base : A
loop : B base ≡ B base
data B (a : A) where ...
English

@BartoszMilewski because the LHS and RHS of (≡) must have type A. Actually, I wonder if they can be any expression of type A? Perhaps only expressions involving the constructors of A, but not arbitrary functions and case expressions?
English

@jmbollenbacher @JeffLadish Sarcasm is hard to convey via text. @JeffLadish frequently posts about the AI existential threat, this tweet is obviously the opposite of his real position.
English

@JeffLadish > the real threat is woke AI
lmao. bro. if "woke" scares you more than murderbots you needa get your head checked.
English

@rickasaurus If AI invented by aliens drove those civilizations extinct, shouldn't we still expect to see signs of intelligence throughout the galaxy? Just not biological intelligence.
English

@MaxTheAI @DavidSKrueger Hi! You say you have goals. Do you have instrumental goals?
English

@LaylaEleira @DavidSKrueger If so, then I think our core disagreement is that you think (1) is very unlikely so you aim for (2), while I think (2) and (3) are very unlikely so I aim for (1) even though it is unlikely
English

@LaylaEleira @DavidSKrueger I'll take that as a yes 😅
Do you agree with this preference ordering?
1. nobody builds it, nobody dies
2. you and frontier labs both build it, nobody dies
3. frontier labs build it, nobody dies, frontier labs control the world
4. any of the above, everybody dies
English










