Pinned Tweet
Vincent Conitzer
2K posts

Vincent Conitzer
@conitzer
AI professor. Director, @FOCAL_lab @CarnegieMellon. Head of Technical AI Engagement, @UniofOxford @EthicsInAI. Author, "Moral AI - And How We Get There."
Joined June 2009
1.2K Following · 4.4K Followers

Speaking of needing off-switches, hard to imagine a better example than this...
[desperately messages] "STOP OPENCLAW"
[completely ignored by OpenClaw, runs to computer to kill processes]
pcmag.com/news/meta-secu…

Shutdown safety valves (giving AI an objective to shut itself down and a means to do so if some capability/access/... is dangerously high) now on arXiv
arxiv.org/abs/2603.07315

@ShriramKMurthi The sound seems to be coming from downstairs so I must be hallucinating it

@conitzer Does she even have your phone number, Vincent, or did you just imagine that? (-:

@zustimmungswahl @C_Oesterheld I think the metric actually clarifies this. For Borda, when it's clear how others are likely to vote and hard to find additional voters, manipulation is likely as you describe. But if it's unclear how others will vote and easy to find more voters, the latter makes sense.

@conitzer @C_Oesterheld Good idea, but that two of the most strategic voting methods turn out best in your analysis should make you pause and think.

We (w/ Berker, Hartman, Liu, @C_Oesterheld) introduce a new measure of approximate strategyproofness -- having how many truthful copies of yourself is guaranteed to be at least as effective as misrepresenting your preferences?
arxiv.org/abs/2602.22838

My parallelogram law visual proof is now in The American Mathematical Monthly! (Without paywall: cs.cmu.edu/~conitzer/visu…) I'll venture that this is (for now?) a very human proof. (See also previous post...)
tandfonline.com/doi/full/10.10…

@StatsLime @xuanalogue @C_Oesterheld I think the metric actually clarifies this. In a setting where it's clear how others are likely to vote and hard to find additional voters, manipulation is likely as you describe. But if it's unclear how others will vote and easy to find more voters, the latter makes sense.

@conitzer @xuanalogue @C_Oesterheld If you find Borda (infamous for how badly it collapses under manipulation) to be the hardest to manipulate, I think you should reconsider whether the metric is actually useful


@FellowHominid @C_Oesterheld Well, in any case, we're excited about it :-)

@conitzer @C_Oesterheld not very familiar with the subject of voting theory but this seems like a big deal??
Borda count supremacy ?

@conitzer @C_Oesterheld That's an awesome idea, very cool paper!

@sethlazar I had never heard about that! But you're not the only one to mention it :-)

@conitzer ahh, the mr meeseeks model (love that episode of Rick and Morty, very good for illustrating this idea)

Looking forward to presenting "Shutdown Safety Valves for Advanced AI" in the next session at IASEAI'26 in Paris! (Basic idea: give AI a goal of shutting itself down.) cs.cmu.edu/~conitzer/shut…

My 5-year part-time appointment with Oxford's Institute for Ethics in AI has come to an end. Many good memories, new things learned, and new friends made. Looking forward to seeing how the Institute develops, and on to new adventures! oxford-aiethics.ox.ac.uk









