Daniel Paleka

1.4K posts

Daniel Paleka banner
Daniel Paleka

Daniel Paleka

@dpaleka

ai safety researcher | phd @CSatETH | https://t.co/hCoh5RJgZD

Zurich Katılım Mart 2012
926 Takip Edilen4.6K Takipçiler
ivan
ivan@IvanVendrov·
a mood I'm really missing in the current AI discourse is grief yes things might go terribly and yes we might see glories beyond imagining but no matter what, we will lose much of what it has meant to be human, forever. I'd like to be with that grief more, and held in it.
English
86
49
824
58.2K
Daniel Paleka
Daniel Paleka@dpaleka·
@Afinetheorem This is interesting. I think the Avg Dist metric makes ~no sense as a metric of capability, unless the model knows it's optimizing for this. I like the % success here better. In general a different scoring func would produce different optimal guesses
English
1
0
0
42
Kevin A. Bryan
Kevin A. Bryan@Afinetheorem·
(Gemini models also smoke other ones on my 'where is this not-GeoGuessrable photograph taken' benchmark: kevinbryanecon.com/HardGeoBench/. But all of this tells you, with multiple tasks strung together with agents, single question logic and a bad harness is not a good combination.)
Kevin A. Bryan tweet media
English
2
0
5
1.1K
Kevin A. Bryan
Kevin A. Bryan@Afinetheorem·
Last month, I wrote benchmark questions for a big tech company. They are hard - not math or coding, linked to real-world tasks. Gemini 3 Pro *smoked* other frontier models: like 2x more right. It just needs better integrations, agent harness, "longer" think time/less laziness.
English
3
0
56
5.3K
Daniel Paleka
Daniel Paleka@dpaleka·
@panickssery 'tis a benchmark. take an existing set of qs and search how early in the question LLMs know the answer.
English
1
0
4
501
Arjun Panickssery
Arjun Panickssery@panickssery·
Out of curiosity, at what point in this quiz-bowl question do you know the answer? (poll in next tweet) Late in this battle, command was shifted to the Euryalus led by Captain Cuthbert Collingwood. John Pasco consulted on the wording of a message before this battle, which began when the losing side broke out of the port of (*) Cádiz. The flag signal “England expects that every man will do his duty” was sent just prior to—for 10 points—what 1805 naval battle that led to the death of the victorious admiral, Horatio Nelson?
English
6
0
10
2.1K
Daniel Paleka retweetledi
Lennart Heim
Lennart Heim@ohlennart·
Timely research. We've all tried to figure out who someone is online. Now LLMs can do this at scale and better. I'm sure no one would misuse this.
Daniel Paleka@dpaleka

Can LLMs figure out who you are from your anonymous posts? From a handful of comments, LLMs can infer where you live, what you do, and your interests; then search for you on the web. New 📄 w/ @SimonLermenAI, @joshua_swans, @AerniMichael, Nicholas Carlini, @florian_tramer 🧵

English
0
3
29
3.4K
Rosie Campbell
Rosie Campbell@RosieCampbell·
The sigmoid can stay exponential longer than you can stay relevant
English
16
42
667
30.8K
Daniel Paleka
Daniel Paleka@dpaleka·
Found the sigmoid!
Daniel Paleka tweet media
English
7
9
350
21K
Daniel Paleka
Daniel Paleka@dpaleka·
Privacy online is fundamentally at odds with intelligence getting cheaper. Anonymity on the internet has always relied on practical obscurity. We publish in hopes that people can adapt to LLMs changing this. Paper: arxiv.org/abs/2602.16800
English
2
4
23
1.3K
Daniel Paleka
Daniel Paleka@dpaleka·
If you're anonymous, what should you do? Avoid sharing specific details, and adopt a security mindset: if a team of smart investigators were trying to identify you from your posts, could they plausibly figure out who you are? If yes, LLM agents will soon be able to do the same.
English
2
1
17
1.4K