Dominik Peters

4.9K posts

Dominik Peters banner
Dominik Peters

Dominik Peters

@DominikPeters

CS researcher in Paris (CNRS) on voting theory. I travel by train a lot. From 🇩🇪, studied in 🇬🇧, worked in 🇺🇸 and 🇨🇦, now live in 🇫🇷.

Paris, France Katılım Haziran 2009
354 Takip Edilen493 Takipçiler
Dominik Peters
Dominik Peters@DominikPeters·
@superalbs @muunijs @sxarek I have the opposite preference (feels socially awkward to sleep in a small room with strangers). I’m much happier with the arrangement in an airplane.
English
1
0
2
18
Superalbs
Superalbs@superalbs·
@muunijs @sxarek Yes, you are correct in that it is more efficient than private rooms, however sharing rooms is more efficient. But generally, I would rather share sleeping space with 2 other people, than 20 other people.
English
1
0
1
43
Szarek
Szarek@sxarek·
Dalej nie pojmuję dlaczego na kolei tak się wzbraniają przed dwufunkcyjną klasą business, która za dnia jest produktem premium a w nocy tańszym rozwiązaniem od wagonu sypialnego. Jedynie w Norwegii coś takiego spróbowali ale kompletnie zepsuli puszczając tylko w nocy
Szarek tweet media
Vojtěch Očadlý @vojtaocadly

Proc neni ve vlacich prvni trida jak v letadle? WDYM ze si ve 21:00 nemuzu oblict pyzamo a vycistit zuby, zatimco mi stewardka JLV ustele mou plne polohovatelnou sedacku

Polski
3
1
34
2.9K
David Daniel Gann
David Daniel Gann@DavidDanielGann·
@ryanflorence Absolutely. It’s an overcorrection from the sycophant phase. Now it tries REALLY HARD not to straight up agree with you.
English
2
0
5
171
Dominik Peters
Dominik Peters@DominikPeters·
@thsottiaux @jxnlco I like using 5.4-mini at times but with the new model picker this takes many clicks. Can the model picker remember which models the user has used in the last few days and expose them before going into “more” menus?
English
0
0
0
35
Tibo
Tibo@thsottiaux·
It’s the little things that matter, what are some small papercuts you have noticed in Codex? We’ll fix as many as possible in the next week.
English
1.9K
57
2.3K
241.8K
Dominik Peters
Dominik Peters@DominikPeters·
@thsottiaux @jxnlco When a request fails (eg network error or out of usage limit), there is no button for resending the same message later like “Try Again”. I usually “edit” the message and send it unchanged but that is dirty.
English
0
0
0
14
Richard Ngo
Richard Ngo@RichardMCNgo·
@DominikPeters @lefineder Majority switches from all-blue to mostly red with epsilon ^ (N/2) probability. So switching from blue to red yourself is very very slightly better.
English
1
0
1
131
LiorLefineder
LiorLefineder@lefineder·
Pressing red is the Nash equilibrium, meaning that it's the best choice regardless of the choices of others (which you don't control). So if a group of rational agents (in the game theoretical sense) were presented with this problem, they would all press red and all survive. Although it doesn't look like it, it’s basically the opposite of the Prisoner’s Dilemma, in which rational agents following self-interest leads to a worse outcome for everyone.
Tim Urban@waitbutwhy

Everyone in the world has to take a private vote by pressing a red or blue button. If more than 50% of people press the blue button, everyone survives. If less than 50% of people press the blue button, only people who pressed the red button survive. Which button would you press?

English
8
3
77
10.1K
Dominik Peters
Dominik Peters@DominikPeters·
@MannOhneListe @RichardMCNgo @lefineder In practice, yes, but Nash equilibrium is a precisely defined mathematical concept and it deliberately does not contain uncertainty about the actions of others. If you want to reason about that you’ll want other concepts like Bayes-Nash equilibrium
English
0
0
1
21
Dominik Peters
Dominik Peters@DominikPeters·
@RichardMCNgo @lefineder Hmm which Nash equilibria fail to be trembling hand? A clear blue majority remains blue even with epsilon noise so blue is a best response, and same for all-red
English
1
0
1
178
Richard Ngo
Richard Ngo@RichardMCNgo·
@lefineder Any blue majority is also a nash equilibrium. The criterion that distinguishes them is trembling-hand equilibrium
English
5
0
28
2.9K
Dominik Peters
Dominik Peters@DominikPeters·
@MannOhneListe @RichardMCNgo @lefineder Nash equilibrium means that if we are in a particular situation (eg a 57% blue majority), then each person has no incentive to change their action *assuming that no one else changes their action*
English
1
0
1
31
Dominik Peters
Dominik Peters@DominikPeters·
First impressions of GPT-5.5-Thinking for mathematics are scary good. Many of my old conversations where 5.4 couldn't solve the problem now seem to get solved (but still need to check correctness).
English
0
0
2
140
Aaron Bergman 🔍 ⏸️ (in that order)
You could imagine biometric ~proof of age, at the very least I’d imagine it is not a hard ML task to do “look left, look right, left again, blink, chin up” -> verify that a live person is there + then run an age prediction model and if it says p(>=18) >= 99% or something then throw out the data and let them register
English
4
0
22
1.4K
Aaron Bergman 🔍 ⏸️ (in that order)
Pretty obvious point in hindsight I failed to see: you literally don’t get access by proving your age is above some threshold (not that that would necessarily be good or fine, but still)
Taylor Lorenz@TaylorLorenz

Mind you it’s not age gating, it’s IDENTITY GATING. There is no such thing as “age verification” it’s ID verification. I think we need to stop using the term age verification bc it’s an industry term that obfuscates the reality of what’s happening.

English
3
2
86
14.3K
Dominik Peters
Dominik Peters@DominikPeters·
@btibor91 Sounds like fast answers are cached responses from a database, maybe through some sort of embedding RAG
Dominik Peters tweet media
English
0
0
6
624
Tibor Blaho
Tibor Blaho@btibor91·
OpenAI rolled out new ChatGPT updates including Fast answers, ChatGPT for Clinicians with HealthBench Professional, and ChatGPT for Google Sheets - Fast answers gives quicker replies to simple info-seeking questions when ChatGPT is confident in the answer, skips past chats and memory, works globally on web, iOS, and Android for logged in and logged out users across all plans, and can be turned off in Personalization settings - ChatGPT for Clinicians is a free version for verified US physicians, NPs, PAs, and pharmacists covering evidence review, documentation, and medical research, with clinical search and citations, reusable skills, deep research across medical literature, CME credits on eligible questions, and optional HIPAA support via a BAA, shown as a separate workspace under the same ChatGPT login, with plans to expand to more countries via a Better Evidence Network pilot - OpenAI also released HealthBench Professional, an open benchmark built on HealthBench that uses physician-authored conversations and rubrics to test models on real clinician chat tasks across care consult, writing and documentation, and medical research, where GPT-5.4 inside ChatGPT for Clinicians beat base GPT-5.4, other OpenAI and external models, and human physicians - ChatGPT for Google Sheets lets you create new sheets, ask questions across tabs and formulas, and make updates directly in your sheets
Tibor Blaho tweet mediaTibor Blaho tweet mediaTibor Blaho tweet mediaTibor Blaho tweet media
English
17
25
440
34.9K
Dominik Peters
Dominik Peters@DominikPeters·
@GarettJones I wonder if Parisians could correctly rank-order the arrondissements by rent. I think people think the 16th is more expensive than it really is.
English
0
0
1
79
Garett Jones
Garett Jones@GarettJones·
Average Paris rents differ by about 35% across neighborhoods. Average DC rents differ by about 150% across neighborhoods.
Garett Jones tweet mediaGarett Jones tweet mediaGarett Jones tweet media
English
3
0
8
1.5K
Dominik Peters
Dominik Peters@DominikPeters·
@GarettJones Potentially the DC numbers are skewed a bit if 2BR apartments are bigger in higher-priced areas, since it is not normalized per m^2.
English
1
0
1
71
Dominik Peters
Dominik Peters@DominikPeters·
@WhatIsPrivate1 @emollick yes, it's interesting that AI doesn't seem to understand in what ways earlier image models are bad. The issue was not blurriness
English
1
0
5
100
Some Guy
Some Guy@WhatIsPrivate1·
@emollick my biggest complaint is that there is no way 2020 and 2022 models were as good as this suggests:
Some Guy tweet media
English
1
1
24
1.3K
Ethan Mollick
Ethan Mollick@emollick·
I have been using GPT ImageGen-2 for the past weeks I didn't think that better image-generators would be a big deal but it turns out that there is a quality threshold I didn't expect, where you can now get text, slides, academic papers Look at what it does with my "otter test"!
Ethan Mollick tweet mediaEthan Mollick tweet mediaEthan Mollick tweet mediaEthan Mollick tweet media
English
73
124
2K
196K
Amal Dorai
Amal Dorai@amaldorai·
@matt_beard_ I know an otherwise-vegan who eats oysters because she says they’re incapable of experiencing suffering
English
1
0
5
1K
Dominik Peters
Dominik Peters@DominikPeters·
@jxnlco @ajambrosino I suppose it could auto answer in case of no user input for 60s and tell codex that the user was unavailable and to use its best judgment
English
0
0
0
45
Dominik Peters
Dominik Peters@DominikPeters·
@jxnlco @ajambrosino to be fair, it is nice to know that after I click “implement plan” I know that I can go away from the computer and won’t accidentally stall progress by not timely answering a question. I don’t know the right UI for indicating “I’m open to questions” or not
English
1
0
0
58
jason liu
jason liu@jxnlco·
5 steps to making codex your chief of staff 1. download the desktop app 2. install the plugins you need for work 3. paste this into a thread and pin it 4. ??? 5. monitor the situation gist.github.com/jxnl/e96b08aae…
English
16
22
377
26.2K