Melanie Mitchell

6.4K posts

@MelMitchell1

Professor, Santa Fe Institute. Mostly posting on https://t.co/4NpA2IL5Va (at-melaniemitchell). More thoughts at https://t.co/nC43NHRozX.

Santa Fe, NM. Joined September 2011
670 Following, 50.2K Followers
Melanie Mitchell @MelMitchell1
@GregKamradt I use em-dashes and colons all the time, sigh. AI is stealing our best punctuation marks. Who can we sue?
11
8
139
6.5K
Greg Kamradt @GregKamradt
My current smells of AI slop/writing: 1. Use of em dash "—". I haven't seen anyone seriously use this over a hyphen "-". Double points for wrapping — or making double points in a single sentence — give it away 2. Making a statement and then colon: like this 3. More subtle, but tone that is off-character for the authors
94
3
164
45.8K
Tom Chivers @TomChivers
just saw this in a review of Michael Pollan's new book theatlantic.com/books/2026/02/… surely even a second's thought would show this is wrong? A single neuron cannot, for instance, tell me what move to make in chess or the best route from Crouch End to Muswell Hill
7
1
52
8.3K
Greg Kamradt @GregKamradt
Thanks for this, @MelMitchell1. I believe what you’re referring to is along the lines of what @ZennaTavares has done with AutumnBench? Rather than focusing on a score, have an AI system interact with an environment and then ask it questions about that environment to ensure it actually understood it, not just stumbled its way to a high score.
2
0
12
804
Melanie Mitchell @MelMitchell1
I appreciate this reflection on why ARC matters, but in the replies I expressed some reservations about what high accuracy on ARC benchmarks actually reflects for assessing "human-like fluid intelligence" (Chollet's stated goal for ARC).
Mike Knoop @mikeknoop

x.com/i/article/2022…

3
8
59
9.6K
Melanie Mitchell @MelMitchell1
@mikeknoop To make sure that progress on ARC (including 1, 2, and 3) correlates with actual progress in human-like fluid intelligence, rather than being a reflection of "Rube-Goldberg-like" shortcuts, evaluation has to be broadened to understand the mechanisms underlying high accuracy....
3
2
45
5.6K
Adrienne Fairhall @alfairhall
Epstein list of invitees to 92nd St Y. So statistically unlikely that so many ……. have blue eyes
5
3
15
4.6K
Thomas G. Dietterich @tdietterich
#ICML2026 authors, if you can flatten the curve on submissions to @arxiv, the cs.LG moderators would really appreciate it. Today we have 4 times the usual number of submissions. Please select a random number unif(1,14) and wait that number of days to submit! :-)
4
23
301
34.3K