MMitchell

22.1K posts

@mmitchell_ai

Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Similar content in the Skies (this bird has flown).

Joined June 2016
1.4K Following · 81.8K Followers
Cher Scarlett 🌌 @cherthedev
Graduated with honors as the commencement speaker. 💙🧡 Shipping off to Boston! 💙❤️
Cher Scarlett 🌌 tweet media
13
4
78
2.5K
Nathan Lambert @natolambert
@reflection_ai i want y'all to succeed but it's a little self-serving to say "America's open source AI lab" having released nothing
16
4
792
48.6K
MMitchell retweeted
Max Zhdanov @maxxxzdn
🌍Today we release Mosaic, a probabilistic weather model that shifts the Pareto frontier of ML weather forecasting. It matches the skill of state-of-the-art models while generating a 24-member, 10-day global forecast in under 12 s on a single H100. Thread!
28
141
1.3K
132.5K
MMitchell retweeted
Georgia Channing @cgeorgiaw
OlmoEarth v1.1 just dropped (thx @allen_ai) 🌍 This family of Earth observation foundation models for satellite imagery tasks (e.g. mangrove change tracking, forest loss driver classification) just got 3X CHEAPER/FASTER to run. The trick is redesigning what a token represents. Sentinel-2 inputs used to get one token per resolution (10m/20m/60m). v1.1 collapses them → 3x fewer tokens, quadratically cheaper compute.
Georgia Channing tweet media
3
24
183
8K
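A rough sketch of the "quadratically cheaper" arithmetic in the post above, assuming the cost in question is full self-attention, which grows with the square of the token count. The function name and the token counts are made up for illustration and do not come from OlmoEarth.

```python
def attention_cost(num_tokens: int) -> int:
    """Relative cost of full self-attention, which scales with num_tokens squared."""
    return num_tokens ** 2


# Hypothetical token counts: one token per resolution band versus one merged token.
tokens_before = 3 * 1024
tokens_after = tokens_before // 3

speedup = attention_cost(tokens_before) / attention_cost(tokens_after)
print(speedup)  # 9.0
```

Under this assumption, 3x fewer tokens cuts the attention term by about 9x; parts of the model that scale linearly with token count would only improve by about 3x.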
MMitchell @mmitchell_ai
A propos of nothing, I've been pseudocoding Beatles songs and it is insanely fun.
if x.person_type == "baby":
    shake_it_up(x)
    twist_and_shout(x)
    work_it_on_out(x)
    x.look_so_good = True
2
1
1
888
Leo Boytsov @srchvrs
Not defending the policy, but citations can be completely hallucinated too. Just ask the model to write a paragraph together with all the citations and additional bibtex entries. Of course, it's not something that I recommend, but I am pretty sure a lot of people do this. But, again, I don't approve draconian arxiv policy in this regard.
1
0
0
50
MMitchell retweeted
Andrew Dobrow @andrewdobrow
@tatumturnup He needs a doctor. Somebody call 1110001111.
27
42
2.9K
80.2K
Tatum Turn Up @tatumturnup
This is the greatest video I’ve ever seen. No notes. The lifeless clanker carcass just laying there. No crowd reaction, anything. Just Billie Jean. Until its lifeless shell is shamefully dragged off. Purely amazing.
1.4K
8.4K
91.9K
5.5M
MMitchell retweeted
Loubna Ben Allal @LoubnaBenAllal1
Introducing Carbon 🧬 a family of open generative DNA foundation models. Carbon-3B matches Evo2-7B while running 250x faster at inference. It can generate new DNA sequences and score the functional impact of mutations, zero-shot.
We borrowed a lot from how modern LLMs are trained, but DNA isn't language. Genomes are noisy, redundant, and shaped by evolution rather than communication. So we adjusted the recipe:
Tokenizer. Most genomic models tokenize at the nucleotide/character level, which blows up sequence length. BPE is the obvious LLM-style fix, but it doesn't behave well on DNA. We use deterministic 6-mer tokens (one token = 6 nucleotides): 6× shorter sequences and cheaper attention.
Training loss. With 6-mer tokens, cross-entropy scores a prediction that gets 5/6 nucleotides right the same as one that's completely wrong. This gets brittle late in training and produces loss spikes. We switch mid-training to a more flexible factorized loss (FNS).
Data. Genomes are mostly sparse, repetitive background. We curate down to a staged functional DNA + mRNA mixture, with every ratio chosen by ablation, like mixing a web corpus, but for biology.
We're releasing the models, training data, training code, evaluation suite, and a demo to play with.
More details in the technical report: github.com/huggingface/ca…
Demo to play with the model, with a biology primer for our ML friends ;) huggingface.co/spaces/Hugging…
16
82
358
38.6K
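A minimal sketch of the fixed 6-mer tokenization idea described in the Carbon post above, and of why token-level cross-entropy gives no partial credit. This is not Carbon's code: the vocabulary ordering, the choice to drop a trailing partial 6-mer, and the helper names are all assumptions made for illustration.

```python
from itertools import product

K = 6  # one token = 6 nucleotides, as stated in the post

# Assumed vocabulary: every 6-mer over A/C/G/T, 4**6 = 4096 entries.
VOCAB = {"".join(p): i for i, p in enumerate(product("ACGT", repeat=K))}


def tokenize(seq: str) -> list[int]:
    """Split a DNA string into non-overlapping 6-mers and map each to an id.

    Assumption: a trailing fragment shorter than 6 nucleotides is dropped.
    Characters outside A/C/G/T (such as N) raise KeyError in this sketch.
    """
    seq = seq.upper()
    usable = len(seq) - len(seq) % K
    return [VOCAB[seq[i:i + K]] for i in range(0, usable, K)]


def nucleotide_matches(a: str, b: str) -> int:
    """Count positions where two equal-length k-mers agree."""
    return sum(x == y for x, y in zip(a, b))


print(len(tokenize("ACGTACGTACGT")))  # 12 nucleotides become 2 tokens

# Two 6-mers that agree on 5 of 6 nucleotides are still different token ids,
# so a loss over token ids alone treats the near miss like any other miss.
target, guess = "ACGTAC", "ACGTAA"
print(nucleotide_matches(target, guess), tokenize(target) == tokenize(guess))
```

The last line prints `5 False`: five nucleotides match, yet the token ids differ, which is the brittleness the post attributes to plain cross-entropy over 6-mer tokens.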
MMitchell retweeted
Georgia Channing @cgeorgiaw
Hugging Science just got a whole lot more huggier 🤗🤗🤗 Today, we’re releasing a family of genomics models, which we call Carbon
Georgia Channing tweet media
8
45
281
18.8K
MMitchell @mmitchell_ai
Against the constant pressure of *genAI, genAI, genAI*, I am really appreciating @allen_ai 's work on creating tools for critical needs -- like crop maps and forest loss analysis. They just did a nice release on @huggingface , check it out (linked below)
MMitchell tweet media
2
4
22
1.3K
MMitchell retweeted
Karen Hao @_KarenHao
So much attention has been paid to the Musk v. Altman trial. But real accountability for the AI industry will not come from a billionaire mudfight. It will come from the movements around the world resisting the empires of AI. My op-ed for @guardian. theguardian.com/technology/com…
38
310
836
54.8K
MMitchell @mmitchell_ai
AI world! I am at a non-AI conference, and they have solved a *fundamental* conference issue. Get this: The name tags are printed on *both* sides. So when it inevitably flips, people can still see your details. 🤯 This changes the game!
2
1
14
1.6K
Hugo @Chilka_
@remilouf @dottxtai It looks cool Until you found out you need 45 min to buy bread. The wifi connection is working when it’s not raining only. There a ghost in the castle
8
0
21
5.1K
Rémi @remilouf
Starting a company in a garage is boring so we started @dottxtai in a French castle instead
Rémi tweet media
93
37
1.4K
185.2K
MMitchell @mmitchell_ai
@roydanroy Rather than “average human”, I believe the terminology many making these claims prefer is “median human” 🫠
1
0
1
1.2K
Dan Roy @roydanroy
Hot take 🔥: any company that thinks their company will reach AGI/ASI/whatever first and who is concerned about the average person and their livelihood due to their own products, should either be public or raise their next round in a way that the average person can invest. Otherwise, you're just enriching the billionaires at this point.
17
9
184
15.2K
MMitchell retweeted
NeurIPS Conference @NeurIPSConf
This year, to improve transparency and responsible use of datasets in the NeurIPS 2026 Evaluations and Datasets Track, all dataset submissions are now required to include Responsible AI (RAI) metadata as part of the dataset’s Croissant file. Find out more about this and RAI in our blog post: blog.neurips.cc/2026/05/04/res…
6
22
203
43.9K
MMitchell @mmitchell_ai
@GlennMatlin @vidalthi (Not unrelated to the issues with the Stochastic Parrots paper, btw -- we had both had extensive experience with what happens when reviewers know who we are vs when they don't)
0
0
1
44
MMitchell @mmitchell_ai
@GlennMatlin @vidalthi It really depends on who you are. I've published >100 papers, and have pretty consistently found that when people *don't* know who I am, they like the work better. Sometimes this has been a night-and-day difference. It may be, in part, a gender effect.
1
0
1
47
Thibaut Vidal @vidalthi
Happy to announce that our paper was rejected as a spotlight (5/5/4) at #ICML2026. If the methodology was complex enough to confuse the metareviewer, perhaps it may still be of broader interest to you 🙂. Happy to discuss the work if you are into optimal counterfactual maps that permit explanations in milliseconds, or into the occasional ups and downs of academic publishing 🚣
Thibaut Vidal tweet media
5
7
203
29.2K