MMitchell

22.1K posts

MMitchell

@mmitchell_ai

Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Similar content in the Skies (this bird has flown).

Katılım Haziran 2016

1.4K Takip Edilen81.8K Takipçiler

MMitchell@mmitchell_ai·1d

@cherthedev Amazing. You’re such a role model!

English

640

Cher Scarlett 🌌@cherthedev·2d

Graduated with honors as the commencement speaker. 💙🧡 Shipping off to Boston! 💙❤️

English

2.6K

MMitchell@mmitchell_ai·3d

@natolambert @reflection_ai <3

1.4K

Nathan Lambert@natolambert·3d

@reflection_ai i want y'all to succeed but it's a little self-serving to say "America's open source AI lab" having released nothing

English

792

48.6K

Reflection@reflection_ai·3d

x.com/i/article/2057…

ZXX

38.8K

MMitchell retweetledi

Max Zhdanov@maxxxzdn·5d

🌍Today we release Mosaic, a probabilistic weather model that shifts the Pareto frontier of ML weather forecasting. It matches the skill of state-of-the-art models while generating a 24-member, 10-day global forecast in under 12 s on a single H100. Thread!

English

141

1.3K

132.5K

MMitchell retweetledi

Georgia Channing@cgeorgiaw·4d

OlmoEarth v1.1 just dropped (thx @allen_ai) 🌍 This family of Earth observation foundation models for satellite imagery tasks (e.g. mangrove change tracking, forest loss driver classification) just got 3X CHEAPER/FASTER to run. The trick is redesigning what a token represents. Sentinel-2 inputs used to get one token per resolution (10m/20m/60m). v1.1 collapses them → 3x fewer tokens, quadratically cheaper compute.

English

183

8.1K

MMitchell@mmitchell_ai·5d

A propos of nothing, I've been pseudocoding Beatles songs and it is insanely fun. if x.person_type == "baby": shake_it_up(x) twist_and_shout(x) work_it_on_out(x) x.look_so_good = True

English

890

MMitchell@mmitchell_ai·5d

@srchvrs @Aaroth Sure, and I believe there should be ramifications for that!

English

Leo Boytsov@srchvrs·5d

Not defending the policy, but citations can be completely hallucinated too. Just ask the model to write a paragraph together with all the citations and additional bibtex entries. Of course, it's not something that I recommend, but I am pretty sure a lot of people do this. But, again, I don't approve draconian arxiv policy in this regard.

English

Aaron Roth@Aaroth·5d

A clearly hallucinated citation! NeurIPS 2026 decisions aren't out yet. But wait --- the hallucination is also present in the bibtex entries from openreview openreview.net/forum?id=fAjbY… and Google Scholar scholar.googleusercontent.com/scholar.bib?q=…

English

22.1K

MMitchell@mmitchell_ai·5d

@andrewdobrow @tatumturnup Bahaha this is brilliant.

English

329

MMitchell retweetledi

Andrew Dobrow@andrewdobrow·5d

@tatumturnup He needs a doctor. Somebody call 1110001111.

English

2.9K

80.2K

Tatum Turn Up@tatumturnup·6d

This is the greatest video I’ve ever seen. No notes. The lifeless clanker carcass just laying there. No crowd reaction, anything. Just Billie Jean. Until its lifeless shell is shamefully dragged off. Purely amazing.

English

1.4K

8.4K

91.9K

5.5M

MMitchell retweetledi

Loubna Ben Allal@LoubnaBenAllal1·6d

Introducing Carbon 🧬 a family of open generative DNA foundation models. Carbon-3B matches Evo2-7B while running 250x faster at inference. It can generate new DNA sequences and score the functional impact of mutations, zero-shot. We borrowed a lot from how modern LLMs are trained, but DNA isn't language. Genomes are noisy, redundant, and shaped by evolution rather than communication. So we adjusted the recipe: Tokenizer. Most genomic models tokenize at the nucleotide/character level, which blows up sequence length. BPE is the obvious LLM-style fix, but it doesn't behave well on DNA. We use deterministic 6-mer tokens (one token = 6 nucleotides): 6× shorter sequences and cheaper attention. Training loss. With 6-mer tokens, cross-entropy scores a prediction that gets 5/6 nucleotides right the same as one that's completely wrong. This gets brittle late in training and produces loss spikes. We switch mid-training to a more flexible factorized loss (FNS). Data. Genomes are mostly sparse, repetitive background. We curate down to a staged functional DNA + mRNA mixture, with every ratio chosen by ablation, like mixing a web corpus, but for biology. We're releasing the models, training data, training code, evaluation suite, and a demo to play with. More details in the technical report: github.com/huggingface/ca… Demo to play with the model, with a biology primer for our ML friends ;) huggingface.co/spaces/Hugging…

English

358

38.7K

MMitchell retweetledi

Georgia Channing@cgeorgiaw·6d

Hugging Science just got a whole lot more huggier 🤗🤗🤗 Today, we’re releasing a family of genomics models, which we call Carbon

English

281

18.8K

MMitchell@mmitchell_ai·6d

huggingface.co/blog/allenai/o…

ZXX

571

MMitchell@mmitchell_ai·6d

Against the constant pressure of *genAI, genAI, genAI*, I am really appreciating @allen_ai 's work on creating tools for critical needs -- like crop maps and forest loss analysis. They just did a nice release on @huggingface , check it out (linked below)

English

1.3K

MMitchell@mmitchell_ai·18 May

Also women. (Which requires AI companies to consistently hire, retain, and promote them)

Chris Olah@ch402

The questions posed by AI are bigger than the AI community. We urgently need the world – religions, civil society, academics, governments – to participate in creating a positive outcome. I'm glad the Catholic Church is engaging, and honored to speak at the presentation.

English

2.4K

MMitchell retweetledi

Karen Hao@_KarenHao·14 May

So much attention has been paid to the Musk v. Altman trial. But real accountability for the AI industry will not come from a billionaire mudfight. It will come from the movements around the world resisting the empires of AI. My op-ed for @guardian. theguardian.com/technology/com…

English

310

836

55K

MMitchell@mmitchell_ai·15 May

AI world! I am at a non-AI conference, and they have solved a *fundamental* conference issue. Get this: The name tags are printed on *both* sides. So when it inevitably flips, people can still see your details. 🤯 This changes the game!

English

1.6K

MMitchell@mmitchell_ai·8 May

@Chilka_ @remilouf @dottxtai The ghost is a perk tho.

English

Hugo@Chilka_·7 May

@remilouf @dottxtai It looks cool Until you found out you need 45 min to buy bread. The wifi connection is working when it’s not raining only. There a ghost in the castle

English

5.1K

Rémi@remilouf·6 May

Starting a company in a garage is boring so we started @dottxtai in a French castle instead

English

1.4K

185.3K

MMitchell@mmitchell_ai·8 May

@roydanroy Rather than “average human”, I believe the terminology many making these claims prefer is “median human” 🫠

English

1.2K

Dan Roy@roydanroy·8 May

Hot take 🔥: any company that thinks their company will reach AGI/ASI/whatever first and who is concerned about the average person and their livelihood due to their own products, should either be public or raise their next round in a way that the average person can invest. Otherwise, you're just enriching the billionaires at this point.

English

184

15.2K

MMitchell retweetledi

NeurIPS Conference@NeurIPSConf·4 May

This year, to improve transparency and responsible use of datasets in the NeurIPS 2026 Evaluations and Datasets Track, all dataset submissions are now required to include Responsible AI (RAI) metadata as part of the dataset’s Croissant file. Find out more about this and RAI in our blog post: blog.neurips.cc/2026/05/04/res…

English

203

43.9K

MMitchell@mmitchell_ai·2 May

@GlennMatlin @vidalthi (Not unrelated to the issues with the Stochastic Parrots paper, btw -- we had both had extensive experience with what happens when reviewers know who we are vs when they don't)

English

MMitchell@mmitchell_ai·2 May

@GlennMatlin @vidalthi It really depends on who you are. I've published >100 papers, and have pretty consistently found that when people *don't* know who I am, they like the work better. Sometimes this has been a night-and-day difference. It may be, in part, a gender effect.

English

Thibaut Vidal@vidalthi·1 May

Happy to announce that our paper was rejected as a spotlight (5/5/4) at #ICML2026. If the methodology was complex enough to confuse the metareviewer, perhaps it may still be of broader interest to you 🙂. Happy to discuss the work if you are into optimal counterfactual maps that permit explanations in milliseconds, or into the occasional ups and downs of academic publishing 🚣

English

203

29.2K

Keşfet

@cherthedev @natolambert @reflection_ai @allen_ai @srchvrs @Aaroth @andrewdobrow @tatumturnup