Mitchell Wortsman

423 posts

Mitchell Wortsman banner
Mitchell Wortsman

Mitchell Wortsman

@Mitchnw

@AnthropicAI | prev @uwcse

Katılım Ekim 2011
1K Takip Edilen1.9K Takipçiler
Mitchell Wortsman retweetledi
Mike A. Merrill
Mike A. Merrill@Mike_A_Merrill·
New job! I’m hiring folks interested in building and researching the next generation of evals and eval infa. DMs are open :)
Mike A. Merrill tweet media
English
114
69
2.2K
144.6K
Sebastian Jaszczur
Sebastian Jaszczur@S_Jaszczur·
Today I’m joining @AnthropicAI ! After finishing my PhD last month at University of Warsaw and IDEAS NCBR, I’m excited to bring my experience to this amazing company and hope to make Claude even better - and push AI frontier even further :)
English
29
17
589
42.3K
Mitchell Wortsman retweetledi
Ludwig Schmidt
Ludwig Schmidt@lschmidt3·
Very excited to finally release our paper for OpenThoughts! After DataComp and DCLM, this is the third large open dataset my group has been building in collaboration with the DataComp community. This time, the focus is on post-training, specifically reasoning data.
Ludwig Schmidt tweet media
English
22
202
1.3K
187.2K
Mitchell Wortsman retweetledi
Anthropic
Anthropic@AnthropicAI·
Introducing the next generation: Claude Opus 4 and Claude Sonnet 4. Claude Opus 4 is our most powerful model yet, and the world’s best coding model. Claude Sonnet 4 is a significant upgrade from its predecessor, delivering superior coding and reasoning.
Anthropic tweet media
English
932
3.2K
20.8K
4.3M
Mitchell Wortsman retweetledi
Cade Gordon
Cade Gordon@CadeGordonML·
Excited to share that I'll be joining @Anthropic to work on pretraining science! I've chosen to defer my Stanford PhD, where I'm honored to be supported by the Hertz Fellowship. There's something special about the science, this place, and these people. Looking forward to joining some of my most brilliant and compassionate colleagues!
English
42
10
762
58.7K
Mitchell Wortsman retweetledi
Mike A. Merrill
Mike A. Merrill@Mike_A_Merrill·
Many agents (Claude Code, Codex CLI) interact with the terminal to do valuable tasks, but do they currently work well enough to deploy en masse? We’re excited to introduce Terminal-Bench: An evaluation environment and benchmark for AI agents on real-world terminal tasks. Tl;dr lots of room for improvement! tbench.ai
Mike A. Merrill tweet media
English
16
61
243
51.7K
Mitchell Wortsman retweetledi
Alex Li
Alex Li@alexlioralexli·
Excited to be presenting at #ICLR2025 at 10am today on how generative classifiers are much more robust to distribution shift. Come by to chat and say hello!
Alex Li tweet media
English
2
5
91
6.5K
Mitchell Wortsman retweetledi
Alex Li
Alex Li@alexlioralexli·
I'm presenting our #NeurIPS2024 work on Attention Transfer today! Key finding: Pretrained representations aren't essential - just using attention patterns from pretrained models to guide token interactions is enough for models to learn high-quality features from scratch and match ImageNet performance! 🤯 Chat with me and @endernewton Dec 12 (today), 4:30 -7:30 pm PST, East Exhibit Hall #1900
Alex Li tweet media
English
2
22
156
14.1K
Mitchell Wortsman retweetledi
Akari Asai
Akari Asai@AkariAsai·
🚨 I’m on the job market this year! 🚨 I’m completing my @uwcse Ph.D. (2025), where I identify and tackle key LLM limitations like hallucinations by developing new models—Retrieval-Augmented LMs—to build more reliable real-world AI systems. Learn more in the thread! 🧵
Akari Asai tweet media
English
27
114
814
126.6K
Mitchell Wortsman retweetledi
Ofir Press
Ofir Press@OfirPress·
I'm on the academic job market! I develop autonomous systems for: programming, research-level question answering, finding sec vulnerabilities & other useful+challenging tasks. I do this by building frontier-pushing benchmarks and agents that do well on them. See you at NeurIPS!
Ofir Press tweet media
English
9
38
230
24K
Mitchell Wortsman retweetledi
Anthropic
Anthropic@AnthropicAI·
Introducing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku. We’re also introducing a new capability in beta: computer use. Developers can now direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking, and typing text.
Anthropic tweet media
English
473
1.8K
10K
3.7M
Mitchell Wortsman retweetledi
Ross Wightman
Ross Wightman@wightmanr·
OpenCLIP passed 10K stars on GitHub this week. A big milestone for any open-source project. 🍻 to the many collaborators that made that possible. Coincidentally, I pushed a new release with a port of the largest multi-lingual SigLIP -- a SO400M/16 @ 256x256 that appeared on big_vision a little while back. Now on the @huggingface hub and useable via timm or OpenCLIP (update your timm too)! huggingface.co/timm/ViT-SO400…
English
5
15
147
26.2K
Mitchell Wortsman retweetledi
Katie Everett
Katie Everett@_katieeverett·
Come chat with me and @Locchiu at our ICML poster session 1:30-3pm CEST (Vienna time) today at Hall C 4-9 #2500 and see how our theory lets all parameterizations perform hyperparameter transfer! arxiv.org/abs/2407.05872
Katie Everett tweet media
English
6
3
28
65.7K
Mitchell Wortsman retweetledi
Vaishaal Shankar
Vaishaal Shankar@Vaishaal·
We have released our DCLM models on huggingface! To our knowledge these are by far the best performing truly open-source models (open data, open weight models, open training code) 1/5
English
9
63
287
51.2K
Mitchell Wortsman retweetledi
Katie Everett
Katie Everett@_katieeverett·
We've gotten some great questions about the notion of alignment in our width-scaling parameterization paper! arxiv.org/abs/2407.05872 A deep dive into the alignment metric and intuition 🧵 [1/16]
Katie Everett tweet mediaKatie Everett tweet media
English
4
17
68
14.8K
Mitchell Wortsman retweetledi
Tomer Porian
Tomer Porian@tomerporian·
🧵1/8 We resolve the discrepancy between the compute optimal scaling laws of Kaplan (exponent 0.88, Figure 14, left) et al. and Hoffmann et al. (“Chinchilla”, exponent 0.5). Paper: arxiv.org/abs/2406.19146 Data + Code: github.com/formll/resolvi…
Tomer Porian tweet media
English
6
33
172
36K
Mitchell Wortsman retweetledi
Anthropic
Anthropic@AnthropicAI·
We're also launching a preview of Artifacts on claude.ai. You can ask Claude to generate docs, code, mermaid diagrams, vector graphics, or even simple games. Artifacts appear next to your chat, letting you see, iterate, and build on your creations in real-time.
English
37
188
1.6K
587.5K
Mitchell Wortsman retweetledi
Anthropic
Anthropic@AnthropicAI·
Introducing Claude 3.5 Sonnet—our most intelligent model yet. This is the first release in our 3.5 model family. Sonnet now outperforms competitor models on key evaluations, at twice the speed of Claude 3 Opus and one-fifth the cost. Try it for free: claude.ai
Anthropic tweet media
English
421
1.5K
7.1K
2.5M