Shaden Smith

195 posts

Shaden Smith

Shaden Smith

@shaden_smith

Technical Staff at @MicrosoftAI. Prev. @InflectionAI, @MSFTDeepSpeed, and @Intel. Into horror, herpetology, and high performance computing. he/him

Bellevue, WA Katılım Ocak 2017
687 Takip Edilen256 Takipçiler
Sabitlenmiş Tweet
Shaden Smith retweetledi
Paul Soulos
Paul Soulos@paulsoulos·
It was an honor working on MAI-Thinking-1! A lot of thought and effort went into this model, and we want to share some learnings with you. 109 pages worth 😅 microsoft.ai/wp-content/upl…
Paul Soulos tweet media
Mustafa Suleyman@mustafasuleyman

Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier. First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks. - It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities. - It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks. - And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end. Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing. Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI. - Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost. All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat. Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost. Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare. Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: microsoft.ai/news/building-…

English
3
2
45
4.9K
Shaden Smith retweetledi
Mustafa Suleyman
Mustafa Suleyman@mustafasuleyman·
Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier. First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks. - It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities. - It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks. - And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end. Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing. Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI. - Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost. All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat. Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost. Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare. Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: microsoft.ai/news/building-…
Mustafa Suleyman tweet media
English
192
541
3.8K
1.3M
Shaden Smith retweetledi
Nando de Freitas
Nando de Freitas@NandoDF·
I’d like to hire strong data engineers to join our Microsoft Super Intelligence (MSI) team. I am interested in people who are good at processing PDFs and other documents at billion scale, and people good at parsing the web at trillion scale. If you dream of processing all of human knowledge to advance science and engineering, this is for you. Also looking for strong evaluation and post-training engineers. Be part of our first launches this year 🚀 We have all the resources in the world to support you, working in startup mode, while powering a large organisation with billions of users. Hiring in London, Zurich, New York, Boston, Toronto, Seattle and SF. Please send your CV to JoinAITeam@microsoft.com
English
19
48
523
103.5K
Shaden Smith retweetledi
Aaron
Aaron@aaronbatilo·
Y'all watch @karpathy to learn about the whole network but I watch him to learn about F2L
English
0
1
1
74
Shaden Smith retweetledi
Nando de Freitas
Nando de Freitas@NandoDF·
We are hiring star research and data engineers to invent the future of AI. JoinAITeam@microsoft.com If you’re finishing your undergrad or PhD at Imperial, Cambridge, Oxford, UCL, Toronto, MIT, MILA, UBC, ETH, Stanford, Caltech, UCLA, Berkeley, CMU, UW, NYU, Princeton, Columbia, Harvard, Yale or any other top school in STEM, please apply too. I love working with energetic people, who are prepared to work on what is needed to shape AI, make it safe, make it brilliant, make it creative, and make it useful in math, science, healthcare, education, energy and environment.
Brad Gerstner@altcap

Look forward to having @satyanadella & @sama on @BG2Pod tomorrow. The deal. The skeptics. The re-industrialization of America. Power. Chips. Models. Agents. AGI. Regulation. Jobs. And more… 🧐🚀🇺🇸

English
45
68
675
223K
Shaden Smith retweetledi
Mustafa Suleyman
Mustafa Suleyman@mustafasuleyman·
Excited to share our first @MicrosoftAI in-house models: MAI-Voice-1 and MAI-1-preview. Details and how you can test below, with lots more to come⬇️
Mustafa Suleyman tweet media
English
90
169
961
348.3K
Shaden Smith retweetledi
Charlie Marsh
Charlie Marsh@charliermarsh·
uv now ships with dedicated documentation for PyTorch
Charlie Marsh tweet media
English
12
60
514
35.8K
Aaron
Aaron@aaronbatilo·
Find me a genre that makes you feel more badass than heavy metal bagpipes. Go on. I'll wait
English
1
0
1
65
Shaden Smith
Shaden Smith@shaden_smith·
@aaronbatilo Wanna throw back a couple of cold ones at the Spaghetti Factory
English
1
0
1
88
Aaron
Aaron@aaronbatilo·
Why don't people order milk more often at restaurants
English
1
0
3
86
Shaden Smith
Shaden Smith@shaden_smith·
@aaronbatilo I appreciate your unwavering dedication to accurate telemetry.
English
0
0
1
46
Aaron
Aaron@aaronbatilo·
New weight loss meta. I'm too lazy to record the calories in Fitbit, so I just don't eat
English
1
0
1
82
Shaden Smith retweetledi
Nando de Freitas
Nando de Freitas@NandoDF·
I’ve joined @Microsoft AI to advance the frontier of large scale multimodal AI research and to build products for people to achieve meaningful goals and dreams. The MAI team is small, but well resourced and ambitious. We are now looking for exceptional ICs, who like to ship. If you you’re interested in multimodal AI, both recognition and generation, love to collaborate and empower others, believe in diversity and inclusion, have a growth mindset, and want to impact the future of AI in a positive and profound way, please message me directly. I believe this is a rare and unique opportunity to join a new AI team that will shape the future. @black_in_ai @WiMLworkshop @_LXAI
Nando de Freitas tweet media
English
62
46
956
108.6K
Shaden Smith retweetledi
Aaron
Aaron@aaronbatilo·
Y'all out here judging LLMs on instruction following but have you ever asked a human to follow an onboarding document before?
English
0
1
3
208
Shaden Smith retweetledi
Mustafa Suleyman
Mustafa Suleyman@mustafasuleyman·
The UK has phenomenal AI talent and a long established culture of responsible AI development. Today I’m proud to be opening a new office: Microsoft AI London. If you’d like to join us, get in touch. We’re hiring! blogs.microsoft.com/blog/2024/04/0…
English
91
250
1.9K
358.4K
Shaden Smith retweetledi
Inflection AI
Inflection AI@inflectionAI·
Happy #PiDay, let's toast to infinite adventures with @pi - now available on 13 platforms. Ready for wherever life’s adventures take you. Cheers to 3.14 and beyond! inflection.ai/pi-is-availabl…
English
19
87
154
100.5K
Shaden Smith retweetledi
Inflection AI
Inflection AI@inflectionAI·
Pi just got a huge upgrade! It’s now powered by our latest LLM: Inflection-2.5, which is neck and neck with GPT-4 on all benchmarks and used less than half the compute to train. Pi now has world class IQ, combined with its distinctively kind and curious personality. Give it a go at pi.ai inflection.ai/inflection-2-5
English
84
245
957
684.5K
Shaden Smith retweetledi
Stas Bekman
Stas Bekman@StasBekman·
The other news is the introduction of @MSFTDeepSpeed Meetups, which will be conducted once about every 3 months. The inaugural one will be on Feb 12 6:00 PM - 8:00 PM at Redmond Reactor developer.microsoft.com/en-us/reactor/… Quote: "This will be the first ever meetup for the DeepSpeed open-source project. We will have an overview of DeepSpeed, latest features and release, and deeper dive talks on particular new/important features and use cases." I think they plan to organize it at other locales as well, such as Seattle and San Francisco. I hope they record those!
English
0
2
24
2.4K
Shaden Smith retweetledi
Stas Bekman
Stas Bekman@StasBekman·
Finally I'm being told MSFT allocated more engineering positions on the @MSFTDeepSpeed team. As a long time Deepspeed user for the first few years I had to fix many bugs myself since the team was so small, and finally the time has come where I can just report them and the growing Deepspeed team is there to fix them and add new features quickly. So if you're a good ML engineer I'm selfishly interested for you to join the Deepspeed team as it'll help me and numerous other Deepspeed users to get the best scalability and speed of LLM training and inference. So if it resonates please head here: - Principal Software Engineer jobs.careers.microsoft.com/global/en/job/… - Senior Software Engineer jobs.careers.microsoft.com/global/en/job/… edit: actually I see there are 6 openings! jobs.careers.microsoft.com/global/en/sear… Thank you!
English
0
15
91
29.5K