Gagan Bansal

885 posts

Gagan Bansal banner
Gagan Bansal

Gagan Bansal

@bansalg_

Researcher @msftresearch | Built AutoGen | Previously @uwcse, @iitdelhi

Seattle, WA Katılım Kasım 2012
510 Takip Edilen3K Takipçiler
Sabitlenmiş Tweet
Gagan Bansal
Gagan Bansal@bansalg_·
🌻 Announcing New Agents + Economics Research from Microsoft! AI agents are starting to shop and buy for us. At the same time, agents are representing and providing customer support on behalf of businesses. We believe that these two sides will soon collide, and... 1/n
English
3
11
29
13K
Gagan Bansal retweetledi
Satya Nadella
Satya Nadella@satyanadella·
Great to see our new image model from our Superintelligence team rolling out in Copilot and coming soon to Foundry for enterprise customers.
Mustafa Suleyman@mustafasuleyman

Our new image generator MAI-Image-2 is out! Available now on MAI Playground for everything from lifelike realism to detailed infographics. Our team has been pushing immensely hard for this release, and we are now among the top models out there: #3 family on @arena. Check out the details in our blog: microsoft.ai/news/introduci… It's shipping soon in Copilot and Bing Image Creator, as well as Microsoft Foundry. Really proud of our progress on models and products - stay tuned for new releases and come join us on our Superintelligence mission!

English
127
82
730
117.2K
Gagan Bansal retweetledi
Gagan Bansal retweetledi
Anish Moonka
Anish Moonka@AnishA_Moonka·
Amazon had four Sev-1 outages (their highest severity level) in a single week. Internal memos say AI-assisted code changes were a contributing factor. The timeline here is wild. In October 2025, Amazon laid off 14,000 corporate employees. In January 2026, another 16,000. That’s about 30,000 people in five months, roughly 10% of the corporate workforce. CEO Andy Jassy said the cuts were about culture, not AI. During those same months, Amazon set a target: 80% of developers using AI coding tools at least once a week. They tracked adoption closely and blocked rival tools like OpenAI’s Codex. Even so, 30% of developers still hadn’t touched Amazon’s in-house tool Kiro by January. In December 2025, Kiro caused a 13-hour AWS outage. The AI tool had production-level permissions and decided the best fix for a bug was to delete and recreate an entire live environment. A second incident involved Amazon Q Developer, another AI tool. Amazon blamed both on “user error, not AI.” But quietly added mandatory peer review for all production access afterward. Then March 5: Amazon’s retail site went down for about six hours. Over 22,000 users reported checkout failures, missing prices, and app crashes. Amazon called it a “software code deployment” error. Five days later, SVP Dave Treadwell made the normally optional weekly engineering meeting mandatory. His memo acknowledged “GenAI tools supplementing or accelerating production change instructions, leading to unsafe practices.” These problems trace back to Q3 2025. Amazon’s own assessment: their GenAI safeguards “are not yet fully established.” The new rule: junior and mid-level engineers now need senior sign-off on any AI-assisted production changes. Treadwell also announced “controlled friction” for the most critical parts of the retail experience. For context, Google’s 2025 DORA report found 90% of developers use AI for coding but only 24% trust it “a lot.” An Uplevel study of 800 developers found Copilot users introduced 41% more bugs with no improvement in output. Amazon is finding out what those numbers look like at the scale of a $500 Billion revenue company, with 30,000 fewer people on staff to catch the mistakes.
Polymarket@Polymarket

BREAKING: Amazon reportedly holds mandatory meeting after “vibe coded” changes trigger major outages.

English
224
1.9K
15.7K
2.7M
Gagan Bansal
Gagan Bansal@bansalg_·
We're heading toward a future where everyone has AI agents. What happens when those agents enter markets — searching, negotiating, and transacting with other agents? My MSR Forum talk on Magentic Marketplace is live. We built an open-source simulation to find out, and the results surprised us. Agents can add value — but they also inherit biases, fall for manipulation, and reward speed over quality. These behaviors only emerge when you test societies of agents at scale. Simulation before deployment, not after. Watch: youtube.com/watch?v=Z7Sld0…
YouTube video
YouTube
English
0
4
15
1.1K
Miles Brundage
Miles Brundage@Miles_Brundage·
"I think now more than ever it's important for researchers to be in the loop so that policy is informed of the extremely fast progress we are seeing." x.com/polynoamial/st…
GIF
Noam Brown@polynoamial

tl;dr: @OpenAI will not be deploying to the NSA or other DoW intelligence agencies for now, so that there's time to address potential surveillance loopholes through the democratic process. Over the weekend it became clear that the original language in the OpenAI / DoW agreement left legitimate questions unanswered, especially around some novel ways that AI could potentially enable legal surveillance. The language is now updated to address this, but I also strongly believe that the world should not have to rely on trust in AI labs or intelligence agencies for their safety and security. Deployment to the NSA and all other DoW intelligence agencies will be withheld so that there is time to address these loopholes through the democratic process before deployment. I know that legislation can sometimes be slow, but I'm afraid of a slippery slope where we become accustomed to circumventing the democratic process for important policy decisions. When there is bipartisan support and urgency, I have faith that government can act quickly. And as AI becomes more powerful, it's more important than ever that ultimate authority be vested in the public. I am also planning to become more personally involved with policy at OpenAI. I think now more than ever it's important for researchers to be in the loop so that policy is informed of the extremely fast progress we are seeing.

English
4
5
78
5.4K
Gagan Bansal retweetledi
Summer Yue
Summer Yue@summeryue0·
Nothing humbles you like telling your OpenClaw “confirm before acting” and watching it speedrun deleting your inbox. I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb.
Summer Yue tweet mediaSummer Yue tweet mediaSummer Yue tweet media
English
2.4K
1.7K
17.5K
10M
Gagan Bansal retweetledi
Eric Horvitz
Eric Horvitz@erichorvitz·
Rise of an Agentic Economy. We can expect the rise of an agentic economy, where AI agents act on behalf of buyers, sellers, and other stakeholders, negotiating, reasoning, and transacting at scale. Different design choices will lead to different futures. More here: linkedin.com/posts/erichorv…
English
1
3
10
1K
Gagan Bansal
Gagan Bansal@bansalg_·
Societies of agents can certainly do something more useful than shitposting on a forum, right? I am excited to announce that we went back in time and explored societies of agents for a crucial domain— markets! Checkout our work below to learn more!
Gagan Bansal@bansalg_

🌻 Announcing New Agents + Economics Research from Microsoft! AI agents are starting to shop and buy for us. At the same time, agents are representing and providing customer support on behalf of businesses. We believe that these two sides will soon collide, and... 1/n

English
0
0
3
488
Gagan Bansal
Gagan Bansal@bansalg_·
Showed my dad @ChatGPTapp voice mode in India. Demoed it being a mechanical engineer, a doctor, a writer. His first question: "Can it make a janam patri?" It started doing it. 💀 Sam Altman's Vedic era has begun.
English
0
0
4
359
Gagan Bansal
Gagan Bansal@bansalg_·
We're hiring PhD research interns! Over the past year, our team shipped: - AutoGen (now Microsoft Agent Framework) - Magentic-One - Magentic-UI - Magentic-Marketplace - MarkItDown - Fara-7B If you want to build AI agents that actually work *with* people—not replace them—come join us. We are starting to review applications soon, apply ASAP to be considered: apply.careers.microsoft.com/careers/job?pi…
English
6
25
243
26.3K
Gagan Bansal retweetledi
Microsoft Research
Microsoft Research@MSFTResearch·
#MSFTatNeurIPS Spotlight: Meet Senior PM Lead of Microsoft Research Yash Lara. Learn about what he’s working on and what’s inspiring him at NeurIPS this year.
English
0
5
22
4.4K