Google Site Reliability Engineering
832 posts

@googlesre
⌨️🛠️📊 #SRE resources from the @Google Site Reliability Engineering team.
Joined October 2010
8 Following · 15.3K Followers

We are looking forward to next week’s SREcon in Seattle! We are excited for the opportunity to hear from and connect with reliability engineering folks from many organizations. Lots of listening and learning ahead for us!
google.smh.re/5QPq

@irohanrajput a greatly-expanded and very up-to-date second edition is coming out later this year!

YouTube: youtube.com/@GoogleSREProd…
Web: sre.google/prodcast/#seas…
... and wherever you enjoy finding and listening to podcasts ;)

Systems engineering and AI meet in the latest episode of the Prodcast, with guest Damion Yates, an #SRE at @GoogleDeepMind. google.smh.re/5OwR

YouTube: youtube.com/@GoogleSREProd…
Web: sre.google/prodcast/#seas…
... and wherever you enjoy finding and listening to podcasts ;)

When you have a novel incident and no playbook, you have a crisis. Join Carla Geisser of @LayerAleph to explore how you can lead through the fundamental surprise of a crisis. google.smh.re/5NmO

YouTube: youtube.com/@GoogleSREProd…
Web: sre.google/prodcast
... and wherever you enjoy finding and listening to podcasts ;)

Guests Felipe Tiengo Ferreira and Parker Barnes join hosts Matt Siegler and Steve McGhee to discuss AI model safety, from examining content to emerging security risks. The discussion emphasizes the vital role of SREs in managing safety at scale, detailing multi-layered defenses, including system instructions, LLM classifiers, and Automated Red Teaming (ART).

Explore the challenges around AI safety and production software in the latest episode of the Prodcast ... google.smh.re/5NQ_

YouTube: youtube.com/@GoogleSREProd…
Web: sre.google/prodcast
... and wherever you enjoy finding and listening to podcasts ;)

It's the year of Linux everything ... explore how you can bring reliability to a massive fleet of Linux devices in this episode of the Prodcast! google.smh.re/5Mms

YouTube: youtube.com/@GoogleSREProd…
Web: sre.google/prodcast
and wherever you enjoy finding and listening to podcasts ;)

In this episode of The Prodcast, Google SRE Denia del Cid breaks down how her team is leveraging AI to transform production workflows. Denia details practical applications like early outage detection, incident similarity analysis, and toil reduction. She explains the critical importance of validating against "golden data sets" and keeping humans in the loop to build trust. Discover how SREs are evolving from skepticism to strategic adoption with Gemini.

Explore the impact of AI on Site Reliability Engineering in this episode of The Prodcast, with Google SRE Denia del Cid.
Tune in for a pragmatic, measured look at the future of reliability. google.smh.re/5MJ6

We explore the intersection of SRE and security, unpacking the "Secure by Design" philosophy and the shared DNA of incident management.
Heather candidly discusses the rise of "Agentic AI hackers" and polymorphic malware, revealing how defenders can use AI to stay ahead. From "castle" defense strategies to "nodal biology" theories, this episode is a must-listen for anyone navigating the new era of AI-driven threats.

In this week's episode of the Prodcast, join Heather Adkins (@argvee), leader of Google’s Office of Cybersecurity Resilience, for a critical look at the future of digital defenses. google.smh.re/5LTz
