hyperdx retweetledi

We wanted to believe LLMs could replace SREs.
After running our experiment…. we don’t.
clickhou.se/41823HM
Here’s what happened when we put GPT-5, Claude Sonnet 4, GPT-o3, and Gemini 2.5 Pro to the test on observability data.
English


















