Outages

If it’s always DNS, what’s the answer?

"We need more builders who can innovate on the foundations of the internet. More internet plumbers and network janitors."

Beyond backups: Why SaaS outages are the new disaster recovery challenge

"The goal is shifting from automating recovery to sustaining continuity. When a SaaS outage occurs, resilience depends not on faster failover, but on clear processes, tested workarounds, and business-led decisions."

Azure in major network outage

Redmond to AWS: Hold my DNS beer...

AWS Outage: It’s always DNS, but… sometimes it’s overloaded network hardware

Incident "originated from within the EC2 internal network"

AWS' infamous US-EAST-1 glitches out across K8, identity – and AWS support

One-region trouble takes down AWS' own support tools too.

A bus stop advertising screen showing a blue "recovery" window after the CrowdStrike outage in 2024

AI and consolidation drive sales, but briefly bringing down large swathes of the planet is still a drag.

GitHub outage

"We have identified the cause and are rolling out changes to restore normal service"

Google Cloud outage: Thundering herds and missing feature flags…

"If this had been flag protected, the issue would have been caught in staging"

Major Google Cloud and Cloudflare outages: What happened?

GCP promises in future to "prevent metadata from propagating globally without appropriate protection, testing and monitoring in place."
