Content Paint

Outages

Beyond backups: Why SaaS outages are the new disaster recovery challenge

"The goal is shifting from automating recovery to sustaining continuity. When a SaaS outage occurs, resilience depends not on faster failover, but on clear processes, tested workarounds, and business-led decisions"

Azure in major network outage

Redmond to AWS: Hold my DNS beer...

AWS Outage: It’s always DNS, but… sometimes it’s overloaded network hardware

Incident "originated from within the EC2 internal network"

AWS' infamous US-EAST-1 glitches out across K8, identity – and AWS support

One-region trouble takes down AWS' own support tools too.

A bus stop advertising screen showing a blue "recovery" window after the Crowdstrike outage in 2024

AI and consolidation drive sales, but briefly bringing down large swathes of the planet is still a drag.

github outage

"We have identified the cause and are rolling out changes to restore normal service"

Google Cloud outage: Thundering herds and missing feature flags…

"If this had been flag protected, the issue would have been caught in staging"

Major Google Cloud and Cloudflare outages: What happened?

GCP promises in future to "prevent metadata from propagating globally without appropriate protection, testing and monitoring in place."

bank outages IT changes the stack

"Expired certificates, DDoS attacks; broken hardware or networks; Crowdstrike’s outage; third-party issues: those looking at what UK banks have blamed for their IT outages over the past two years will soon see that the list is wide. But the biggest culprit by far?"

Search the site

Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Great! You've successfully signed up.
Great! You've successfully signed up.
Welcome back! You've successfully signed in.
Success! You now have access to additional content.