Resilience

"Expired certificates, DDoS attacks; broken hardware or networks; Crowdstrike’s outage; third-party issues: those looking at what UK banks have blamed for their IT outages over the past two years will soon see that the list is wide. But the biggest culprit by far?"

The $925 billion financial services firm pushed AWS to create new stress-test capabilities for Lambda...

"Our mitigative actions haven't provided relief as expected, and a portion of infrastructure remains in an unhealthy state"

Both primary and backup systems collapsed in 20 seconds, forcing the cancellation of thousands of flights.
"When incidents occur, the most important thing is fixing, learning and applying that knowledge institutionally. Finger-pointing and ceremonial sacrifices are not a mature response. Bias towards concrete action and banish politics"

Network configuration, CA and SWIFT issues, and certificate expiration blamed for a series of RTGS outages the past year.