AWS apologizes for February 28 outage, takes steps to prevent similar events

It appears a single incorrect command set off a cascade of events that resulted in intermittent outages for Amazon customers ranging from government websites to music streaming services.

AWS took a lot of heat when its S3 storage service went down for several hours on Tuesday, and rightly so. Today the company published a post-mortem that explains what happened in technical detail and describes how it plans to prevent a similar event in the future.

Amazon Web Services, the network of remote data centers that powers some of the world's most popular websites, experienced a major disruption lasting several hours on Tuesday that left numerous apps and websites, including Business Insider, hard to access for many users. With two of S3's underlying subsystems out of operation, Amazon said, it was unable to handle any customer requests for S3 itself, or requests from services such as EC2 and Lambda functions that depend on S3.

Amazon.com's cloud computing unit said that the outage that shook up a sizable part of the internet Tuesday was caused by human error.

Other AWS services also rely on S3, so they were affected as well.

A large number of sites and services, including Airbnb, Business Insider, Expedia, Medium, Netflix, Quora, Slack, Trello, and the Securities and Exchange Commission, experienced issues related to the outage, VentureBeat reported at the time. Both affected subsystems required a full restart.

"As of 1:49 PM PST, we are fully recovered for operations for adding new objects in S3, which was our last operation showing a high error rate", AWS said in an update.

AWS said it has made changes so that it is no longer possible for "too much capacity to be removed too quickly".

That, according to AWS, should prevent an incorrect input from triggering another outage. To be fair, AWS outages like this one are extremely rare.

The team also reprioritized work to partition one of the affected subsystems into smaller "cells", which was planned for later this year but will now begin right away.

"We want to apologize for the impact this event caused for our customers".

"While we are proud of our long track record of availability with Amazon S3, we know how critical this service is to our customers, their applications and end users, and their businesses", the company wrote in an online message.
