[RESOLVED] Increased Error Rates and Latencies
Oct 20 3:53 PM PDT

Between 11:49 PM PDT on October 19 and 2:24 AM PDT on October 20, we experienced increased error rates and latencies for AWS services in the US-EAST-1 Region. Services and features that rely on US-EAST-1 endpoints, such as IAM and DynamoDB Global Tables, were also affected during this time.

At 12:26 AM PDT on October 20, we identified the trigger of the event as DNS resolution issues for the regional DynamoDB service endpoints. After we resolved the DynamoDB DNS issue at 2:24 AM PDT, services began recovering, but the internal EC2 subsystem responsible for launching EC2 instances became impaired due to its dependency on DynamoDB. As we continued to work through the EC2 instance launch impairment, Network Load Balancer health checks also became impaired, resulting in network connectivity issues in multiple services, including Lambda, DynamoDB, and CloudWatch. We restored Network Load Balancer health checks at 9:38 AM PDT.

As part of the recovery effort, we temporarily throttled some operations, such as EC2 instance launches, processing of SQS queues via Lambda event source mappings, and asynchronous Lambda invocations. Over time we reduced these throttles and worked in parallel to resolve the network connectivity issues until services fully recovered. By 3:01 PM PDT, all AWS services had returned to normal operations. Some services, such as AWS Config, Redshift, and Connect, continue to have a backlog of messages that they will finish processing over the next few hours. We will share a detailed AWS post-event summary.
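For illustration only, a DNS resolution failure of the kind described above can be observed from a client with a minimal sketch like the following (the hostname is the standard public DynamoDB endpoint for US-EAST-1; the `can_resolve` helper is our own, not an AWS API):

```python
import socket

def can_resolve(hostname: str) -> bool:
    """Return True if the hostname resolves to at least one address."""
    try:
        # getaddrinfo performs the same lookup an SDK's HTTPS client would;
        # a gaierror here corresponds to the "DNS resolution issues" above.
        return len(socket.getaddrinfo(hostname, 443)) > 0
    except socket.gaierror:
        return False

# The regional DynamoDB service endpoint named in this event.
print(can_resolve("dynamodb.us-east-1.amazonaws.com"))
```

During the impaired window, a check like this against the regional endpoint would have returned False even while the DynamoDB service itself was healthy behind it.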