Intermittent HTTP 401 errors in us-east-1
Incident Report for Oso
Resolved
From 22:03 - 22:26 UTC, requests to tasks deployed on the corrupted container instance incorrectly failed with HTTP 401 status codes. The corrupted instance was terminated at 22:26 at which point service was restored. The container instance was corrupted due to a race condition triggered by how AWS ECS launches tasks on container instances. We have since made changes to safeguard against this race condition.
Posted Jan 15, 2025 - 22:00 UTC