Date of Incident: 2025-04-22
Time of Incident (EST): 8:51am - 9:31am
Service(s) Affected: SSO, Web Sign In, Sign Up, Web Interface, CLI
Impact Duration: 40 minutes
Summary
On April 22, 2025 1Password’s web interface and APIs were unavailable for all customers accessing the US region. This was not a result of a security incident and customer data was not affected. A database query consumed database resources to the point where other queries began to fail. As a result, requests to the web interface and to APIs in the region returned with an error.
Impact on Customers
During the duration of the incident:
- Web interface, Administration: Customers were unable to log in to the web interface for 1Password. Users already logged in could not use the web application. Administrators were unable to use the administration tools. Users were presented with the error “upstream connect error or disconnect/reset before headers. reset reason: overflow.”
- Single Sign-on (SSO), Multi-factor Authentication (MFA): Users on accounts with SSO or MFA enabled were unable to sign in and were presented with the error above, or “An unexpected error occurred.”
- Command Line Interface (CLI): CLI users received the above errors when accessing our web APIs.
- Browser Extension: Users who needed to authenticate via the web interface were unable to unlock their vaults.
Scope
- All users accessing services in the US/Global (non-Canada and non-EU) region were affected.
What Happened?
A problem with database infrastructure prevented services from reading or writing data. There were no recent changes to the infrastructure, but additional load from multiple operations contributed to the issue.
How Was It Resolved?
- Mitigation Steps: We halted non-critical processes to reduce load on the database.
- Resolution Steps: All database connections were routed to a healthy replica. This restored service.
- Verification of Resolution: Monitoring systems were closely observed for 1 hour to ensure error rates returned to normal.
What We Are Doing to Prevent Future Incidents
- Improving monitors: We are updating our monitoring systems to better detect database issues like this before they can impact customers.
- Improve database performance: We are tuning queries and refactoring services to improve performance and reduce load on the database.
- Review database configuration: We are reviewing database size and configuration to optimize performance and enhance monitoring capabilities.
Next Steps
- No action is needed from customers
We are committed to providing a reliable and stable service, and we are taking the necessary steps to learn from this event and prevent it from happening again. Thank you for your understanding.
Sincerely,
The 1Password Team