Uptimus
uptimus
Sign inGet alerts
HomePricingFeaturesIncidentsReports

Status Provider

Home

/

Status

/

Service

ThisIsDownThisIsDown

uptimus

Public status intelligence for services, apps, games, and official status page providers.

Product

HomeFeaturesPricingStatus providers

Monitoring

WebsitesMobile appsMobile gamesSteam games

Community

IncidentsReportsSign inSign up

© 2026 Uptimus. All rights reserved.

Know what is down before your users do.

Opsgenie logo
Operational

Opsgenie Status

On-call and alert management for always-on services.

Source

auto

Category

Development

Adapter

STATUSPAGE IO

Verified

Pending review

Get alertsOfficial status page

Current state

Operational

Checked 42m ago

47

Components

0

Active incidents

0

Maintenance

100%

90d uptime

Atlassian Cloud Services impacted

Oct 20, 7:56 AM

Service health overview

Operational snapshot

Normalized official status-page data for incidents, maintenance, components, and history.

100%

Known uptime

1 known history days

47

Components tracked

0 outage, 0 degraded

50

Incidents indexed

0 active right now

34

Maintenance windows

0 active or scheduled

Top affected components

Components with the most recent status-page events.

Alert Flow

Operational

1

Alert Flow

Operational

1

Alert REST API

Operational

1

Alert REST API

Operational

1

Configuration REST APIs

Operational

1

90-day status history

Daily rollup

Component changes, incidents, and maintenance windows grouped by day.

operational

degraded

outage

maintenance

unknown

1

operational days

0

degraded days

0

outage days

0

maintenance days

89

unknown days

Recent incidents

Official incidents

Latest outages and degradations detected from the official status page.

Atlassian Cloud Services impacted
Postmortem

### Postmortem publish date: Nov 19th, 2025 ### Summary All dates and times below are in UTC unless stated otherwise. Customers utilizing Atlassian products experienced elevated error rates and degraded performance between Oct 20, 2025 06:48 and Oct 21, 2025 04:05. The service disruptions were triggered due to an [AWS DynamoDB outage](https://aws.amazon.com/message/101925/#:~:text=1%3A50%20PM.-,DynamoDB,-Between%2011%3A48) and further affected by subsequent failures in [AWS EC2](https://aws.amazon.com/message/101925/#:~:text=service%20disruption%20event.-,Amazon%20EC2,-Between%2011%3A48) and [AWS Network Load Balancer](https://aws.amazon.com/message/101925/#:~:text=service%20disruption%20event.-,Amazon%20EC2,-Between%2011%3A48) within the us-east-1 region. The incident started at Oct 20, 2025 06:48 and was detected within six minutes by our automated monitoring systems. Our teams worked to restore all core services by Oct 21, 2025 04:05. Final cleanup of backlogged processes and minor issues was completed on Oct 22, 2025. We recognize the critical role our products play in your daily operations, and we offer our sincere apologies for any impact this incident had on your teams. We are taking immediate steps to enhance the reliability and performance of our services, so that you continue to receive the standard of service you have come to trust. ### IMPACT Before examining product-level impacts, it's helpful to understand Atlassian's service topology and internal dependencies. Products such as Jira and Confluence are deployed across multiple AWS regions. The data for each tenant is stored and processed exclusively within its designated host region. This design is intentional and represents the desired operational state, as it limits the impact of any regional outage strictly to tenants in-region, in this case us-east-1. While in-scope application data is pinned to the region selected by the customer, there are times when systems need to call other internal services that may be based in a different region. If a problem occurs in the main region where these services operate, systems are designed to automatically fail over to a backup region, usually within three minutes. However, if unexpected issues arise during this failover, it can take longer to restore services. In rare cases, this could affect customers in more than one region. It’s important to note that all in-scope application data for supported products is pinned according to a customer’s chosen region. **Jira** Between Oct 20, 2025 06:48 and Oct 20, 2025 20:00, customers with tenants hosted in the us-east-1 region experienced increased error rates when accessing core entities such as Issues, Boards, and Backlogs. This disruption was caused by AWS's inability to allocate AWS EC2 instances and elevated errors in AWS Network Load Balancer \(NLB\). During this window, users may also have observed intermittent timeouts, slow page loads, and failures when performing operations like creating or updating issues, loading board views, and executing workflow transitions. Between Oct 20, 2025 08:36 and Oct 20, 2025 09:23, customers across all regions experienced elevated failure rates when attempting to load Jira pages. This disruption was caused by the regional frontend service entering an unhealthy state during this specific time interval. Normally, the frontend service connects to the primary AWS DynamoDB instance located in the us-east-1 to retrieve the most recent configuration data necessary for proper operation. Additionally, the service is designed with a fallback mechanism that references static configuration data in the event that the primary database becomes inaccessible. Unfortunately, a latent bug existed in the local fallback path. When the frontend service nodes restarted, they were unable to load critical operational configuration data from primary or fallback sources, leading to the observed failures experienced by customers. Between Oct 20, 2025 06:48 and Oct 21, 2025 06:30, customers experienced significant delays and missing Jira in-app notifications across all regions. The notification ingestion service, which is hosted exclusively in us-east-1, exhibited an increased failure rate when processing notification messages due to AWS EC2 and NLB issues. This issue resulted in notifications being delayed - and in some cases, not delivered at all - to users worldwide. **Jira Service Management \(JSM\)** JSM was impacted similarly to Jira above, with the same timeframes and for the same reasons. Between Oct 20, 2025 08:36 and Oct 20, 2025 09:23, customers across all regions experienced significantly elevated failure rates when attempting to load JSM pages. This affected all JSM experiences including the Help Centre, Portal, Queues, Work Items, Operations, and Alerts. **Confluence** Between Oct 20, 2025 06:48 and Oct 21, 2025 02:45, customers using Confluence in the us-east-1 region experienced elevated failure rates when performing common operations such as editing pages or adding comments. The primary cause of this service degradation was the system's inability to auto-scale due to AWS EC2 issues to manage peak traffic load effectively. Though the AWS outage ended at Oct 20, 21:09, a subset of customers continued to experience failures as some Confluence web server nodes across multiple clusters remained in an unhealthy state. This was ultimately mitigated by recycling the affected nodes. To protect our systems while AWS recovered, we made a deliberate decision to enable node termination protection. This action successfully preserved our server capacity but, as a trade-off, it extended the time required for a full recovery once AWS services were restored. **Automation** Between Oct 20, 2025 06:55 and Oct 20, 2025 23:59, automation customers whose rules are processed in us-east-1 experienced delays of up to 23 hours in rule execution. During this window, some events triggering rule executions were processed out of order because they arrived later during backlog processing. This caused potential inconsistencies in workflow executions, as rules were run in the order events were received, not when the action causing the event occurred. Additionally, some rule actions failed because they depend on first-party and third-party systems, which were also affected by the AWS outage. Customers can see most of these failures in their audit logs; however, a few updates were not logged due to the nature of the outage. By Oct 21, 2025 5:30, the backlog of rule runs in us-east-1 was cleared. Although most of these delayed rules were successfully handled, there were some additional replays of events to ensure completeness. Our investigation confirmed that a few events may never have triggered their associated rules due to the outage. Between Oct 20, 2025 06:55 and Oct 20, 2025 11:20, all non-us-east-1 regional automation services experienced delays of up to 4 hours in rule execution. This was caused by an upstream service that was unable to deliver events as expected. The delivery service encountered a failure due to a cross-region dependency call to a service hosted in the us-east-1 region. Because of this dependency issue, the delivery service was unable to successfully deliver events throughout this time frame, resulting in customer-defined rules not being executed in a timely manner. **Bitbucket and Pipelines** Between Oct 20, 2025 06:48 and Oct 20, 2025 09:33, Bitbucket experienced intermittent unavailability across core services. During this period, users faced increased error rates and latency when signing in, navigating repositories, and performing essential actions such as creating, updating, or approving pull requests. The primary cause was an AWS DynamoDB outage that impacted downstream services. Between Oct 20, 2025 06:48 and Oct 20, 2025 22:46, numerous Bitbucket Pipeline steps failed to start, stalled mid-execution, or experienced significant queueing delays. Impact varied, with partial recoveries followed by degradation as downstream components re-synchronized. The primary cause was an AWS DynamoDB outage, compounded by instability in AWS EC2 instance availability and AWS Network Load Balancers. Furthermore, Bitbucket Pipelines continued to experience a low but persistent rate of step timeouts and scheduling errors due to AWS bare-metal capacity shortages in select availability zones. Atlassian coordinated with AWS to provision additional bare-metal hosts and addressed a significant backlog of pending pods, successfully restoring services by 01:30 on Oct 21, 2025. **Trello** Between Oct 20, 2025 06:48 and Oct 20, 2025 15:25, users of Trello experienced widespread service degradation and intermittent failures due to upstream AWS issues affecting multiple components, including AWS DynamoDB and subsequent AWS EC2 capacity constraints. During this period, customers reported elevated error rates when loading boards, opening cards, adding comments or attachments. **Login** Between Oct 20, 2025 06:48 and Oct 20, 2025 09:30, a small subset of users experienced failures when attempting to initiate new login sessions using SAML tokens. This resulted in an inability for those users to access Atlassian products during that time period. However, users who already had valid active sessions were not affected by this issue and continued to have uninterrupted access. The issue impacted all regions globally because regional identity services relied on a write replica located in the us-east-1 region to synchronize profile data. When the primary region became unavailable, the failover to a secondary database in another region failed, which delayed recovery. This failover defect has since been addressed. **Statuspage** Between Oct 20, 2025 06:48 and Oct 20, 2025 09:30, Statuspage customers who were not already logged in to the management portal were unable to log in to create or update incident statuses. This impact was restricted only to users who were not already logged in at the time. The root cause was the same as described in the Login section above, and it was resolved by the same remediation steps. ### REMEDIAL ACTION PLAN & NEXT STEPS We have completed the following critical actions designed to help prevent cross-region impact from similar issues: * Resolved the code defect in the fallback option to ensure that Jira Frontend Services in other regions remain unaffected during a region-wide outage. * Fixed the issue that prevented timely failover of the identity service which impacted new login sessions. * Resolved the code defect so that delivery services in unaffected regions remain operational during region-wide outages. Additionally, we are prioritizing the following improvement actions: * Implement mitigation strategies to strengthen resilience against region-wide outages in the notification ingestion service. Although disruptions to our cloud services are sometimes unavoidable during outages of the underlying cloud provider, we continuously evaluate and improve test coverage to strengthen resilience of our cloud services against these issues. We recognize the critical importance of our products to your daily operations and overall productivity, and we extend our sincere apologies for any disruptions this incident may have caused your teams. If you were impacted and require additional details for internal post-incident reviews, please reach out to your Atlassian support representative with affected timeframes and tenant identifiers so we can correlate logs and provide guidance. Thanks, Atlassian Customer Support

Oct 20, 7:56 AMResolved Oct 21, 5:24 AMMajorOutgoing Integration FlowReporting & Analytics
Delays in Opsgenie, JSM Ops and Compass Ops notifications
Resolved

Between 13:10 UTC to 14:00 UTC, we experienced notification delay for Jira Service Management, Opsgenie, and Compass in only EU-region. The issue has been resolved and the service is operating normally.

Sep 30, 2:07 PMResolved Sep 30, 2:32 PMMinorSMS Notification DeliveryVoice Notification Delivery
Delays observed at JSM, Opsgenie and Compass alert search functionality in US Region
Postmortem

### **SUMMARY** Between September 10, 2025, at 13:56 UTC and September 11, 2025, at 14:40 UTC, Jira Service Management, Compass Cloud Operations, and Opsgenie customers in the U.S. region experienced degraded performance across web and mobile applications, as well as Alert REST APIs. Certain customers experienced difficulties with alert searches and page loading, particularly on alert-related pages. This incident was initiated by an EBS volume upgrade conducted within the cloud-based managed ElasticSearch clusters. Our automated monitoring systems identified the incident within minutes, and it was resolved after 24 hours and 43 minutes by manually implementing vertical and horizontal scaling measures. ### **IMPACT** Between September 10, 2025, 13:56 UTC and September 11, 2025, 14:40 UTC, some customers in the U.S. region experienced degraded performance in Jira Service Management, Opsgenie, and Compass Cloud. During this time, web and mobile applications, as well as the Alert REST APIs, were impacted. Some customers may have seen issues with alert searches and slow loading of alert pages. During the incident, 22% of customers in the US region experienced failures in web API functionality, 8.6% encountered failures with the REST Alert API, and 1.3% experienced delays in notification delivery. Importantly, there was no loss of data during this event. ### **ROOT CAUSE** The degradation was caused by an EBS volume upgrade in the cloud-based managed ElasticSearch clusters, which necessitated a Blue/Green deployment strategy. One of the ElasticSearch nodes approached its shard size threshold, prompting the upgrade and subsequent deployment. This deployment resulted in elevated latency, increased 4xx HTTP response codes, and timeouts affecting both search and indexing operations. Recovery time exceeded expectations due to the prolonged blue/green deployment. After completion, the ElasticSearch cluster remained unhealthy and did not return to its normal state. Additionally, the switchover to the backup region failed because it was configured similarly in size and setup to the primary cluster. ### **REMEDIAL ACTIONS PLAN & NEXT STEPS** We understand that outages can affect your productivity. We are prioritizing the following improvement actions to help prevent similar incidents in the future: * Increase ElasticSearch cluster capacity and optimize settings to handle peak search loads efficiently. * Strengthen our high availability \(HA\) and failover architecture to ensure rapid and reliable recovery in the event of primary region failures. * Optimize cluster upgrade and shard rebalancing strategies to prevent similar issues. We apologize to customers whose services were impacted during this incident. We are taking steps to improve the platform’s performance and availability.   Thanks, Atlassian Customer Support

Sep 10, 3:23 PMResolved Sep 11, 6:45 PMMajorOutgoing Integration FlowReporting & Analytics
Degraded performance in Opsgenie and JSM operations
Resolved

This incident has been resolved.

Jun 19, 3:21 PMResolved Jun 19, 4:37 PMNone
Degraded performance in Opsgenie and JSM operations
Resolved

This incident has been resolved, and Jira Service Management and Opsgenie are back in operational mode.

Jun 17, 12:37 PMResolved Jun 17, 12:47 PMNone
Delays observed at JSM and Opsgenie alert search functionality
Resolved

All search functionality is operational without any latency. Thank you for your patience.

May 10, 7:08 AMResolved May 10, 10:51 AMMinorWeb Application
Schedule API are getting timed out
Resolved

This incident has been resolved.

Apr 4, 10:05 AMResolved Apr 4, 10:52 AMNone
EU OpsGenie API calls having intermittent routing issues
Resolved

Issues where some API calls configured to use api.opsgenie.com/eu were intermittently returning a 404 error with the message 'no Route matched with those values' should now be resolved. API calls to api.eu.opsgenie.com and api.opsgenie.com (without /eu) were not affected at this time.

Mar 5, 6:34 AMResolved Mar 5, 7:24 AMMajorAlert Flow
Services menu in Opsgenie is not responding
Resolved

Our engineers have been closely monitoring the platform and are declaring this incident resolved. Thank you for your patience.

Jan 14, 3:48 PMResolved Jan 14, 4:47 PMNone
Elevated 5XX errors in Schedule API at Opsgenie USA region
Resolved

Our team has identified the issue in Schedule API between 15:30 UTC and 17:00 UTC. We saw performance degradation and 5XX errors in response. Faulty deployment has been reverted quickly in the USA region and rapid recovery is seen. We are monitoring the system for a full recovery right now. The Schedule API is up and running again without any data loss.

Dec 16, 3:30 PMResolved Dec 16, 3:30 PMMinor

Recent maintenance

Maintenance windows

Scheduled and completed maintenance windows are separated from incidents.

Opsgenie Reporting & Analytics is under maintenance in US
Completed

The scheduled maintenance has been completed.

Apr 20, 3:30 AMResolved Apr 20, 11:30 AMMaintenanceReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in US
Completed

Changes completed and verified

Oct 9, 8:30 AMResolved Oct 9, 10:30 AMMaintenanceReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in US
Completed

The scheduled maintenance has been completed.

Mar 19, 9:30 AMResolved Mar 19, 11:00 AMMaintenanceReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in US
Completed

Scheduled maintenance has been completed for Opsgenie Analytics. It now has an improved visual and filtering experience. Thank you for your patience.

Aug 16, 5:00 AMResolved Aug 16, 6:54 AMMaintenanceReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in US
Completed

Scheduled maintenance has been completed for Opsgenie Analytics. Thank you for your patience.

Jul 6, 9:30 AMResolved Jul 6, 2:11 PMMaintenanceReporting & Analytics
Opsgenie Analytics is under maintenance in the EU
Completed

Scheduled maintenance has been completed for Opsgenie Analytics. It now has an improved visual and filtering experience. Thank you for your patience.

Jun 30, 4:00 AMResolved Jun 30, 5:20 AMMaintenanceReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in US
Completed

The scheduled maintenance for US region Reporting and Analytics is complete

Dec 21, 6:30 AMResolved Dec 21, 7:13 AMMaintenanceReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in EU
Completed

The scheduled maintenance has been completed.

Dec 21, 5:30 AMResolved Dec 21, 6:30 AMMaintenanceReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in EU & US
Completed

The scheduled maintenance has been completed.

Dec 13, 9:00 AMResolved Dec 13, 11:00 AMMaintenanceReporting & AnalyticsReporting & Analytics
Opsgenie Reporting & Analytics is under maintenance in EU
Completed

The scheduled maintenance has been completed.

Nov 9, 5:30 AMResolved Nov 9, 6:30 AMMaintenanceReporting & Analytics

About the Opsgenie status page integration

Uptimus tracks the official Opsgenie status page, normalizes upstream events, and separates incidents from scheduled maintenance.

Official source

https://opsgenie.status.atlassian.com

Adapter

STATUSPAGE IO

Alert streams

Incidents, component changes, and maintenance windows.

Public SEO page

Indexable status history for users searching outage information.

Outage map preview

Regional reports can be layered on top of official provider status when user signals are available.

Opsgenie components

Tracked components

Showing 1 to 25 of 47 tracked components.

ComponentStatusTypeLast changed

US

Operational

Group

Not recorded

EU

Operational

Group

Not recorded

Incident Flow

Operational

Component

Not recorded

Incident Flow

Operational

Component

Not recorded

Alert Flow

Operational

Component

Not recorded

Alert Flow

Operational

Component

Not recorded

Email Notification Delivery

Operational

Component

Not recorded

Email Notification Delivery

Operational

Component

Not recorded

Public Website

Public Website (https://www.opsgenie.com/)

Operational

Component

Not recorded

SMS Notification Delivery

Operational

Component

Not recorded

SMS Notification Delivery

Operational

Component

Not recorded

Voice Notification Delivery

Operational

Component

Not recorded

Voice Notification Delivery

Operational

Component

Not recorded

Mobile Notification Delivery

Operational

Component

Not recorded

Mobile Notification Delivery

Operational

Component

Not recorded

Heartbeat Monitoring

Operational

Component

Not recorded

Heartbeat Monitoring

Operational

Component

Not recorded

Incident REST API

Operational

Component

Not recorded

Incident REST API

Operational

Component

Not recorded

Alert REST API

Operational

Component

Not recorded

Alert REST API

Operational

Component

Not recorded

Heartbeat REST API

Operational

Component

Not recorded

Heartbeat REST API

Operational

Component

Not recorded

Incoming Email Service

Operational

Component

Not recorded

Incoming Email Service

Operational

Component

Not recorded

Get notified when Opsgenie changes status

Follow outages, degraded components, and maintenance updates in your Uptimus workspace with email, push, and webhook alerts.

Start monitoringView plans

Show Opsgenie on your own status page

Official provider components

Incident and maintenance separation

Workspace alerts and webhooks

Users also follow these services

Related status pages based on category, adapter type, and operational history.

Atlassian Status

Adapter Tests

GitHub

Development

GitHub Configurable Adapter

Adapter Tests

Vercel

Cloud

Cal.com Status

Adapter Tests

Knowbe4

Security

Gotoconnect

Communication

Egnyte

AI

Pexip

Communication

Smtp2go

Communication

Practice Fusion

Healthcare

Voucherify

Analytics

Frequently asked questions

Is Opsgenie down right now?

Opsgenie is currently marked as Operational in Uptimus based on the latest official status page check.

How often does Uptimus check Opsgenie?

Supported status page providers are checked continuously by our scraper scheduler. The public page is cached briefly for SEO and performance.

Are maintenance windows counted as incidents?

No. Uptimus stores incidents and maintenance windows separately when the upstream provider exposes enough detail.

Can I get alerts for Opsgenie?

Yes. Create an Uptimus workspace, follow this provider, and choose email, push, or webhook notifications.