Experiencing Alerting failure issue in Azure Portal for Many Data Types – 09/25 – Resolved

This post has been republished via RSS; it originally appeared at: New blog articles in Microsoft Tech Community.

Final Update: Wednesday, 25 September 2019 19:25 UTC

We've confirmed that all systems are back to normal with no customer impact as of 09/25, 19:15 UTC. Our logs show the incident started on 09/11,16:00 UTC and that during the 14 days,3 hours and 15 minutes that it took to resolve the issue some of the customers may have experienced Alerting failure for Exception based metric alerts in Application Insights.
  • Root Cause: The failure was due to deployment in one of the backend services.
  • Incident Timeline: 14 days,3 hours and 15 minutes - 09/11, 16:00 UTC through 09/25, 19:15 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Jayadev

Update: Wednesday, 25 September 2019 17:49 UTC

The fix had been deployed to 16 of 18 regions. East US and West Europe are two regions to be deployed further. Customers in the remaining regions will not experience Alerting failure and the metrics will work as expected.
  • Next Update: Before 09/25 22:00 UTC
-Jayadev

Update: Wednesday, 25 September 2019 13:33 UTC

The fix had been deployed to 15  of 18 regions, East US, West Europe and South Central US are three region to be deployed further. Customers in the remaining regions will not experience Alerting failure and the metrics will work as expected.
  • Next Update: Before 09/25 19:00 UTC
-Monish

Update: Wednesday, 25 September 2019 10:35 UTC

Root cause has been isolated to Alerting Failure for Exception based metric which was impacting the alert been resolved and generating new alerts.  Engineers are involved in fixing the issue
  • Next Update: Before 09/25 15:00 UTC
-Monish

Initial Update: Wednesday, 25 September 2019 06:17 UTC

We are aware of issues within Application Insights and are actively investigating. Some customers may experience Alerting failure for Exception metrics
  • Next Update: Before 09/25 10:30 UTC
We are working hard to resolve this issue and apologize for any inconvenience.
-Monish



Leave a Reply

Your email address will not be published. Required fields are marked *

*

This site uses Akismet to reduce spam. Learn how your comment data is processed.