The Related Incidents feature provides incident responders with the suggested 20 most recent Related Incidents that are impacting other responders and PagerDuty services. This feature uses a completely online and real-time machine learning algorithm as well as your service dependency data to provide these insights, giving responders an at-a-glance-view of the full breadth and scope of incident impact. Related Incidents extends our machine learning capabilities beyond noise reduction, enriching incidents with deep contextual insights to help responders coordinate an effective team response and mitigate business disruption.
This feature is currently available as part of our Digital Operations plan or can be purchased via Event Intelligence, an add-on to our Platform Team or Platform Business plans. If you would like to sign up for a trial of this and other Event Intelligence features, please contact our Sales team.
To view Related Incidents, click your desired incident’s Title to see the incident details page, and then select the Related Incidents tab. The Related Incidents tab uses Event Intelligence to display a list of incidents on your PagerDuty account that are potentially related to the incident you are currently investigating.
The Related Incidents tab includes the following information:
- Related Incidents -
#: A total count of related incidents.
- Impact Summary: A total list of impacted Responders, impacted Business Services and impacted Technical Services.
- List of Related Incidents on a Timeline: Related Incidents on the timeline include the following:
- Incident Details: You may click on the Incident Title, # of Alerts, Assigned To User or Service to view more details.
- Dependency Pane: If incidents are related due to a Technical or Business Service dependency relationship, that dependency will be shown here. If there is no service dependency relationship for that incident, you can click Add One to define your service dependencies. Related Incidents with no dependencies are those that were sorted via Machine Learning. Read our section on types of Related Incidents to learn more.
- Information Icon: View why the incident was suggested as related by clicking on the icon in the top right corner of the Related Incidents card.
- Thumbs Up/Thumbs Down: Indicate whether you believe an incident is or isn’t related to the one you are investigating, and give our machine learning model feedback. Feedback is real-time, completely processed online, and it is applied to suggest future Related Incidents. You can click the numbers below the thumbs up/down to view who voted for each option. Note: the thumbs up/thumbs down component is only visible on incidents generated by the machine learning algorithm, and it takes about 5–10 human inputs to train our algorithm on a new behavior.
There are two types of Related Incidents:
Machine Learning Based Related Incidents are determined by three main factors:
- How close in time the incidents were created.
- How related the alert metadata is.
- Human response behavior: Thumbs up/thumbs down and/or merging incidents across services.
You can identify Machine Learning Based Related Incidents by clicking on the icon or by the presence of the thumbs up/thumbs down feedback mechanism.
Some Related Incidents may be determined by your Technical or Business Service dependencies. When two incidents are triggered within five minutes of each other and one service is directly dependent on the other, these incidents will be marked as related. Similarly, if incidents occur within this time frame and share a parent Business Service, they will also be marked as related. These service relationships will be identified in the Dependency pane on individual Related Incident cards.
To view Related Incidents in the mobile app, tap your desired incident title to enter the incident details screen. If there are Related Incidents, at the bottom of the incidents details screen you will see a blue banner which prompts you to view these Related Incidents.
Past Incidents (previously named Similar Incidents) is a tab that displays resolved incidents from the past that were on the same service that may be similar to the current triggered incident. Related Incidents uses Event Intelligence to determine what other active incidents going on in real-time across all services on your account are related, to help you better understand the breadth and scope of impact.
Here are some common reasons:
- The incident seems unique: The algorithm was unable to find another recent incident with metadata related to this one. If it identifies any while this incident is still going on, they will appear in the Related Incidents tab.
- There isn’t enough data yet: Your account doesn’t have enough data yet to start finding relationships among incidents. Continue to use PagerDuty normally and related incidents will begin to appear.
The incident title is often determined by the title of the first alert created for a given incident. It is likely that our machine learning algorithm deemed that alerts created close in time and with the same title are related.
Responders find it is useful to see recently resolved related incidents to gain additional context during major incident response. For example: who resolved the incident and what actions were taken to resolve it.
This is by design. If you notice a high volume of related or duplicate incidents on a particular service, we recommend enabling Intelligent Alert Grouping to reduce noise on that service and prevent duplicate incidents from being created.
This feature is available completely out of the box, and will begin working right away with no setup needed.
Please keep in mind that you must have access to the Event Intelligence add-on product, or be on the Digital Operations plan to access this feature.
Our machine learning algorithm can be influenced by user feedback. The machine learning algorithm looks at human response behavior including thumbs up/down feedback as well as cross-service merges in order to improve future Related Incidents.
Updated 7 months ago