Connectors, Tools, and Skills
Overview
SRE Agent retrieves log data, metrics, and runbooks from observability platforms and knowledge bases to guide responders through investigation, triage, and resolution. By connecting your observability stack and institutional knowledge, SRE Agent moves beyond simple alert analysis.
- Connectors & Tools give SRE Agent real-time access to your logs and metrics, so every triage analysis is grounded in the actual state of your systems - not just the alert payload.
- Skills give SRE Agent institutional knowledge specific to your environment - your runbooks, your escalation paths, your standards - so it can recommend precise next steps instead of generic ones.
Together, these capabilities help your team triage faster with less human involvement, reducing MTTA and MTTR while building confidence in AI-assisted operations.
Go to AI Settings SRE Agent Configuration to get started.
SaaS IntegrationsAll of these integrations are SaaS-based.
Availability
Connectors & Tools (Available in Early Access)Retrieve real-time logs and metrics by connecting SRE Agent to third-party observability platforms via API or MCP.
Skills (Coming soon to Early Access - May 30, 2026)Arm your agent with specialized domain expertise using custom instructions, scripts, and runbooks.
Requires the PagerDuty Claude plugin.
Prerequisites
Before configuring connectors, ensure the following:
- Your account is enrolled in the SRE Agent Early Access program.
- You have Admin or Account Owner permissions in PagerDuty.
- You have credentials or API keys for the third-party integrations you plan to connect.
- The SRE Agent is enabled and configured to run via an Incident Workflow.
Connectors & Tools
Connectors link SRE Agent to your external data sources. Once a connector is active, the agent can use its associated Tools - such as querying logs or pulling metrics - will happen automatically or via user request. Currently, Knowledge Base Integrations are invoked automatically when URL is in the event payload, and Observability tools are user invoked unless invoked via an Incident Workflow.
Supported Connectors
Navigate to AI Settings SRE Agent Configuration Connectors to view and manage your connectors.
| Connector | Tools | Connection Type | Availability* |
|---|---|---|---|
| GitHub | Code, Knowledge Base | API, MCP | Knowledge Base (GA) Code (EA) |
| ServiceNow | Knowledge Base | API | GA |
| Datadog | Logs, Metrics, Traces | API, MCP | GA |
| Grafana | Logs, Metrics | API | GA |
| New Relic | Logs | API | GA |
| Sumo Logic | Logs | API | GA |
| Splunk | Logs, Metrics | API | GA |
| AWS (CloudWatch) | Logs | API | GA |
| Elasticsearch | Logs | API | GA |
| Dynatrace | Logs | API | GA |
| Honeycomb.io | Logs, Metrics | MCP | EA |
| Observe Inc. | Logs | MCP | EA |
*(GA - Generally Available, EA - Early Access)
Set up a Connector
- In the PagerDuty web app, go to AI Settings SRE Agent Configuration and click the Connectors tab.
- Find the connector to add and click Connect. If the connector is already listed but shows Unhealthy, click the menu to edit or reconnect it.
- On the **Connect to {{Your integration}} Tools** screen, select an existing connection from the dropdown or click Manage connections to create a new one.
- Under Agent may use these tools, check the tools you want to enable:
- Logs (API) - allows SRE Agent to query log data from this source during incident triage.
- Metrics (MCP) - allows SRE Agent to pull metric data (available on connectors that support it).
- Click Save. The connector will appear in your list with a status of
Activeonce the connection is established.
Connector Statuses
- Active: The connector is healthy and SRE Agent can use its tools.
- Unhealthy: The connection exists but is failing. Check your credentials or API key and reconnect.
- Not connected: No connection has been set up. Click Connect to configure it.
Skills
Skills allow you to arm SRE Agent with custom instructions, runbooks, and scripts that give it specialized domain expertise. Rather than relying solely on the data it can retrieve, a Skill tells the agent exactly how to handle specific situations, so it can intelligently deduce the next steps for troubleshooting issues even before a human looks at the incident.
What a Skill Contains
Each Skill is a structured set of instructions with the following components:
| Component | Details |
|---|---|
| When to use | Conditions that tell the agent when to invoke this Skill automatically |
| Prerequisites | What integrations or data the agent needs to execute the Skill |
| Execution steps | Step-by-step instructions the agent follows during triage |
| Error handling | How the agent should gracefully degrade if a step fails |
| Success criteria | How the agent determines whether the Skill ran successfully |
| Examples | Trigger phrases or alert conditions that invoke this Skill |
Example Skill: detect-security-incident-logs
The following example shows a Skill that automatically searches Datadog and AWS CloudWatch logs for critical security events during incident triage.
Skill: detect-security-incident-logs
Searches Datadog and CloudWatch logs for critical security incidents. When SECURITY_INCIDENT: CRITICAL log entries are found, it recommends triggering the 'Alert Security Team' incident workflow.
When to invoke:
- Automatically during initial triage when the alert custom details include type: possible_security_issue
- When a responder asks the agent to check for security incidents
What it does:
- Searches Datadog and CloudWatch logs for the exact phrase
SECURITY_INCIDENT: CRITICAL - If matches are found: surfaces key log details and recommends the Alert Security Team Incident Workflow
- Degrades gracefully if one integration is unavailable - continues with the other
Create a Skill
Skills are configured through the PagerDuty Claude plugin. Each agent type (SRE Agent, Insights Agent, Shift Agent) supports up to five Skills.
- Install the PagerDuty Claude plugin.
- Run the
/pagerduty:create-pagerduty-skillcommand and follow the guided prompts to define the Skill's name, instructions, trigger conditions, and metadata. - Once created, the Skill is immediately available to the SRE Agent for all future incidents.
- Test by triggering an incident matching your Skill's conditions and review what the agent reports in the SRE Agent tab on the incident details page or in ChatOps (Slack, Microsoft Teams)
Updated 4 days ago
