PagerDuty
Integrate cost alerts with PagerDuty's incident management system. Get instant escalation for critical cost events and ensure rapid response.
PagerDuty Integration Benefits
- Immediate escalation for critical cost events
- Intelligent alert routing to responsible teams
- SLA-driven incident response for cost issues
- Integration with existing on-call rotations
- Automated incident creation and resolution
Key Features
Critical Cost Alerts
Escalate major cost incidents through PagerDuty's incident management workflow
Intelligent Escalation
Smart alert routing based on cost severity and team responsibilities
SLA Protection
Ensure cost incidents are resolved within SLA timeframes
Cost Alert Types
Critical Budget Overrun
Immediate escalation when spending exceeds critical budget thresholds
Cost Anomaly Detection
Automated alerts for unusual spending patterns and cost spikes
Resource Waste Alert
Notifications for significant resource waste and optimization opportunities
Budget Forecast Warning
Proactive alerts when projected spending will exceed monthly budgets
Easy Setup in 4 Steps
Connect PagerDuty Account
Authenticate and connect your PagerDuty account to DeepCost
Configure Services
Map cost alert types to appropriate PagerDuty services and teams
Set Escalation Rules
Define escalation policies and severity levels for different cost events
Test & Monitor
Verify alert routing and monitor incident response times
PagerDuty Service Configuration
# DeepCost PagerDuty Integration Configuration
services:
cost-management:
name: "Cloud Cost Management"
description: "Critical cost incidents and budget overruns"
escalation_policy: "cost-team-escalation"
alert_grouping: "intelligent"
auto_resolve_timeout: 1800 # 30 minutes
resource-optimization:
name: "Resource Optimization"
description: "Resource waste and optimization opportunities"
escalation_policy: "engineering-escalation"
alert_grouping: "time"
auto_resolve_timeout: 3600 # 1 hour
alert_rules:
critical_budget_overrun:
condition: "monthly_spend > budget * 1.2"
severity: "critical"
service: "cost-management"
message: "Monthly spending exceeded budget by {{ percentage }}%"
cost_anomaly:
condition: "daily_increase > baseline * 2"
severity: "warning"
service: "cost-management"
message: "Anomalous cost spike detected: {{ service_name }}"
resource_waste:
condition: "unused_resources > $1000"
severity: "info"
service: "resource-optimization"
message: "Significant resource waste detected: {{ waste_amount }}"
escalation_policies:
cost-team-escalation:
teams:
- name: "FinOps Team"
delay: 0
- name: "Engineering Managers"
delay: 900 # 15 minutes
- name: "VP Engineering"
delay: 1800 # 30 minutes
integration_settings:
webhook_url: "https://events.pagerduty.com/v2/enqueue"
api_key: "{{ PAGERDUTY_API_KEY }}"
retry_attempts: 3
timeout: 30
notification_preferences:
suppress_low_priority: false
group_similar_alerts: true
auto_resolve_fixed_issues: true
include_cost_context: trueCost Incident Response Workflow
1. Detection & Alert
- • Automatic cost anomaly detection
- • Real-time threshold monitoring
- • Intelligent alert categorization
- • Context-rich incident creation
2. Escalation & Response
- • Automatic team notification
- • SLA-driven escalation
- • On-call engineer engagement
- • Incident severity assessment
3. Resolution & Learning
- • Automated remediation actions
- • Cost optimization implementation
- • Incident documentation
- • Post-incident analysis