Alert System
Alerts Dashboard
The Alerts page has three tabs for managing monitoring rules, notifications, and alert history. Auto-refreshes every 30 seconds.
Tab 1: Alert History
- Stat Cards: Counts for Critical, Warning, Info, Active Total, and Resolved Today
- Filters: By status (All/Active/Acknowledged/Resolved) and severity (All/Critical/Warning/Info)
- Alert Cards: Severity badge, status badge, time since triggered, rule name, agent name, description, AI Analysis badge
| Alert Status | Available Actions |
|---|---|
| Active | Acknowledge, Resolve, View Details |
| Acknowledged | Resolve, View Details |
| Resolved | View Details |
View Details opens a modal with full alert info, triggering metrics, and AI root cause analysis with recommendations.
Tab 2: Alert Rules
Define conditions that trigger alerts. Click Create Rule to open the Rule Builder.
Rule Builder Fields
- Rule Name & Description
- Severity: Critical, Warning, or Info (radio buttons)
- Conditions: One or more conditions with AND/OR logic
- Notification Channels: Select channels to notify
- Enabled Toggle: On/off without deleting
Available Metrics (11 Total)
| Category | Metrics |
|---|---|
| Performance | CPU Usage %, Memory Usage %, Disk Usage %, Network Bytes Sent, Network Bytes Received, Active Connections |
| Agent Properties | Agent Name, Agent ID, Agent Type, Agent Tags, OS |
Condition Operators
| Numeric | String |
|---|---|
| > ≥ < ≤ = != | = != contains not_contains starts_with ends_with |
Each condition supports an optional Duration requirement (e.g., “CPU > 90% for 300 seconds”) to avoid alerting on brief spikes.
Saved rules display as cards with name, severity, evaluation interval, channel count, and edit/delete/toggle controls.
Tab 3: Notification Channels
Configure where alerts are delivered. Channels are reusable across multiple rules.
| Type | Configuration | Use Case |
|---|---|---|
| Recipient addresses, subject template | Individual notifications, reports | |
| Slack | Webhook URL | Team chat, DevOps channels |
| Microsoft Teams | Webhook URL | Enterprise collaboration |
| Webhook | Custom URL, HTTP method, headers | ITSM, automation, custom integrations |
- Test: Send a test notification to verify configuration
- Enable/Disable: Toggle without deleting
- Each channel card shows type icon, name, type label, and toggle switch
AI-Powered Alert Analysis
Every alert receives automatic Claude AI analysis including root cause, impact assessment, immediate actions, long-term recommendations, and confidence level. Analysis is cached for 24 hours — similar alerts reuse cached results.