Alert System

Alerts Dashboard

The Alerts page has three tabs for managing monitoring rules, notifications, and alert history. Auto-refreshes every 30 seconds.

Tab 1: Alert History

  • Stat Cards: Counts for Critical, Warning, Info, Active Total, and Resolved Today
  • Filters: By status (All/Active/Acknowledged/Resolved) and severity (All/Critical/Warning/Info)
  • Alert Cards: Severity badge, status badge, time since triggered, rule name, agent name, description, AI Analysis badge
Alert StatusAvailable Actions
ActiveAcknowledge, Resolve, View Details
AcknowledgedResolve, View Details
ResolvedView Details

View Details opens a modal with full alert info, triggering metrics, and AI root cause analysis with recommendations.

Tab 2: Alert Rules

Define conditions that trigger alerts. Click Create Rule to open the Rule Builder.

Rule Builder Fields

  • Rule Name & Description
  • Severity: Critical, Warning, or Info (radio buttons)
  • Conditions: One or more conditions with AND/OR logic
  • Notification Channels: Select channels to notify
  • Enabled Toggle: On/off without deleting

Available Metrics (11 Total)

CategoryMetrics
PerformanceCPU Usage %, Memory Usage %, Disk Usage %, Network Bytes Sent, Network Bytes Received, Active Connections
Agent PropertiesAgent Name, Agent ID, Agent Type, Agent Tags, OS

Condition Operators

NumericString
> ≥ < ≤ = !== != contains not_contains starts_with ends_with

Each condition supports an optional Duration requirement (e.g., “CPU > 90% for 300 seconds”) to avoid alerting on brief spikes.

Saved rules display as cards with name, severity, evaluation interval, channel count, and edit/delete/toggle controls.

Tab 3: Notification Channels

Configure where alerts are delivered. Channels are reusable across multiple rules.

TypeConfigurationUse Case
EmailRecipient addresses, subject templateIndividual notifications, reports
SlackWebhook URLTeam chat, DevOps channels
Microsoft TeamsWebhook URLEnterprise collaboration
WebhookCustom URL, HTTP method, headersITSM, automation, custom integrations
  • Test: Send a test notification to verify configuration
  • Enable/Disable: Toggle without deleting
  • Each channel card shows type icon, name, type label, and toggle switch

AI-Powered Alert Analysis

Every alert receives automatic Claude AI analysis including root cause, impact assessment, immediate actions, long-term recommendations, and confidence level. Analysis is cached for 24 hours — similar alerts reuse cached results.