Anomaly Detection

    Anomaly Detection & Root Cause Analysis

    Xitoring learns what normal looks like on every host and metric, then warns you the moment behavior drifts — before a static threshold trips. When an incident does happen, AI-assisted root cause analysis points straight at the cause.

    Anomaly Detection & Root Cause Analysis dashboard preview

    Social Proof

    Trusted by thousands — rated on

    See what real users say about Xitoring on the world's top review platforms.

    SourceForge Reviews · G2 Reviews · Slashdot Reviews · Product Hunt

    What is anomaly detection?

    Anomaly detection is the use of statistical and machine-learning techniques to identify data points, events, or trends in a metric stream that deviate meaningfully from what's expected. In infrastructure monitoring, it replaces brittle static thresholds with adaptive models that learn each system's normal rhythm — peak hours, weekend lulls, batch windows — and flag the moments behavior changes. That gives operators a chance to investigate slow drifts and pattern shifts long before a fixed threshold would trip.
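
    To make that concrete, here is a minimal sketch of baseline-based detection in Python. It is not Xitoring's actual model, just the core idea: learn an hour-of-week baseline per metric, then flag samples that land several standard deviations outside it.

        from collections import defaultdict
        from statistics import mean, stdev

        # Minimal baseline detector (illustrative only, not Xitoring's model).
        # It learns a per-hour-of-week baseline, so a busy Monday 09:00 is
        # compared against past Monday 09:00s, not a quiet Sunday night.
        class BaselineDetector:
            def __init__(self, z_threshold=3.0, min_samples=5):
                self.history = defaultdict(list)  # hour-of-week -> past values
                self.z_threshold = z_threshold
                self.min_samples = min_samples

            def observe(self, timestamp, value):
                """Score a sample against its baseline, then learn from it.

                Returns the z-score if the sample is anomalous, else None.
                """
                bucket = timestamp.weekday() * 24 + timestamp.hour
                past = self.history[bucket]
                anomaly = None
                if len(past) >= self.min_samples:
                    mu, sigma = mean(past), stdev(past)
                    if sigma > 0:
                        z = (value - mu) / sigma
                        if abs(z) > self.z_threshold:
                            anomaly = z
                past.append(value)  # keep learning, even from outliers
                return anomaly

    A production detector would also model trend, use robust statistics, and age out old data; the point is only that the "threshold" comes from each metric's own history rather than being set by hand.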

    A modern anomaly detection system pairs that detection layer with root cause analysis: when something is off, it correlates the anomalous signal against deploys, configuration changes, related metrics, and historical incidents to point at probable causes. The goal isn't to replace SRE judgment — it's to short-circuit the dashboard scavenger hunt that eats the first 30 minutes of every incident. Xitoring runs detection and RCA continuously across every host and metric in your account, with no per-metric tuning and no new agent.
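
    The correlation step can be pictured the same way. The sketch below uses an invented event model and weights (Xitoring's RCA engine is more involved) to show how candidate causes near an anomaly can be ranked:

        from dataclasses import dataclass
        from datetime import datetime, timedelta

        # Toy root cause ranking: score each candidate event by how close it
        # landed before the anomaly, weighted by how often that kind of event
        # explains incidents. Data model and weights are invented for the sketch.
        @dataclass
        class Event:
            kind: str          # "deploy", "config_change", "metric_anomaly", ...
            description: str
            time: datetime

        KIND_WEIGHT = {"deploy": 1.0, "config_change": 0.8, "metric_anomaly": 0.6}

        def rank_causes(anomaly_time, events, window=timedelta(minutes=30)):
            """Rank events in the window before the anomaly as probable causes."""
            candidates = []
            for e in events:
                gap = anomaly_time - e.time
                if timedelta(0) <= gap <= window:   # only events before the anomaly
                    recency = 1 - gap / window      # closer in time -> higher score
                    score = recency * KIND_WEIGHT.get(e.kind, 0.3)
                    candidates.append((score, e))
            return sorted(candidates, key=lambda c: c[0], reverse=True)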

    Key Features

    Everything you need for Anomaly Detection & Root Cause Analysis.

    Predictive AI Detection

    Machine learning watches every metric for unusual patterns — slow drifts, sudden steps, periodic glitches — and raises a soft alert before threshold-based alerts fire.

    Root Cause Management

    When an incident lands, the AI correlates metrics, deploys, alerts, and host events to point at the likely cause. No more 45-minute war rooms hunting for the trigger.

    Auto-Learned Baselines

    No need to set thresholds for every host. Xitoring builds per-host, per-metric baselines that account for daily, weekly, and seasonal patterns automatically.

    Multi-Signal Correlation

    Anomalies rarely show up in one place. The AI cross-correlates CPU, memory, disk, network, response time, and service events to spot the real story.

    Lower Alert Fatigue

    Static thresholds either over-fire or miss real issues. Adaptive detection cuts noise by suppressing expected behavior and surfacing actual deviations.

    Incident Forecasts

    When a metric is trending toward a known failure mode — disks filling, memory leaking, latency creeping — the AI predicts the time-to-impact so you can act early.
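
    As a simplified illustration of this kind of forecast, assuming a roughly linear trend (real forecasting also has to cope with noise and non-linear growth), a least-squares fit can estimate when a metric crosses its limit:

        # Time-to-impact estimate for a trending metric, e.g. disk usage.
        # samples: list of (timestamp_seconds, used_fraction) pairs.
        def seconds_until(samples, limit=0.95):
            """Fit a least-squares line and estimate when it crosses `limit`."""
            n = len(samples)
            sum_t = sum(t for t, _ in samples)
            sum_v = sum(v for _, v in samples)
            sum_tt = sum(t * t for t, _ in samples)
            sum_tv = sum(t * v for t, v in samples)
            denom = n * sum_tt - sum_t ** 2
            if denom == 0:
                return None
            slope = (n * sum_tv - sum_t * sum_v) / denom
            intercept = (sum_v - slope * sum_t) / n
            if slope <= 0:
                return None                       # not trending toward the limit
            t_hit = (limit - intercept) / slope   # when the fitted line crosses limit
            return max(0.0, t_hit - samples[-1][0])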

    Find Issues Before They Become Incidents

    Anomaly Detection in Xitoring is more than a smart threshold. It's a continuous AI loop that learns what normal looks like for every metric on every host, flags deviations as they begin, and traces incidents back to root cause when they happen.

    • Predictive alerts before threshold-based alerts fire
    • Auto-learned per-host, per-metric baselines
    • Daily, weekly, and seasonal pattern awareness
    • AI-assisted root cause analysis on every incident
    • Multi-signal correlation across CPU, memory, disk, network
    • Severity scoring to cut alert fatigue
    • Time-to-impact forecasts for trending failures
    • Works with Slack, PagerDuty, Teams, webhooks & more
    • Zero manual threshold tuning

    AI detecting an anomalous CPU pattern before threshold alerts fire
    Root cause analysis view correlating metrics, deploys, and incidents

    Who it's for

    Anomaly Detection Use Cases

    See how teams across industries use Xitoring to keep their infrastructure reliable.

    Cloud Fleets

    Static thresholds don't scale to hundreds of AWS, Azure, or GCP instances with different workloads. Adaptive detection learns each host's rhythm — no per-VM rules to write.

    Database Operations

    Catch slow query regressions, replication drift, and connection pool exhaustion as patterns shift — long before downtime metrics turn red.

    E-Commerce Reliability

    Detect checkout slowness, payment latency drift, and cart abandonment patterns before they cost revenue. The AI sees the dip before the dashboard does.

    SaaS Platforms

    Spot tenant-specific anomalies — one customer's workload misbehaving, one region degrading — without writing per-tenant alert rules.

    FinTech & Compliance

    Surface unusual transaction patterns, authentication spikes, and API anomalies that simple thresholds miss. Document every detection for audit trails.

    DevOps & SRE Teams

    Turn post-incident retros into a faster loop. Root cause analysis points at the change, deploy, or upstream signal that started the slide.

    01

    Why Anomaly Detection

    Threshold-based alerts only fire after a metric has already crossed a line you guessed at months ago. Real incidents start as small deviations — a slow memory leak, a 50 ms latency creep, a checkout queue growing 2% an hour. Anomaly Detection sees those deviations from minute one and gives you time to act.

    • Catch issues before users — and dashboards — notice
    • Stop writing per-host threshold rules that go stale
    • Detect slow drifts that a static threshold will never catch
    • Surface seasonal and weekend patterns you didn't model
    02

    Root Cause Analysis, Automated

    When an incident hits, every second spent hunting through dashboards is a second customers feel the pain. Xitoring's AI correlates metric anomalies, recent deploys, service events, and historical incidents to point at the likely cause — with evidence — before your on-call has finished joining the call.

    • Correlate CPU, memory, disk, network, and app metrics in seconds
    • Surface recent deploys and config changes near the incident
    • Match against historical incidents with similar fingerprints
    • Generate a plain-English incident summary for the post-mortem

    How It Works

    No Manual Tuning

    Turn it on per host or fleet-wide. Baselines self-calibrate over a learning window — no thresholds to bikeshed, no rules to maintain as your infrastructure grows.

    Severity-Aware

    Not every anomaly is an incident. Detections are scored by severity, blast radius, and historical impact so on-call gets paged for real signals only.
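
    A hypothetical scoring function shows the shape of the idea; the factors and weights below are illustrative, not Xitoring's formula:

        # Combine deviation size, blast radius, and historical impact into one
        # severity decision. All names and weights here are hypothetical.
        def severity(z_score, hosts_affected, fleet_size, past_incident_rate):
            deviation = min(abs(z_score) / 6.0, 1.0)        # cap extreme z-scores
            blast_radius = hosts_affected / max(fleet_size, 1)
            history = past_incident_rate                    # 0.0 .. 1.0
            score = 0.5 * deviation + 0.3 * blast_radius + 0.2 * history
            return "page" if score > 0.6 else "notify" if score > 0.3 else "log"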

    Works With Your Channels

    Anomaly alerts flow through the same notification channels as static checks — Slack, email, SMS, PagerDuty, Teams, webhooks, and 15+ more.
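
    If you route anomaly alerts to a webhook, the receiving side can be as small as the sketch below. The payload fields used here ("check", "severity", "summary") are hypothetical placeholders; consult the webhook documentation for the actual schema.

        from http.server import BaseHTTPRequestHandler, HTTPServer
        import json

        # Minimal webhook receiver for anomaly alerts. Payload fields are
        # hypothetical; consult the actual webhook schema before relying on them.
        class AlertHandler(BaseHTTPRequestHandler):
            def do_POST(self):
                length = int(self.headers.get("Content-Length", 0))
                alert = json.loads(self.rfile.read(length) or b"{}")
                print(f"[{alert.get('severity', '?')}] "
                      f"{alert.get('check', 'unknown')}: {alert.get('summary', '')}")
                self.send_response(204)   # acknowledge with no body
                self.end_headers()

        if __name__ == "__main__":
            HTTPServer(("", 8080), AlertHandler).serve_forever()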

    Frequently asked questions

    Common questions about Anomaly Detection & Root Cause Analysis.

    How does Xitoring's anomaly detection work?
    Xitoring builds a per-host, per-metric baseline using machine learning over a learning window — typically 1–2 weeks. The baseline accounts for daily, weekly, and seasonal patterns, so a healthy nightly batch spike doesn't trigger an alert. When a metric drifts from its baseline in a way that's statistically significant, the AI raises a graded anomaly alert.
    Is this just a smart threshold?
    No. A smart threshold still uses one number — it just calculates it for you. Xitoring's detection models the full distribution of each metric over time, captures periodicity, and correlates across signals. It catches slow drifts and pattern changes that any single threshold misses.
    What is root cause analysis?
    When an incident is open, Xitoring's RCA engine pulls every metric anomaly, deploy event, configuration change, and similar past incident in the relevant time window, then ranks the most likely contributing causes with evidence. It's not a guess — it's a correlation report you can use to decide where to look first.
    Do I still need static thresholds?
    For some metrics, yes. Hard SLA thresholds (e.g. p99 latency under 200 ms) are easier to reason about as fixed numbers. Anomaly Detection runs alongside them, picking up the slow drifts that a static alert will never catch. The two are complementary, not exclusive.
    How long does the learning period take?
    Most metrics produce a usable baseline within 24–48 hours and a high-confidence baseline within 1–2 weeks. The system improves continuously as it sees more data and learns your specific workload patterns.
    Will this make my alert volume go up?
    Usually the opposite. Severity scoring suppresses low-impact deviations and known seasonal patterns, so on-call gets paged less often, and earlier, for the issues that matter. Teams typically see fewer wake-up calls and a shorter mean time to detect after enabling it.
    Which metrics support anomaly detection?
    All time-series metrics Xitoring collects: CPU, memory, disk, I/O, network, response time, request rate, and any custom metric you push. Detection works the same way regardless of the underlying source.
    Does this require extra setup or a new agent?
    No. If you're already collecting metrics with Xitogent or via any of Xitoring's monitor types, anomaly detection is a panel toggle. No new agent, no new exporter, no new pipeline.

    Stop Reacting. Start Predicting.

    Static thresholds catch problems after they hurt. Xitoring's AI learns the rhythm of every host and surfaces unusual behavior before users notice. Turn it on once — alerts get smarter from there.

    Start Free Trial
    Get started with Xitoring