Which ITOps & DevOps Monitoring Tools are Real(ly) Real-Time Solutions?
Sahil Khanna | May 18, 2017

A lot of monitoring vendors claim to have real-time capabilities, but that’s not often the case, thanks to refresh latency.

A lot of monitoring vendors claim to have real-time capabilities, but that’s not often the case, thanks to refresh latency.

Whether you’re a day-trader, a DevOps engineer or Lindsey Lohan in Mean Girls, you need access to critical information as soon as possible. In other words, you need real-time insight.

The term “real time,” however, is somewhat fuzzy. It’s thrown around quite a bit, and can mean different things depending on the context.

For example, according to Paul Buchheit, the creator of Gmail, every interaction should be faster than 100ms. Why? Because 100ms is the threshold where interactions feel instantaneous.

For ITOps and DevOps professionals that work against KPIs, like Mean-Time-To-Detect and Mean-Time-To-Acknowledge incidents with strict SLAs, the last thing you need is your monitoring technology acting as the bottleneck. In this context, “real time” involves a tool’s frequency of retrievals or updates to present new information.

Tweet Section

Tiered architectures require you to perform jobs to retrieve the data — leading to significant latency that increases as the volumes get larger.

What Does Real-Time Monitoring Really Mean for ITOps & DevOps? 

Since alert volumes are exponentially larger than personal email volumes, today’s universal standard for real-time is 1 second.

This means that the difference between the presentation of a data point (metric, Event, Alert, etc.) into the monitoring systems, and the creation of that data point, should be 1 second or less.

What is Near Real Time?

Based on our real-time definition, technologies that aggregate or refresh less frequently than every minute are not real-time. Rather, we refer to them as near real-time if they are still in the 1-3 minute range.

Why do certain tools have this latency? It’s typically a result of a tiered architecture, created to help ingest large volumes of events. The problem is that tiered architectures require you to perform jobs to retrieve the data — leading to significant latency that increases as the volumes get larger.

Since these jobs or refreshes tend to get very computationally expensive, the intervals of data retrieval increase to near real-time frequencies.

Which Tools Are Real Time? 

It’s clear that every IT monitoring vendor claims real-time, but which ones really past the test? The bottom table include common monitoring technologies, their minimum refresh latencies, and default latencies.

TechnologyTechMin. LatencyDefault LatencyReal-Time
Web
Pingdom1 minute1 minuteNo
APM
AppDynamics1 minute1 minuteNo
NewRelic1 minute1 minuteNo
Dynatrace1 minute1 minuteNo
Instana1 second1 secondYes
Time Series
DataDog15 seconds15 secondsNo
Wavefront1 second1 secondYes
SignalFx1 second1 secondYes
Network
ExtraHop30 seconds5 minutesNo
SolarWinds1 minute5 minutesNo
ThousandEyes1 minute1 minuteNo
Infrastructure
Nagios1.5 minutes5 minutesNo
Zenoss5 seconds5 minutesNo
ScienceLogic1 minute5 minutesNo
Logs
Splunk5 seconds5 minutesNo
Elastic1 second1 secondYes

Clock photo © Christian Schnettelker

Moogsoft is a pioneer and leading provider of AIOps solutions that help IT teams work faster and smarter. With patented AI analyzing billions of events daily across the world’s most complex IT environments, the Moogsoft AIOps Platform helps the world’s top enterprises avoid outages, automate service assurance, and accelerate digital transformation initiatives.
See Related Posts by Topic:

About the author

Sahil Khanna

Sahil Khanna is a Sr. Product Marketing Manager at Moogsoft, where he focuses on the emergence of Algorithmic IT Operations. In his free time, Sahil enjoys banging on drums and participating in high-stakes bets.

All Posts by Sahil Khanna

Moogsoft Resources

August 4, 2020

Telemetry Everywhere: Observability in the DevOps Cosmos

July 22, 2020

What’s Observability with AIOps? Check Out Our New Book, Webinars and Infographic

July 21, 2020

Why Observability Matters to Site Reliability Engineers

June 29, 2020

Moogsoft Express Helps DevOps and SRE Teams Develop More and Operate Less

Loading...