Phil Tee: Observability Requires the Marriage of AI, Metrics and Logs
Juan Perez | December 17, 2019
Observability without AIOps is just noise.

[Image: Phil Tee presentation at AWS re:Invent 2019]

Moogsoft CEO Phil Tee delivered the proverbial good news / bad news combo to those who attended his presentation at AWS re:Invent 2019 in Las Vegas.

First the bad news: statistical analysis of IT metrics data is “stuck in the stone age,” hampering IT Ops and DevOps teams’ ability to detect and fix problems that impact digital services.

And the good news? Applying AI and ML algorithms is as relevant to time series metrics as it is to other areas of AIOps.

“Our belief is that the combination of AI, metrics and logs is where we need to be thinking,” Phil told the audience.

The complexity problem

As IT environments get increasingly hybrid, ephemeral and distributed to support an organization’s digital transformation efforts, they become correspondingly more complex and harder to monitor.

“The ‘software-defined everything’ world is incredibly changeable and almost impossible to pin down in terms of its configuration,” Phil said. “There is practically no ‘steady state.’”

While IT operators struggle to maintain the quality, stability and performance of services, customers get more demanding every day, expecting zero downtime and zero latency.

“There’s an avalanche of complexity out there which we’ve got to collectively tame,” he said.

He cited results from a recent survey in which a majority of CIOs reported troubling trends that point to “systematic failures” in service assurance, including that:

  • Customers notice service incidents before their IT support team does
  • Existing monitoring solutions detect fewer than half of performance issues and outages
  • Growing IT complexity leads to more outages

What to do?

Phil, a serial innovator who has devoted 25-plus years to improving IT operational management, co-founded Moogsoft in 2011 to apply AI and ML to this problem, after noticing that legacy rules-based products couldn’t handle the scale, speed and dynamism of modern environments.

At first, most questioned this approach, but Phil has been proven right. Moogsoft is a leader in the hot, new market for AIOps (Artificial Intelligence for IT Operations), which is now acknowledged as truly transformative for IT Ops and DevOps teams.

 

[Image: Moogsoft AI Portfolio]

 

More than 130 companies, including 20 of the Fortune 1000, use the Moogsoft AIOps platform to streamline IT operations, quickly detect and fix incidents, prevent outages, meet SLAs, and boost their digital transformation.

 

[Image: Moogsoft Innovation]

 

Most recently, Moogsoft has tackled the monitoring of metrics data, a key feature of its AIOps platform that is also available in the entry-level Express package, now in beta. Compared with logs, the key difference when monitoring metrics is the volume of data: an at-scale environment may generate thousands of log messages per second while producing hundreds of thousands of metrics per second, Phil explained.

Observability without AIOps is just noise

But all that raw metrics data by itself contains little information because it lacks context. Knowing that the CPU usage on a server is at 94% is of little value if you don’t know if it indicates normal functioning or a potential problem.

“You need to know a lot more, like: What was it like yesterday? What was it when the system was doing something different? Do I have only one of those servers or is it part of a 1000-server farm?” Phil said.
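To make the point about context concrete, here is a minimal sketch that scores the same 94% CPU reading against two kinds of context Phil mentions: the server's own recent history and the readings of its peers. It is an illustration only, not Moogsoft's method; the function names, the sample numbers and the z-score threshold of 3 are all assumptions made for this example.

```python
from statistics import mean, stdev

def zscore(value, samples):
    """Standard score of value relative to samples (needs at least two samples)."""
    sigma = stdev(samples)
    # No spread in the samples means there is no scale to judge against.
    return 0.0 if sigma == 0 else (value - mean(samples)) / sigma

def assess_cpu(reading, own_history, peer_readings, threshold=3.0):
    """Score one CPU reading against its own history and against peer servers.

    Hypothetical helper: the threshold is an illustrative choice, not a
    recommended setting.
    """
    vs_history = zscore(reading, own_history)
    vs_peers = zscore(reading, peer_readings)
    return {
        "vs_history": vs_history,
        "vs_peers": vs_peers,
        "anomalous": abs(vs_history) > threshold or abs(vs_peers) > threshold,
    }

# A server that always runs hot, in a farm that also runs hot.
busy = assess_cpu(94.0,
                  own_history=[92.0, 95.0, 93.0, 96.0, 94.0],
                  peer_readings=[91.0, 95.0, 93.0, 97.0])

# A normally quiet server, in a farm that is also quiet.
idle = assess_cpu(94.0,
                  own_history=[20.0, 22.0, 19.0, 21.0, 23.0],
                  peer_readings=[18.0, 24.0, 21.0, 20.0])

print(busy["anomalous"], idle["anomalous"])
```

The identical reading is unremarkable for the first server and a strong outlier for the second, which is the sense in which the raw number alone carries little information.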

This is where AIOps comes in. Moogsoft ingests monitoring data and applies AI and ML algorithms to it, eliminating noise, detecting anomalies, correlating relevant metrics, alerts and log events, surfacing them as contextualized incidents and identifying problem root causes.
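The noise-reduction and correlation steps can be pictured with a deliberately simple sketch: drop repeated alerts, then group the remaining ones into incidents when they arrive close together in time. This is a toy stand-in and not Moogsoft's algorithms, which the article does not describe; the `Alert` type, the naive global de-duplication rule and the 60-second window are assumptions made purely for illustration.

```python
from dataclasses import dataclass

@dataclass
class Alert:
    timestamp: float  # seconds since some reference point
    source: str
    message: str

def deduplicate(alerts):
    """Keep only the earliest alert for each (source, message) pair."""
    seen = set()
    unique = []
    for alert in sorted(alerts, key=lambda a: a.timestamp):
        key = (alert.source, alert.message)
        if key not in seen:
            seen.add(key)
            unique.append(alert)
    return unique

def correlate(alerts, window_seconds=60.0):
    """Group de-duplicated alerts into incidents by time proximity.

    An alert joins the current incident if it arrives within window_seconds
    of that incident's most recent alert; otherwise it starts a new incident.
    """
    incidents = []
    for alert in deduplicate(alerts):
        if incidents and alert.timestamp - incidents[-1][-1].timestamp <= window_seconds:
            incidents[-1].append(alert)
        else:
            incidents.append([alert])
    return incidents

alerts = [
    Alert(0.0, "db-01", "CPU high"),
    Alert(5.0, "db-01", "CPU high"),       # repeat of the first alert
    Alert(20.0, "web-03", "latency high"),
    Alert(500.0, "nas-02", "disk full"),
]

incidents = correlate(alerts)
for number, incident in enumerate(incidents, start=1):
    print(number, [(a.source, a.message) for a in incident])
```

Four raw alerts collapse into two incidents here. A real system would also need topology, text similarity and learned baselines to decide what belongs together; time proximity alone is only the simplest possible signal.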

 

[Image: Moogsoft Integrations]

 

The bottom line is giving IT operators and DevOps teams more visibility into, and a better handle on, the true state of their digital infrastructure (true observability) so that they can maintain service quality by consistently fixing the right incidents in a timely fashion, Phil said.

 

[Image: Phil Tee in action at AWS re:Invent]

 

“AIOps is the path to continuous assurance, the path to making sure there’s 100% service availability,” Phil said.

Watch the full video of Phil’s AWS re:Invent 2019 presentation for more detail on these and other topics related to AIOps trends and to Moogsoft’s technology and products.

Moogsoft is a pioneer and leading provider of AIOps solutions that help IT teams work faster and smarter. With patented AI analyzing billions of events daily across the world’s most complex IT environments, the Moogsoft AIOps platform helps the world’s top enterprises avoid outages, automate service assurance, and accelerate digital transformation initiatives.

About the author Juan Perez

Longtime tech journalist turned digital marketer, Juan is now Moogsoft's lead content machine.
