Actionable Insights - Faster Incident Resolution with Datadog and Moogsoft Observability Cloud
Richard Whitehead | February 3, 2021

A bi-directional integration between the two solutions enriches both with context, and exposes Moogsoft’s advanced AI algorithms to Datadog users.

A bi-directional integration between the two solutions enriches both with context, and exposes Moogsoft’s advanced AI algorithms to Datadog users.

Context is king, they say, and anything you can do to improve context both makes decisions and assessments more reliable and speeds up the decision process.

A new, bi-directional integration between Moogsoft Observability Cloud and Datadog does just that.

Many SRE teams rely on Datadog to provide comprehensive information about their application stacks. This includes bringing together monitoring basics such as infrastructure utilization with more in-depth APM capabilities such as logging, tracing and synthetic transactions.

Leveraging Datadog’s recently added support for Incidents, related events that have been enriched, and clustered (correlated) by Moogsoft’s algorithms can now automatically be presented to Datadog users as a pre-built, context-rich Incident, within the Incident Dashboard. That’s intelligent observability.

As both solutions are cloud-based, Moogsoft uses the Datadog API to register for alerts (and metrics where relevant). The alerts are then processed by the Moogsoft app, and then another API call to Datadog creates the incident.

By adding the sophistication of Moogsoft’s algorithms, correlation engine, and powerful Workflow Engine, you can achieve levels of automation and noise reduction that would otherwise be cost-prohibitive — automation of course, being the cornerstone of reducing toil in modern DevOps environments.

Moogsoft and Datadog

The benefits of automation are compelling —obviously no SRE wants to be bogged down with repetitive tasks— but the economic benefits are profound as well. Any time spent identifying, collating and triaging errors delays real diagnostic work and erodes error budgets needlessly. Automating this process dramatically reduces triage effort, and the time spent sifting through events.


Moogsoft Observability


The combination of the two products yields some compelling benefits:

  • Achieve more context across incidents: Joint customers can use Moogsoft to enrich telemetry from Datadog Metrics, Events and Logs APIs, as well as CI data from Datadog Topology APIs, with other data from across the full stack, including configuration database management systems (CMDBs), asset management databases, discovery systems and more.
  • Accelerate identification of root cause: Moogsoft automatically adds key incident information such as location, department, business criticality, service relationships, runbooks and escalation processes, allowing Datadog users to begin investigations with actionable intelligence.
  • Minimize the impact and time spent on service-impacting outages: Moogsoft algorithms automatically reduce the “haystack” of data to make anomalies obvious to Datadog customers, helping them address incidents before they introduce service impacts.
  • Accelerate Monitoring Transformation: With Moogsoft, Datadog users can automate event correlation across legacy on-premises systems and modern multi-cloud environments in one place, eliminating duplicates and effectively reducing the overall number of incidents across the full stack.

To see for yourself, sign up for a free trial of Moogsoft Observability Cloud. Or you can find the Moogsoft Observability Cloud in the Datadog Marketplace. There you will also find full instructions on how to configure Moogsoft Observability Cloud to send and receive data from Datadog.

Moogsoft is the AI-driven observability leader that provides intelligent monitoring solutions for smart DevOps. Moogsoft delivers the most advanced cloud-native, self-service platform for software engineers, developers and operators to instantly see everything, know what’s wrong and fix things faster.

About the author


Richard Whitehead

As Moogsoft's Chief Evangelist, Richard brings a keen sense of what is required to build transformational solutions. A former CTO and Technology VP, Richard brought new technologies to market, and was responsible for strategy, partnerships and product research. Richard served on Splunk’s Technology Advisory Board through their Series A, providing product and market guidance. He served on the Advisory Boards of RedSeal and Meriton Networks, was a charter member of the TMF NGOSS architecture committee, chaired a DMTF Working Group, and recently co-chaired the ONUG Monitoring & Observability Working Group. Richard holds three patents, and is considered dangerous with JavaScript.

All Posts by Richard Whitehead

Moogsoft Resources

May 5, 2022

More Tools + More People = Increased Complexity

April 26, 2022

Continuous Availability vs. Continuous Change

April 7, 2022

Episode 4: Mooving to… Successful Engineering in the Remote World

March 24, 2022

Continuous Availability: How It’s Changed, and Why It’s Critical