Theory | Datadog

Service level objectives 101: Establishing effective SLOs

Setting service level objectives for critical user journeys helps organizations understand how they should ...

Alerting 101: Status checks

Learn how to use status checks on hosts, services, processes, and network endpoints to generate actionable ...

Alerting 101: Timeseries metric checks

Learn how metric checks can help you monitor the health and performance of your infrastructure and ...

Monitoring services and setting SLAs with Datadog

In this post, we'll explain how to set SLAs and monitor service-level metrics over time.

Metric graphs 101: Graphing anti-patterns

In this post, we explore three ways that metric graphs are commonly misused and then suggest better solutions ...

Metric graphs 101: Summary graphs

Learn how to effectively use summary graphs: visualizations that flatten a particular span of time to ...

The power of tagged metrics

Tagged metrics let you add infrastructural dimensions to your metrics on the fly—without modifying the way ...

Metric graphs 101: Timeseries graphs

To help you effectively visualize your metrics, this post explores 4 types of timeseries graphs: Line graphs, ...

Why 2016 is the year of monitoring at scale

Since launching Datadog, we've seen our thesis validated on a far broader scale than we had originally ...

Monitoring 101: Investigating performance issues

Once your monitoring system has notified you of real performance issues that require attention, its next job ...

Monitoring 101: Alerting on what matters

Automated alerts allow you to spot problems anywhere in your infrastructure, so that you can rapidly identify ...

Monitoring 101: Collecting the right data

Collect metrics and classify data so that you can receive meaningful, automated alerts about potential ...

Crossing Streams: a love letter to Go io.Reader

The Go io.Reader allows for better control over buffering, resulting in faster code that uses less memory. Learn ...

Go Performance Tales

Looking for performance tips for Go applications? In this blog, read about one software engineer's quest to ...

Learning from AWS failure

Failures are a fact of life. AWS failure just gets more publicity. Instead, let's focus on the more ...

Triggered Alerts in Datadog - providing context to alerts

Datadog's new Triggered Alert screen works to make available important contextual information with one click ...

StatsD, what it is and how it can help you

Learn what StatsD is, how it works, what sets it apart from the rest and what problems it solves.

AWS EBS latency and IOPS: The surprising truth

Performance issues with Amazon Web Services' Elastic Block Storage (EBS) are complex. Learn how to detect and ...

Getting optimal performance with AWS EBS Provisioned IOPS

Optimize your AWS EBS performance by using Provisioned IOPS. Learn more!

Where is open source monitoring going?

We are proud to sponsor the very first edition of Monitorama, the conference of open source monitoring that ...