Datadog APM | Datadog

Datadog Application Observability

Troubleshoot, optimize, and secure your applications faster with end-to-end distributed tracing and service-centric observability at scale, correlated with all telemetry types.

Product Benefits

Find Root Causes Faster with Thread-Level Distributed Tracing

Easily pin bottlenecks down to the method or line of code, including slow I/O, lock contention, and inefficient garbage collection, with full-stack distributed tracing and thread-level continuous profiling
Identify root causes quickly with traces automatically correlated with your logs, infrastructure metrics, database queries, network calls, frontend telemetry and more—all in one view
Monitor OpenTelemetry (OTel)-instrumented apps with support for the OTel API and Collector within the Datadog Agent for full interoperability

Get Live Visibility and Complete Control Over Traces

Search and analyze your ingested traces live over the last 15 minutes
Filter traces based on trace-level attributes, service relationships, endpoints, and other properties, all without needing to learn a complex query language
Retain error and high-latency traces automatically for 15 days
Control cost-visibility tradeoffs with fine-grained ingestion controls and tag-based retention filters
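
As a rough illustration of the retention idea described above, the Python sketch below keeps a trace when it contains an error, exceeds a latency threshold, or matches a tag-based filter. The `Trace` class, the `should_retain` function, the 1000 ms threshold, and the filter format are all hypothetical and exist only for this example; they are not Datadog's configuration format or API.

```python
from dataclasses import dataclass, field


@dataclass
class Trace:
    """Hypothetical, simplified trace record (not a Datadog type)."""
    duration_ms: float
    has_error: bool = False
    tags: dict = field(default_factory=dict)


def should_retain(trace, latency_threshold_ms=1000.0, tag_filters=()):
    """Return True if the trace is an error, is slow, or matches any tag filter."""
    if trace.has_error:
        return True
    if trace.duration_ms >= latency_threshold_ms:
        return True
    # Each filter is a dict of tag key/value pairs that must all match.
    for wanted in tag_filters:
        if all(trace.tags.get(key) == value for key, value in wanted.items()):
            return True
    return False


# Assumed example data: one tag filter and four traces.
filters = [{"env": "prod", "service": "checkout"}]
traces = [
    Trace(120.0, tags={"env": "prod", "service": "checkout"}),  # matches the filter
    Trace(80.0, has_error=True, tags={"env": "staging"}),       # error
    Trace(2500.0, tags={"env": "prod", "service": "search"}),   # slow
    Trace(50.0, tags={"env": "prod", "service": "search"}),     # none of the above
]
retained = [should_retain(t, tag_filters=filters) for t in traces]
print(retained)
```

Only the last trace is dropped here. Adding or removing entries in `filters` is the sketch's stand-in for trading cost against visibility.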

Instantly Generate Logs, Spans, Metrics, and Span Tags without Changing Code

Debug production issues faster by adding log statements without changing code, gaining granular insight into application behavior and service interactions
Add spans to troubleshoot slow requests and specific operations in your application without leaving the Datadog platform
Create metrics on the fly that measure how much time any method in your code consumes in production, and use metric expressions to focus on specific requests

Receive Alerts Only for the Issues that Matter and Eliminate False Positives

Set up recommended alerts with one click for anomalies and outliers that account for daily, weekly, and seasonal fluctuations
Prevent future outages and errors by alerting on metric forecasts
Combine alerts into composite alerts for greater granularity and stronger signal to reduce the noise
Automatically detect unanticipated outliers, anomalies, and errors with Watchdog
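
To make "anomalies that account for seasonal fluctuations" concrete, here is a minimal Python sketch that compares a new value only against values seen at the same point in earlier periods (for example, the same hour on previous days). This is a simple mean-and-standard-deviation check chosen for illustration; it is not the algorithm Datadog or Watchdog uses, and the function name and parameters are assumptions.

```python
from statistics import mean, stdev


def is_anomalous(history, current, period, num_deviations=3.0):
    """Flag `current` if it deviates from values at the same phase of earlier periods."""
    if len(history) < period:
        return False  # less than one full period of history
    # Values exactly one, two, three, ... periods before the new sample.
    same_phase = history[len(history) - period::-period]
    if len(same_phase) < 2:
        return False  # not enough seasonal history to estimate spread
    center = mean(same_phase)
    spread = stdev(same_phase)
    return abs(current - center) > num_deviations * max(spread, 1e-9)


# Assumed data: a repeating 4-sample cycle with a regular peak at position 1.
history = [10, 52, 11, 9, 11, 49, 10, 10, 9, 50, 10, 11, 10]

# The next sample falls on the usual peak, so 51 is expected there.
peak_flag = is_anomalous(history, 51, period=4)

# Dropping the last sample makes the next one fall off-peak, where 51 is unusual.
off_peak_flag = is_anomalous(history[:-1], 51, period=4)

print(peak_flag, off_peak_flag)
```

The same reading of 51 is treated as normal at the recurring peak and as an anomaly off-peak, which is how a seasonal baseline avoids false positives from predictable traffic patterns.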

Centralize Your Service Knowledge and Operations

Achieve end-to-end service ownership at scale, get real-time performance insights, detect and address reliability and security risks, and manage application dependencies all in one place
Get RED (rate, errors, duration) metrics based on 100% of traffic with 15-month retention so you can search, analyze, and visualize any trace using any tag
Automatically discover, catalog, and monitor services—no instrumentation code changes necessary—with Universal Service Monitoring
Reduce mean time to detection through automatic dependency mapping, powered by eBPF technology
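
For readers unfamiliar with RED metrics, the sketch below computes request rate, error rate, and a p95 duration from a list of spans in one time window. The `Span` shape and the `red_metrics` function are hypothetical simplifications for this example, not a Datadog data model or API, and the nearest-rank percentile is just one common way to summarize duration.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """Hypothetical, simplified span record (not a Datadog type)."""
    duration_ms: float
    is_error: bool


def red_metrics(spans, window_seconds):
    """Compute request rate, error rate, and p95 duration for one time window."""
    if not spans or window_seconds <= 0:
        return {"rate_per_s": 0.0, "error_rate": 0.0, "p95_ms": 0.0}
    durations = sorted(s.duration_ms for s in spans)
    # Nearest-rank p95 using integer arithmetic: rank = ceil(0.95 * n).
    rank = (95 * len(durations) + 99) // 100
    errors = sum(1 for s in spans if s.is_error)
    return {
        "rate_per_s": len(spans) / window_seconds,
        "error_rate": errors / len(spans),
        "p95_ms": durations[rank - 1],
    }


# Assumed data: 20 requests in a 10-second window, durations 10..200 ms, 2 errors.
spans = [Span(10.0 * i, is_error=(i % 10 == 0)) for i in range(1, 21)]
metrics = red_metrics(spans, window_seconds=10)
print(metrics)
```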

Full-Stack Defense across Apps, Workloads, and Infrastructure

Track your security posture easily with out-of-the-box threat activity, exposure, and vulnerability ratings captured in the Datadog Severity Score
Triage vulnerability impact in full context with continuous runtime scans across open source libraries
Remediate issues with out-of-the-box actionable guidance and automatic correlation between your application and infrastructure
Quickly discover code vulnerabilities and attack attempts in your Java, .NET, PHP, Node.js, Ruby, Python, Go, and C++ applications

Thread-level Insights into Performance Bottlenecks in Production

Reduce your production code’s latency and resource consumption with continuous code profiling
Identify inefficiencies and apply suggested fixes to optimize application code, sidestepping lengthy reproduction processes with automated insights
Leverage a detailed chronological visualization of code and runtime activity—grouped by threads, fibers, goroutines, or event loops—filtered to a container or a specific trace to diagnose performance issues

Ensure Smooth Deployments and Eliminate Performance Regressions

Follow issues over time with Error Tracking to know when they first started, whether they are still ongoing, and how often they occur
Compare application performance and impact across hosts, versions, and time ranges during rolling, canary, blue/green, or shadow deploys
Troubleshoot faulty deployments quickly with automatic detection, then decide whether to roll back or ship a fix