Datadog APM | Datadog

Datadog Application Observability

Troubleshoot, optimize, and secure your applications faster with end-to-end distributed tracing and service-centric observability at scale, correlated with all telemetry types.

Product Benefits

Find Root Causes Faster with Thread-Level Distributed Tracing

  • Easily pin bottlenecks down to the method or line of code, including slow I/O, lock contention, and inefficient garbage collection, with full-stack distributed tracing and thread-level continuous profiling
  • Identify root causes quickly with traces automatically correlated with your logs, infrastructure metrics, database queries, network calls, frontend telemetry, and more, all in one view
  • Monitor OpenTelemetry (OTel) instrumented apps with support for the OTel API and the OTel Collector within the Datadog Agent for full interoperability

Get Live Visibility and Complete Control Over Traces

  • Search and analyze your ingested traces live over the last 15 minutes
  • Filter traces based on trace-level attributes, service relationships, endpoints, and other properties, all without needing to learn a complex query language
  • Retain error and high-latency traces automatically for 15 days
  • Control cost-visibility tradeoffs with fine-grained ingestion controls and tag-based retention filters

Instantly Generate Logs, Spans, Metrics, and Span Tags without Changing Code

  • Debug production issues faster by adding log statements without changing code, gaining granular insight into application behavior and service interactions
  • Add spans to troubleshoot slow requests and specific operations in your application without leaving the Datadog platform
  • Create metrics on the fly that measure how much time any method in your code consumes in production, and use metric expressions to focus on specific requests

Receive Alerts Only for the Issues that Matter and Eliminate False Positives

  • Set up recommended alerts with one click for anomalies and outliers, accounting for daily, weekly, and seasonal fluctuations
  • Prevent future outages and errors by alerting on metric forecasts
  • Combine alerts into composite alerts for greater granularity, a stronger signal, and less noise
  • Automatically detect unanticipated outliers, anomalies, and errors with Watchdog
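
As an illustration of the composite-alert idea above, the following minimal Python sketch combines two simple threshold checks so that an alert fires only when both conditions hold. The metric names, thresholds, and helper functions (`above`, `all_of`, `any_of`) are hypothetical and are not part of any Datadog API; they only show the boolean logic involved.

```python
from typing import Callable, Dict

Metrics = Dict[str, float]
Check = Callable[[Metrics], bool]


def above(metric: str, threshold: float) -> Check:
    """Build a check that fires when the named metric exceeds the threshold."""
    return lambda m: m.get(metric, 0.0) > threshold


def all_of(*checks: Check) -> Check:
    """Composite check that fires only when every underlying check fires."""
    return lambda m: all(check(m) for check in checks)


def any_of(*checks: Check) -> Check:
    """Composite check that fires when at least one underlying check fires."""
    return lambda m: any(check(m) for check in checks)


# Hypothetical metric names and thresholds, chosen only for illustration.
checkout_degraded = all_of(
    above("error_rate", 0.05),
    above("p95_latency_ms", 800),
)

print(checkout_degraded({"error_rate": 0.08, "p95_latency_ms": 300}))  # False
print(checkout_degraded({"error_rate": 0.08, "p95_latency_ms": 950}))  # True
```

Requiring both conditions means an error spike on its own does not page anyone, which is one way a composite alert can cut noise.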

Centralize Your Service Knowledge and Operations

  • Achieve end-to-end service ownership at scale, get real-time performance insights, detect and address reliability and security risks, and manage application dependencies all in one place
  • Get RED (rate, errors, duration) metrics based on 100% of traffic with 15-month retention so you can search, analyze, and visualize any trace using any tag
  • Automatically discover, catalog, and monitor services with Universal Service Monitoring, with no instrumentation code changes necessary
  • Reduce mean time to detection through automatic dependency mapping, powered by eBPF technology
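
The RED metrics mentioned above could be derived from trace spans roughly as follows. This is a minimal sketch, not Datadog's implementation: the `Span` fields, the `red_metrics` function, and the nearest-rank percentile method are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Span:
    # Hypothetical span record; real trace spans carry many more fields.
    service: str
    duration_ms: float
    is_error: bool


def percentile(sorted_values: List[float], pct: float) -> float:
    """Nearest-rank percentile over an already sorted, non-empty list."""
    rank = max(1, math.ceil(pct / 100 * len(sorted_values)))
    return sorted_values[rank - 1]


def red_metrics(spans: List[Span], window_seconds: float) -> Dict[str, Optional[float]]:
    """Compute request rate, error ratio, and duration percentiles for a window."""
    if not spans:
        return {"rate_per_s": 0.0, "error_ratio": 0.0, "p50_ms": None, "p95_ms": None}
    durations = sorted(span.duration_ms for span in spans)
    errors = sum(1 for span in spans if span.is_error)
    return {
        "rate_per_s": len(spans) / window_seconds,  # Rate
        "error_ratio": errors / len(spans),         # Errors
        "p50_ms": percentile(durations, 50),        # Duration (median)
        "p95_ms": percentile(durations, 95),        # Duration (tail)
    }


spans = [
    Span("checkout", 10.0, False),
    Span("checkout", 20.0, False),
    Span("checkout", 30.0, True),
    Span("checkout", 40.0, False),
]
print(red_metrics(spans, window_seconds=2.0))
```

Computing these figures over all traffic rather than a sample keeps rates and error ratios exact even when only a subset of traces is retained.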

Full-Stack Defense across Apps, Workloads, and Infrastructure

  • Track your security posture easily with out-of-the-box threat activity, exposure, and vulnerability ratings captured in the Datadog Severity Score
  • Triage vulnerability impact in full context with continuous runtime scans across open source libraries
  • Remediate issues with out-of-the-box actionable guidance and automatic correlation between your application and infrastructure
  • Quickly discover code vulnerabilities and attack attempts in your Java, .NET, PHP, Node.js, Ruby, Python, Go, and C++ applications

Thread-Level Insights into Performance Bottlenecks in Production

  • Reduce your production code’s latency and resource consumption with continuous code profiling
  • Identify inefficiencies and apply suggested fixes to optimize application code, sidestepping lengthy reproduction processes with automated insights
  • Diagnose performance issues with a detailed chronological visualization of code and runtime activity, grouped by threads, fibers, goroutines, or event loops and filtered to a container or a specific trace

Ensure Smooth Deployments and Eliminate Performance Regressions

  • Follow issues over time with Error Tracking to know when they first started, whether they are still ongoing, and how often they occur
  • Compare application performance and impact across hosts, versions, and time ranges during rolling, canary, blue/green, or shadow deploys
  • Quickly troubleshoot bad releases with automatic faulty deployment detection, then decide whether to roll back or ship a fix

Loved & Trusted by Thousands

Washington Post, 21st Century Fox Home Entertainment, Peloton, Samsung, Comcast, Nginx