The Monitor

Centrally set up and scale monitoring of your infrastructure and apps with Datadog Fleet Automation

4 minute read

Published

Share

Centrally set up and scale monitoring of your infrastructure and apps with Datadog Fleet Automation
Ethan Debnath

Ethan Debnath

David Iparraguirre

David Iparraguirre

Vignesh Palaniappan

Vignesh Palaniappan

Setting up and scaling observability across large, distributed environments often requires platform and SRE teams to coordinate access to infrastructure hosts and switch between configuration management tools and product-specific documentation. These tasks increase setup time and create delays in establishing visibility of critical services in Datadog. As teams expand their infrastructure, they need to coordinate Datadog configuration changes in a consistent and auditable way.

To help solve these challenges, Datadog Fleet Automation now enables you to set up monitoring across your environments by configuring Datadog products and Agent integrations across your entire Agent fleet from one place. This functionality builds on Fleet Automation’s earlier capabilities that provide visibility into Agent configuration and health and centralized Agent upgrades and policy-based configuration. With this release, platform and SRE teams can manage Agents across all hosts from a centralized interface, reducing manual effort while implementing full-stack observability coverage and standardizing setup across environments.

In this post, we’ll explore how you can use Fleet Automation to:

Easily set up monitoring for your infrastructure and applications

Setting up monitoring for infrastructure and services often involves switching between documentation and configuration repos, in addition to manually accessing hosts. You can now configure your Agents directly from Fleet Automation, enabling your teams to centralize setup and get visibility into their environments faster.

Screenshot of the Fleet Automation editor that shows unconfigured integrations detected on hosts. The user has selected to configure the MySQL integration.
Screenshot of the Fleet Automation editor that shows unconfigured integrations detected on hosts. The user has selected to configure the MySQL integration.

With Fleet Automation, you can roll out configuration changes across your Agents through guided workflows or bring your own YAML. When teams need to configure integrations such as Redis, Apache, or NGINX, they can follow UI-driven steps that provide required parameters and best-practice defaults. Platform engineers who prefer code-driven workflows can edit configurations directly in the advanced YAML editor, which includes syntax validation and semantic checks to help ensure accuracy before deployment.

With this configuration centralized, teams no longer need to log on to hosts or rely on separate processes and tools to make Agent configuration changes. Instead, they can deploy new configuration updates, such as enabling Log Management, Network Device Monitoring (NDM), or Cloud Security, directly from the Fleet Automation UI. Fleet Automation also offers an API to help teams use their existing automation or internal platform tooling to configure Agents.

Maintain complete observability coverage across your environments

Achieving full observability coverage requires continuous visibility into where telemetry data is missing. Fleet Automation helps you identify coverage gaps, validate data flow, and update configurations across your environments, reducing management overhead.

Fleet Automation surfaces hosts where telemetry is missing or misconfigured—whether due to unconfigured integrations or missing Agent settings—so that you can quickly identify and close visibility gaps. You can make targeted configuration changes, such as updating an integration, managing tags, and adjusting log collection settings directly from the Fleet Automation UI or API. You can apply these changes in real time to accelerate troubleshooting and reduce time to resolution.

Screenshot of the Fleet Automation Agent details side panel where you can directly edit and deploy configurations to the Agent in YAML.
Screenshot of the Fleet Automation Agent details side panel where you can directly edit and deploy configurations to the Agent in YAML.

To help you maintain full observability coverage of your infrastructure, Fleet Automation gives teams visibility into whether Agents are actively reporting telemetry data and where data might be missing. This insight enables teams to quickly resolve misconfigurations, address gaps in instrumentation, and keep visibility consistent across environments.

Centralize Datadog management and deployment from a single platform

As environments grow, coordinating Datadog configuration updates across teams and infrastructure layers can become complex. Fleet Automation helps platform teams manage this complexity by providing deployment controls, role-based access control (RBAC), and auditability.

Teams can tailor deployment settings by using phased and concurrent rollouts. For example, you can deploy configuration updates to a small group of Agents first, validate performance or compatibility, and then expand the rollout to the full environment. You can also define maintenance windows for recurring deployments and set RBAC permissions that determine who can initiate changes.

Screenshot of a side panel in Fleet Automation that shows a completed Datadog Agent configuration deployment and a chart of deployment progress over time.
Screenshot of a side panel in Fleet Automation that shows a completed Datadog Agent configuration deployment and a chart of deployment progress over time.

To support distributed teams, Fleet Automation provides deployment notifications and detailed event history from Datadog Audit Trail. Teams can review past configuration changes, see who initiated them, compare previous versions, and understand how updates impacted telemetry data. This historical context helps you troubleshoot and reduce configuration drift without requiring you to access individual hosts.

Simplify observability coverage at scale with Fleet Automation

Fleet Automation helps platform and SRE teams reduce Datadog onboarding time, maintain complete observability coverage of their environments, and simplify configuration changes across the fleet. Whether you’re rolling out configuration updates to monitor Redis across your infrastructure footprint or standardizing logging configurations for a new business unit, Fleet Automation gives you a unified management interface for observability at scale. The capabilities highlighted in this blog post are generally available. To learn more, check out the Fleet Automation documentation and explore how to configure Agents remotely.

If you don’t already have a Datadog account, you can to get started.

Related Articles

Manage all your OpenTelemetry collectors with Datadog Fleet Automation

Manage all your OpenTelemetry collectors with Datadog Fleet Automation

How to install Datadog on AWS hosts with Ansible dynamic inventories

How to install Datadog on AWS hosts with Ansible dynamic inventories

Troubleshoot your monitoring with Agent check status reporting

Troubleshoot your monitoring with Agent check status reporting

How to automate Ansible reporting + deployment of the Datadog Agent

How to automate Ansible reporting + deployment of the Datadog Agent

Start monitoring your metrics in minutes