Today, at Dash 2020, we unveiled new products and features on the Datadog platform that provide engineering teams with even deeper visibility into their distributed applications and infrastructure. With these additions, Datadog further streamlines every step of your release cycle, from shift-left testing in your CI/CD pipelines to profiling your code in production. We also launched the Datadog mobile app to make on-call easier, Compliance Monitoring for enhanced cloud security, and the Marketplace as a collaboration platform for our Partners. Get more details about these developments—and everything else we announced at Dash—in this comprehensive roundup.
With Datadog Incident Management, your teams can now use Datadog to easily create incidents and track them from notification to resolution and postmortem. The new Incidents UI provides a central place for reporting and monitoring the status of all your incidents. Tag each incident with important information like severity, customer impact, incident commander, and affected environments. An incident’s timeline automatically reflects any new information or status changes, and anybody can add additional markdown or interactive graphs from inside Datadog. Incidents also works seamlessly with other new features, including our mobile app, which allows on-call engineers to triage alerts on the go, as well as our enhanced Notebooks, which now supports live, multi-user editing for collaborative post-incident reviews. Read our blog post here for more information about how Datadog can help improve your incident response workflow.
The Datadog mobile app is here! Available for both iOS and Android devices, the app provides on-call engineers with instant access to your Datadog dashboards and alerts from anywhere. Now, when you get paged by one of your integrated on-call services like PagerDuty or OpsGenie, the notification contains a link to the relevant in-app Datadog alert, where you’ll see a summary of recent environment activity along with interactive metric graphs for immediate context. For metric correlations and a better view of the health of your services, you can access mobile-native versions of all your dashboards. You can also view related events around the alert to get a fuller picture of why it was triggered, making it easy to quickly triage the problem’s severity and decide your next course of action—all without needing to hop on your laptop.
With Datadog Security Monitoring, you can detect potential security issues and threats to your environment in real time. To help you more effectively investigate these issues and the context in which they arise, we are introducing out-of-the-box investigation dashboards for Security Monitoring. These dashboards visualize security trends in your environment broken down by key entities, such as IP addresses and users. They also include correlated observability data for richer context, such as HTTP activity tied to an IP or user. Use these dashboards to review your security posture at a glance. If you notice an anomaly, like several Security Signals related to a single IP, you can easily filter the dashboard to drill down further into that entity’s behavior.
We are adding threat intelligence enrichment to Datadog Security Monitoring. Threat intelligence provides valuable external context to ingested log events so you can more quickly triage Security Signals. Threat intelligence identifies whether specific log attributes (e.g., IPs, file hashes, URLs, etc.) may be malicious or benign and provides an associated rationale. For example, if Datadog detects that an IP address associated with a potential attack has also been observed exploiting a known vulnerability, Datadog will flag the resulting Security Signal as malicious so you can prioritize this investigation. Conversely, if the IP is known to be benign, it is likely a scanner gathering intelligence (e.g., shodan.io), and any associated Security Signal can be safely ignored, reducing alert fatigue. Datadog maintains and updates threat intelligence feeds curated by trusted providers, so logs are enriched with up-to-date and accurate context.
Back in April, we announced Datadog Security Monitoring, which lets you detect and track security threats across your infrastructure in real-time, giving development, operations, and security teams the insights they need to secure your environment. Now, with the release of Datadog Compliance Monitoring, we are helping you to continuously monitor production environments for misconfigurations and maintain compliance with rigorous industry standards and frameworks, such as PCI DSS, SOC 2, or CIS. Compliance Monitoring automatically watches the state of key cloud resources, such as your security groups, storage buckets, and load balancers, in order to detect potential misconfigurations and compliance violations across every level of your stack. The Datadog Agent now checks the local configuration of your servers, containers, and Kubernetes clusters. The Agent also actively monitors the filesystem of containerized environments to provide a production-ready file integrity monitoring (FIM) solution. Finally, with custom detection and compliance rules, users can expand the scope of their out-of-the-box detection capabilities without needing to learn a proprietary query language. Read our blog post here and sign up for the beta here.
Datadog Continuous Profiler allows you to analyze your application’s performance in production—down to single lines of code and with minimal overhead. Continuous Profiler collects multiple types of profiles (CPU, memory allocation, concurrent locks, I/O, and more) across your entire stack. Inspecting these profiles helps you identify and optimize the most resource-consuming methods in your application, allowing you to reduce end-user latency and cloud provider costs. It also automatically performs heuristic code analysis and shares actionable insights for performance improvements, so anyone on your team can quickly address main problem areas. Whether you’re releasing a new version, investigating an outage, or troubleshooting slow requests, Continuous Profiler helps you debug code-level performance issues across any scope, from a single request to a fleet of applications. Learn more about Continuous Profiler in our blog post.
Datadog Synthetic tests are both robust and easy to create, which saves your team time and prevents alert fatigue. With Synthetic CI/CD Testing, you’re able to integrate your existing, production-level synthetic tests into your CI/CD pipeline in order to prevent bad merges, to automate rollbacks, and to run tests alongside canary deployments. This shift-left approach to testing can be achieved with both our flexible API endpoints and CLI, so you can seamlessly incorporate synthetic tests into the tooling you’ve already set up. Read our blog post to learn more about Synthetic CI/CD Testing.
Like web and backend developers, mobile developers need real-time visibility into application performance and user experience. But mobile applications run at the edge of your infrastructure, making it more challenging to see the full picture. Datadog Mobile RUM helps mobile developers navigate this complexity and deploy applications with confidence. With Mobile RUM, you can analyze usage information across any app version, OS, or device—and get end-to-end visibility for investigating errors and crashes. Because Mobile RUM is tightly integrated with Datadog APM, teams can quickly understand if an issue originated at the level of the application, backend server, or network, and get the insights they need to improve their users’ mobile experience. Read more about Mobile RUM here.
Datadog APM Live Search allows you to search 100 percent of ingested traces by any tag on any span over the last 15 minutes. Traces are streamed live as users navigate your website or mobile application, giving you complete access to user requests during an outage or performance investigation. Filter down to traces matching infrastructure, application, or custom tags and watch real-time analytics for requests, errors, and latency percentiles as you deploy a new version of code to production.
With the new Datadog Marketplace, Datadog Partners can now use Datadog as a development and collaboration platform to build custom apps, including third-party integrations and monitoring tools, and offer them to their customers and the Datadog community at large. Backed by strict vetting standards and a fully managed billing system, the Marketplace opens up new ways for our Partners to increase the scope of their monitoring solutions and provide original integrations to customers. For more information, see our blog post here.
Smart alerting is essential to any monitoring workflow; alerts draw your attention to problems in your system before they become critical. But as your stack grows and evolves, it’s not always clear which monitoring data is most important. Now, Datadog provides Recommended Monitors, a suite of preconfigured, customizable alert queries and thresholds that enable customers to enact monitoring best practices for the technologies they rely on with the click of a button. Recommended Monitors are based on the expertise of our many technology partners, as well as our own experience and the experience of thousands of customers, so you can trust that problems in your system will not go undetected. Recommended Monitors are currently in private beta. For more information, read our blog post here.
BigPanda is a data analysis platform that takes alert input from monitoring tools and uses machine learning to correlate them and perform automatic root cause analysis. With Datadog’s new integration, you can send your Datadog alerts to BigPanda in order to surface patterns and identify the cause more quickly. After setting up the integration, Datadog will forward all your configured Datadog alerts to BigPanda for processing. BigPanda correlates them with your other app data to determine the cause of a change in a service. When BigPanda identifies a correlation and creates an incident, it will also push events back to Datadog.
Datadog customers can now send their AWS service logs directly to Datadog with an Amazon Kinesis Data Firehose delivery stream. Through this easy-to-configure process, users can explore and analyze service logs to gain deeper insights into the state of their AWS infrastructure and applications. Logs streaming from Kinesis Data Firehose automatically include key metadata such as their source, so users can quickly identify which AWS service generated the log. And with features like Logging without Limits™, users can analyze and generate metrics from all their logs, letting them uncover and alert on trends in their AWS services while only storing the logs they need. Read more about how to get started here.
Datadog now integrates with AWS Serverless Application Model (SAM) and AWS Cloud Developer Kit (CDK) to provide zero instrumentation serverless observability. With these new integrations—and our preexisting Serverless Framework plugin—teams can use their choice of developer tooling to streamline how they build, manage, and troubleshoot their serverless applications. Datadog’s new CloudFormation macro automatically ingests enhanced Lambda metrics, traces, and logs, without any code changes. Once you’re monitoring your serverless infrastructure with Datadog, you can easily search and filter functions by any tag, alert on cold starts, visualize how requests flow across Lambda functions and their dependencies, and more. Read our blog post to get started with the macro.
At Dash 2020, a global community of leaders and developers came together to bring their teams, infrastructure, and applications to the next level of scale and velocity. If you missed our virtual event, stay tuned for videos from our keynotes and technical talks to learn how companies like Zillow, Nextdoor, and Wayfair develop a culture of observability. If you’re new to Datadog, you can sign up for a 14-day free trial now to start using all our latest features and products.