LegalZoom reduces MTTR and transforms incident management with Datadog Incident Response | Datadog
LegalZoom reduces MTTR and transforms incident management with Datadog Incident Response

Case Study

LegalZoom reduces MTTR and transforms incident management with Datadog Incident Response

About LegalZoom

LegalZoom is an online legal technology and services company that helps individuals and businesses with a variety of legal needs, such as forming a business, creating estate plans, protecting intellectual property, and managing compliance.

Legal technology
~1,400 Employees
California
“Engineers can now move from detection to understanding to resolution more quickly, without the burden of navigating multiple systems during high-pressure situations.”
case-studies/legalzoom/brett-dudo
“Engineers can now move from detection to understanding to resolution more quickly, without the burden of navigating multiple systems during high-pressure situations.”
Brett Dudo Principal Software Engineer LegalZoom

Why Datadog?

  • Unified platform for paging, incident management, and observability
  • Team-based paging model aligned with their operational model
  • Fewer tools to maintain and less duplicated data
  • AI-assisted incident and postmortem summaries reduce toil
  • Slack and Zoom integrations ease collaboration

Challenge

LegalZoom’s existing incident response tooling had grown rigid and slow to evolve, forcing engineers to work around the tool when managing incidents, postmortems, and follow-up action items. The result was higher cognitive load and slower coordination during critical incidents.

Key results

Reduced MTTR

Eliminated context switching during incidents

Improved collaboration

Simplified team-based paging and communication

Reduced cognitive load

Transformed data into actionable insights

Rigid incident management tooling and cognitive overload hinder engineering team efficiency

LegalZoom is an online legal technology and services company that helps individuals and businesses with a variety of legal needs. Since its founding in 2001, the company has served over 10 million customers, helped more than 2 million people start their businesses, and assisted in the creation of over 3.5 million estate planning documents.

LegalZoom automates routine legal tasks, enhances efficiency, and improves customer experience by combining an intuitive online platform with advanced tools, with a focus on artificial intelligence and data analytics. The company employs approximately 1,400 people, including more than 200 engineers who build the tools and platforms that help automate and scale LegalZoom’s legal services.

As LegalZoom scaled, the Engineering team experienced rising cognitive load during incidents and a growing mismatch between their tooling and how they operated. Their previous solution suffered from configuration drift and lacked the product investment needed to keep pace with their organization, creating friction during incidents.

The team needed a more self-service incident response solution that empowered engineers to manage incidents directly, rather than relying on a traditional NOC service model, while reducing the burden on engineers during critical moments. LegalZoom began searching for a new incident response solution that would modernize its approach to incident management and enable their engineering teams to work more efficiently.

LegalZoom engineering team collaborating

Unified incident response and observability reduce complexity and improve collaboration

LegalZoom has been a Datadog customer since 2019 and is fully committed to the platform. The company uses a broad suite of Datadog solutions, including Application Performance Monitoring (APM), Internal Developer Portal (IDP), Real User Monitoring (RUM), CI Visibility, and more.

Recently, LegalZoom added Datadog Incident Response to replace its existing solution and help achieve its goal of revamping incident operations and consolidating tools. The decision was driven by Datadog’s ability to unify incident response with the observability tools the team already relied on. “Having paging, incident management, and observability in one platform was a big selling point,” says Brett Dudo, Principal Software Engineer in the Engineering Services organization.

As LegalZoom’s architecture expanded, the team needed clearer visibility across an increasingly diverse set of services. Years of legacy systems, new microservices, and multiple languages made consistent monitoring difficult. Universal Service Monitoring (USM) changed that by giving engineers a single, consistent set of metrics across every service. “USM provides universal metrics for all of your services,” says Dudo. “You can create monitors and SLOs that cover everything, instead of maintaining ad hoc monitors for each team or tech stack.”

With every service automatically detected and mapped, engineers enter incidents with a shared understanding of how upstream and downstream components relate, even for systems they don’t own directly. This visibility also strengthens LegalZoom’s team-based paging model. “If a downstream service is failing, I can find the owner instantly,” says Dudo. “All that data is right there—click it, page them, and you’re talking.”

The consolidated platform delivered immediate benefits to LegalZoom’s Engineering teams. “Engineers can now move from detection to understanding to resolution more quickly, without the burden of navigating multiple systems during high-pressure situations,” says Dudo.

Reducing MTTR and transforming incident management culture

LegalZoom has strengthened its incident management culture by centralizing how engineers collaborate, communicate, and investigate. Having one shared process backed by shared data reduces friction across teams and reinforces consistent operational behavior during high-pressure moments. “We’re not paging services anymore; we’re paging teams. It’s faster and cleaner, and it aligns with how we actually operate,” says Dudo.

Engineers can now manage incidents and perform post-incident actions in the same place where they monitor and investigate, without switching tools or losing context. Metrics, logs, and traces all flow into a single view that connects observability with response. Collaboration begins instantly when an incident is declared: “The automation kicks in right away. You get a Slack channel, start a Zoom, and begin investigating without missing a beat. It keeps everyone on the same page and gets the right people in faster,” Dudo explains.

“The automation kicks in right away. You get a Slack channel, start a Zoom, and begin investigating without missing a beat. It keeps everyone on the same page and gets the right people in faster.”

AI also helps engineers get up to speed fast. “The incident AI automatically summarizes what’s happened in Slack. It’s great when you’re jumping in mid-incident,” he adds. “It saves you from scrolling through tons of messages and helps new responders understand what’s already been discussed, so they can contribute right away.”

“The incident AI automatically summarizes what's happened in Slack. It's great when you're jumping in mid-incident. It saves you from scrolling through tons of messages and helps new responders understand what's already been discussed, so they can contribute right away.”

LegalZoom has now transformed how it documents and communicates during incidents. Every event, link, and update lives within a shared timeline, giving responders an immediate, trustworthy source of truth. “The timeline’s fantastic. You can add any link—dashboard, APM page, whatever—and tell the full story. After an incident, you have a complete record: Slack thread, timeline, impacts. It’s all there,” says Dudo.

The team now treats every incident as an opportunity to learn, not just react. AI-assisted summaries and templated postmortems make it easy to capture insights while memories are fresh. “Postmortems can be summarized automatically with generative AI. It builds out the document for you; templated, clean, ready to go,” Dudo says.

Looking ahead, LegalZoom plans to deepen its investment in operational maturity, expanding Workflow Automation, introducing dedicated incident commander schedules, and integrating more Datadog AI capabilities. These changes reinforce a culture of ownership and ensure every incident follows a consistent, blameless, and data-rich process.

“When systems feel infinite, that's overwhelming. Datadog helps make them feel finite.”

Ultimately, LegalZoom’s Engineering team has transformed a once overwhelming process into something structured, transparent, and collaborative. “When systems feel infinite, that’s overwhelming,” Dudo explains. “Datadog helps make them feel finite.”

리소스