Monitor NGINX Plus load-balancing metrics

Carlo Cabanilla

In a short 5 years, NGINX has gone from powering 0 to powering 1 in every 6 of the busiest websites on the Internet. Datadog uses NGINX and chances are, you do too. Datadog is pleased to announce the expansion of our current NGINX monitoring to support the additional features offered through NGINX Plus.

Launched in 2013, NGINX Plus is a fully supported, commercial version of NGINX that offers basic open source modules and additional enterprise-class features. In particular, the premium version provides:

Advanced activity monitoring of HTTP requests by status and upstream server in much greater detail
Health checks for custom pages and applications to support more complex failure cases
And, advanced load balancing features including on-the-fly reconfiguration of upstream server groups

Monitor NGINX Plus metrics in Datadog

Datadog expands the existing NGINX community version plugin to support the new advanced activity monitoring in NGINX Plus. Datadog now collects all metrics exposed by the NGINX Plus status module, including:

Connections accepted, dropped, active, idle
Bytes sent and received by server zone
Number of health checks failed
Response counts by status code and upstream server

Find a sample of all NGINX Plus metric names and their tagging structure here.

NGINX connections and upstreams

Tags are the linchpin of monitoring with Datadog. They are used everywhere to navigate data, define graphs, and create alerts easily whether you have 5 or 5,000 servers. With tags you can slice and dice metrics, graphs, alerts and get an inventory of servers to quickly pinpoint the servers you wish to look into.

The new NGINX integration tags all NGINX metrics for you with the correct upstream server and server zone so that you can very easily see how traffic gets distributed between NGINX servers, zones, and individual upstream servers.

For example, in the graph below we are showing active connections per minute, split by upstream server.

Aggregate, alert on, and graph NGINX Plus metrics

Datadog automatically creates a metric for every combination of upstream pool, upstream server, and server zone. These metrics can be aggregated in order to monitor an entire upstream pool or cluster of upstream pools. Furthermore, metrics from multiple NGINX servers can be aggregated into a single, high-level metric for the entire web service served via NGINX.

You can graph any of these metrics on Datadog dashboards or receive alerts on them via email, PagerDuty, HipChat, Slack or webhooks. Conducting a drilled-down analysis for an event that happened over the last year is a snap, using Datadog’s high-resolution historical data.

Correlate events from other systems with NGINX Plus metrics

You can overlay events from the other + Datadog integrations on top of the NGINX Plus metrics. With a side-by-side view of events and related dashboards of metrics, you can drastically cut the time you spend investigating and correlating events and metrics coming from different parts of your infrastructure.

If you would like to get visibility into your HTTP load balancing as described in this article, first sign up for a NGINX Plus Free Trial. Then try a 14-day free trial of Datadog to instantly gain insight and monitor NGINX Plus metrics. Visit our Datadog NGINX Integration page to learn more about the Datadog NGINX integration.

Get Started with Datadog

Monitor NGINX Plus Load Balancing Metrics

Monitor NGINX Plus metrics in Datadog

NGINX connections and upstreams

Aggregate, alert on, and graph NGINX Plus metrics

Correlate events from other systems with NGINX Plus metrics

Start monitoring your metrics in minutes

Monitor NGINX Plus metrics in Datadog

NGINX connections and upstreams

Aggregate, alert on, and graph NGINX Plus metrics

Correlate events from other systems with NGINX Plus metrics

Related jobs at Datadog

We're always looking for talented people to collaborate with

Start monitoring your metrics in minutes