Monitor Cilium and Kubernetes performance with Hubble

Author: Mallory Mooney

Published: July 25, 2022

In Part 1, we looked at some key metrics for monitoring the health and performance of your Cilium-managed Kubernetes clusters and network. In this post, we’ll look at how Hubble enables you to visualize network traffic via a CLI and user interface. But first, we’ll briefly look at Hubble’s underlying infrastructure and how it provides visibility into your environment.

Hubble’s underlying infrastructure

There are several Linux-based and Kubernetes command-line tools that enable you to review network data for individual pods, such as their IP addresses and hostnames. But in order to efficiently troubleshoot any performance degradation, such as service latency, you need a better understanding of pod-to-pod and client-to-pod communication. Hubble collects and aggregates network data from every pod in your environment to give you a better view into request throughput, status, errors, and more. Hubble also integrates with OpenTelemetry, enabling you to export log and trace data from Cilium-managed networks to a third-party monitoring platform.

Because Cilium can control traffic at layers 3, 4, and 7 of the OSI model, Hubble enables you to monitor multiple levels of network traffic, such as TCP connections, DNS queries, and HTTP requests across clusters or cluster meshes. To accomplish this, Hubble leverages two primary components: servers and the Hubble Relay.

Diagram of Hubble's architecture

Hubble servers run alongside the Cilium agent on each cluster node. Each server implements an Observer service to monitor pod traffic and a Peer service to keep track of Hubble instances on other nodes. The Hubble Relay is a stand-alone component that collects network flow data from each server instance and makes it available to the Hubble UI and CLI via a set of APIs.
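Once Hubble is enabled (covered below), you can see both components in your cluster with standard kubectl commands. The namespace and labels in this sketch assume a default installation, in which Cilium and Hubble components run in the kube-system namespace:

kubectl -n kube-system get pods -l k8s-app=cilium
kubectl -n kube-system get deployment hubble-relay

The first command lists the Cilium agent pods (one per node, each running a Hubble server), and the second shows the stand-alone Hubble Relay deployment.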

Though the Hubble platform is deployed automatically with Cilium, it is not enabled by default. You can enable it by running the following command on your host:

cilium hubble enable

You can also check the status of both Hubble and Cilium by running the cilium status command, which should give you output similar to the following:

Cilium CLI output

You will see an error status in the command’s output if either service failed to launch. This issue can sometimes happen if underlying nodes are running out of memory. Allocating more memory and relaunching Cilium can help resolve the problem.
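If you plan to run the Hubble CLI (covered in the next section) from your workstation rather than from inside the cluster, you will also need network access to the Hubble Relay. The following is a minimal sketch, assuming a default installation in which the Relay is exposed on local port 4245:

cilium hubble port-forward &
hubble status

The cilium hubble port-forward command forwards the Relay's API to localhost:4245, and hubble status confirms that the CLI can reach it and is receiving flow data.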

The Hubble CLI

Hubble’s CLI extends the visibility that is provided by standard kubectl commands like kubectl get pods to give you more network-level details about a request, such as its status and the security identities associated with its source and destination. You can view this information via the hubble observe command and monitor traffic to, from, and between pods in order to determine if your policies are working as expected. For example, you can view all dropped requests between services by using the following command:

hubble observe --verdict DROPPED

May 12 13:35:35.923: default/service-a:58578 (ID:1469) -> default/service-c:80 (ID:851) http-request DROPPED (HTTP/1.1 PUT http://service-c.default.svc.cluster.local/v1/endpoint-1)

The sample output above shows that the destination pod (service-c) dropped requests from the source pod (service-a). You can investigate further by adding the -o json option to the hubble observe command. The JSON output provides more context for an event, including:

  • the request event’s verdict and relevant error message (e.g., drop_reason_desc)
  • the direction of the request (e.g., traffic_direction)
  • the type of policy that manages the pods associated with the request (e.g., "Type")
  • the IP addresses and ports for the source and destination endpoints

Using the same approach, you can review the command’s JSON output to determine why requests from the service-b pod to the service-c pod are being dropped:

{
  "time": "2022-05-12T14:16:09.475485361Z",
  "verdict": "DROPPED",
  "drop_reason": 133,
  "ethernet": {...},
   

  "IP": {
    "source": "10.0.0.87",
    "destination": "10.0.0.154",
    "ipVersion": "IPv4"
  },
  "l4": {...},
         
    

  "source": {
    "ID": 3173,
    "identity": 12878,
    "namespace": "default",
    "labels": [
      "k8s:app.kubernetes.io/name=service-b",
      "k8s:class=service-b",
      "k8s:io.cilium.k8s.policy.cluster=minikube",
      "k8s:io.cilium.k8s.policy.serviceaccount=default",
      "k8s:io.kubernetes.pod.namespace=default",
      "k8s:org=gobs-1"
    ],
    "pod_name": "service-b"
  },
  "destination": {
    "ID": 939,
    "identity": 4418,
    "namespace": "default",
    "labels": [
      "k8s:app.kubernetes.io/name=service-c",
      "k8s:class=service-c",
      "k8s:io.cilium.k8s.policy.cluster=minikube",
      "k8s:io.cilium.k8s.policy.serviceaccount=default",
      "k8s:io.kubernetes.pod.namespace=default",
      "k8s:org=gobs-2"
    ],
    "pod_name": "service-c",
    "workloads": [...]
     
  },
  "Type": "L3_L4",
  "node_name": "minikube/minikube",
  "event_type": {
    "type": 1,
    "sub_type": 133
  },
  "traffic_direction": "INGRESS",
  "drop_reason_desc": "POLICY_DENIED",
  "Summary": "TCP Flags: SYN"
}

In the sample snippet above, you can see that requests were dropped ("drop_reason_desc": "POLICY_DENIED") due to an L3/L4 policy ("Type": "L3_L4"), which indicates that Cilium was managing traffic appropriately in this case. You can modify your policy if you need to enable communication between these two pods.
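Beyond --verdict and -o json, the hubble observe command supports flags for narrowing the flow stream to the workloads you care about; exact flag availability can vary slightly between Hubble CLI versions. For example:

hubble observe --verdict DROPPED --to-pod default/service-c
hubble observe --namespace default --protocol http
hubble observe --follow

The first command limits dropped flows to those destined for the service-c pod, the second shows HTTP traffic in the default namespace, and the third streams new flows continuously instead of printing a one-time snapshot.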

The Hubble UI

While the CLI provides insight into networking issues for individual pods, you still need visibility into how these problems affect the entire cluster. The Hubble UI offers a high-level service map for monitoring network activity and policy behavior, giving you a better understanding of how your pods interact with one another. Service maps can automatically capture interdependencies between Kubernetes services, making them especially useful for monitoring large-scale environments. This level of visibility enables you to confirm that your network is routing traffic to the appropriate endpoints.

To get started, you can enable and access the Hubble UI by running the following commands:

cilium hubble enable --ui
cilium hubble ui

The Cilium CLI will automatically open the Hubble UI in your browser at http://localhost:12000/, where you can select a Kubernetes namespace to view the service map for a particular set of pods. In the example service map below, the service-b pod is attempting to communicate with the service-c pod, but its requests are failing.

Hubble UI service map

In the request list below the service map, you can see that Cilium is dropping requests between the service-b and service-c pods. You can troubleshoot further by selecting an individual request to view more details and determine if the drop is the result of a network policy or another issue. Hubble’s UI leverages the same data points as its CLI, so you have complete context for mitigating the problem.
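Note that the cilium hubble ui command works by proxying the Hubble UI service to your local machine. If that helper isn't available in your environment, you can port-forward the service directly with kubectl; the namespace, service name, and port below assume a default installation:

kubectl -n kube-system port-forward svc/hubble-ui 12000:80

You can then open http://localhost:12000/ in your browser as before.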

Monitor network traffic with Hubble

In this post, we looked at how Hubble enables you to monitor network traffic across your Cilium-managed infrastructure. Check out Cilium’s documentation to learn more about leveraging the Hubble platform for monitoring the health and performance of your Kubernetes network. In the next part of this series, we’ll show you how Datadog provides complete visibility into Cilium metrics, logs, and network data.