Skip to content

Latest commit

 

History

History
296 lines (221 loc) · 14.1 KB

README.md

File metadata and controls

296 lines (221 loc) · 14.1 KB

OpenTelemetry Target Allocator Example

This is the companion repo for the KubeCon EU 2024 Talk, "Prometheus and OpenTelemetry: Better Together".

See the slides here.

Watch the talk here.

In this example, we will:

  1. Install KinD (KUbernetes in Docker)
  2. Install the ServiceMonitor and PodMonitor Prometheus CRs
  3. Install the OTel Operator
  4. Deploy 3 Python services:

The Target Allocator configuration and sample Python Prometheus app are based on this tutorial.

Installation

This project can be run as a Development (Dev) Container in:

Or, if Dev Containers aren't your jam, then skip Step 0 and go straight to Step 1.

0- Build and run the Dev Container locally (optional)

This step requires the Dev Container CLI. You can also build and run your Dev Container using the Dev Container command palette in VSCode.

git clone https://github.com/avillela/otel-target-allocator-talk.git
cd otel-target-allocator-talk

devcontainer build --no-cache
devcontainer open

1- Install KinD and set up k8s cluster

The scripts below will install KinD, and then will install the following in the newly-created cluster:

  • the PodMonitor and ServiceMonitor Prometheus CRs
  • cert-manager (an OTel Operator pre-requisite)
  • the OTel Operator.
./src/scripts/01-install-kind.sh
./src/scripts/02-k8s-setup.sh

2- Build and deploy image

Build Docker image and deploy to KinD

./src/scripts/03-build-and-deploy-images.sh

Verify that images are in KinD:

docker exec -it otel-target-allocator-talk-control-plane crictl images | grep target-allocator

Reference here.

3a - Kubernetes Deployment (Collector stdout only)

🚨 This step deploys resources to send telemetry to the OTel Collector's sdout only. If you want to send telemetry to Dynatrace or ServiceNow Cloud Observability (formerly known as Lightstep), you'll need to skip this step and follow Step 3b or Step 3c, respectively, instead.

Now you are ready to deploy the Kubernetes resources

./src/scripts/04-deploy-resources.sh

3b- Kubernetes deployment with Dynatrace backend

🚨 If you want to send telemetry to Dynatrace), you'll need to follow the steps below, and skip Step 3a and Step 3c.

To send telemetry to Dynatrace, you will first need a Dynatrace free trial account. You will also need to obtain an access token.

We're going to store the access token in a Kubernetes secret, and will map the secret to an environment variabe in the OpenTelemetryCollector CR.

First, create a secrets file for the Lightstep token.

tee -a src/resources/00-secret-dt.yaml <<EOF
 apiVersion: v1
 kind: Secret
 metadata:
   name: otel-collector-secret
   namespace: opentelemetry
 data:
   DT_TOKEN: <base64-encoded-dynatrace-token>
   DT_ENV_ID: <base64-encoded-dynatrace-environment-identifier>
 type: "Opaque"
EOF

Replace <base64-encoded-LS-token> with your own [access token] (https://docs.dynatrace.com/docs/discover-dynatrace/references/dynatrace-api/basics/dynatrace-api-authentication#create-token). Make sure you grant the correct permissions to your token so that you Dynatrace can ingest traces (opentelemetryTrace.ingest), logs (logs.ingest), and metrics (metrics.ingest). See Dynatrace Token Scopes.

Replace <base64-encoded-dynatrace-environment-identifier> with your own environment ID. Your Dynatrace tenant will have a URL like this: https://<dynatrace-environment-identifier>.apps.dynatrace.com. The part before between https:// and apps.dynatrace.com is what your environment identifier. Make sure you base64 encode that value before adding it to your secrets YAML.

Be sure to Base64 encode it like this:

echo <base64-encoded-dynatrace-token> | base64

Or you can Base64-encode it through this website.

Finally, deploy the Kubernetes resources:

# Don't forget to pass the "dt" argument
./src/scripts/04-deploy-resources.sh dt

For more on OTel Collector configuration with Dynatrace, check out this article.

3c- Kubernetes deployment with ServiceNow Cloud Observability backend

🚨 If you want to send telemetry to ServiceNow Cloud Observability (formerly known as Lightstep), you'll need to follow the steps below, and skip Step 3a and Step 3b.

To send telemetry to ServiceNow Cloud Observability, you will first need a Cloud Observability account. You will also need to obtain an access token.

We're going to store the access token in a Kubernetes secret, and will map the secret to an environment variabe in the OpenTelemetryCollector CR.

First, create a secrets file for the Lightstep token.

tee -a src/resources/00-secret-ls.yaml <<EOF
 apiVersion: v1
 kind: Secret
 metadata:
   name: otel-collector-secret
   namespace: opentelemetry
 data:
   LS_TOKEN: <base64-encoded-LS-token>
 type: "Opaque"
EOF

Replace with your own [access token] (https://docs.lightstep.com/docs/create-and-manage-access-tokens#create-an-access-token)

Be sure to Base64 encode it like this:

echo <LS-access-token> | base64

Or you can Base64-encode it through this website.

Finally, deploy the Kubernetes resources:

# Don't forget to pass the "ls" argument
./src/scripts/04-deploy-resources.sh ls

4- Check logs

This command will tail the Collector logs

kubectl logs -l app.kubernetes.io/component=opentelemetry-collector -n opentelemetry --follow

This command will return unique items from the Collector logs containing "Name:"

kubectl logs otelcol-collector-0 -n opentelemetry | grep "Name:" | sort | uniq

Check Target Allocator logs:

kubectl logs -l app.kubernetes.io/component=opentelemetry-targetallocator -n opentelemetry --follow

Troubleshooting

Based on this article

Expose target allocator port

kubectl port-forward svc/otelcol-targetallocator -n opentelemetry 8080:80

So we can get list of jobs

curl localhost:8080/jobs | jq

Sample output:

{
  "serviceMonitor/opentelemetry/sm-example/2": {
    "_link": "/jobs/serviceMonitor%2Fopentelemetry%2Fsm-example%2F2/targets"
  },
  "serviceMonitor/opentelemetry/sm-example/0": {
    "_link": "/jobs/serviceMonitor%2Fopentelemetry%2Fsm-example%2F0/targets"
  },
  "serviceMonitor/opentelemetry/sm-example/1": {
    "_link": "/jobs/serviceMonitor%2Fopentelemetry%2Fsm-example%2F1/targets"
  },
  "otel-collector": {
    "_link": "/jobs/otel-collector/targets"
  }
}

Peek into one of the jobs:

curl localhost:8080/jobs/serviceMonitor%2Fopentelemetry%2Fmy-app%2F1/targets | jq

Sample output:

{
  "otelcol-collector-0": {
    "_link": "/jobs/serviceMonitor%2Fopentelemetry%2Fmy-app%2F1/targets?collector_id=otelcol-collector-0",
    "targets": [
      {
        "targets": [
          "10.244.0.29:8082"
        ],
        "labels": {
          "__meta_kubernetes_endpointslice_annotation_endpoints_kubernetes_io_last_change_trigger_time": "2024-02-28T18:09:34Z",
          "__meta_kubernetes_endpointslice_labelpresent_app_kubernetes_io_name": "true",
          "__meta_kubernetes_endpointslice_endpoint_conditions_serving": "true",
          "__meta_kubernetes_endpointslice_labelpresent_endpointslice_kubernetes_io_managed_by": "true",
          "__meta_kubernetes_service_labelpresent_app": "true",
          "__meta_kubernetes_pod_label_pod_template_hash": "f89fdbc4f",
          "__meta_kubernetes_endpointslice_endpoint_conditions_terminating": "false",
          "__meta_kubernetes_pod_label_app": "my-app",
          "__meta_kubernetes_pod_ip": "10.244.0.29",
          "__meta_kubernetes_pod_container_image": "otel-target-allocator-talk:0.1.0-py-otel-server",
          "__meta_kubernetes_endpointslice_address_type": "IPv4",
          "__meta_kubernetes_service_annotation_kubectl_kubernetes_io_last_applied_configuration": "{\"apiVersion\":\"v1\",\"kind\":\"Service\",\"metadata\":{\"annotations\":{},\"labels\":{\"app\":\"my-app\",\"app.kubernetes.io/name\":\"py-otel-server\"},\"name\":\"py-otel-server-svc\",\"namespace\":\"opentelemetry\"},\"spec\":{\"ports\":[{\"name\":\"py-server-port\",\"port\":8082,\"protocol\":\"TCP\",\"targetPort\":\"py-server-port\"}],\"selector\":{\"app.kubernetes.io/name\":\"py-otel-server\"}}}\n",
          "__meta_kubernetes_endpointslice_annotationpresent_endpoints_kubernetes_io_last_change_trigger_time": "true",
          "__meta_kubernetes_pod_container_port_number": "8082",
          "__meta_kubernetes_pod_uid": "06cc65c5-61a2-4d0c-87ef-74302f977d48",
          "__meta_kubernetes_endpointslice_labelpresent_kubernetes_io_service_name": "true",
          "__meta_kubernetes_endpointslice_label_endpointslice_kubernetes_io_managed_by": "endpointslice-controller.k8s.io",
          "__meta_kubernetes_service_name": "py-otel-server-svc",
          "__meta_kubernetes_endpointslice_endpoint_conditions_ready": "true",
          "__meta_kubernetes_endpointslice_address_target_kind": "Pod",
          "__meta_kubernetes_pod_annotation_instrumentation_opentelemetry_io_inject_python": "true",
          "__meta_kubernetes_pod_container_port_name": "py-server-port",
          "__meta_kubernetes_endpointslice_address_target_name": "py-otel-server-f89fdbc4f-lbb7z",
          "__meta_kubernetes_endpointslice_labelpresent_app": "true",
          "__meta_kubernetes_pod_host_ip": "172.24.0.2",
          "__meta_kubernetes_pod_container_name": "py-otel-server",
          "__meta_kubernetes_namespace": "opentelemetry",
          "__meta_kubernetes_endpointslice_label_kubernetes_io_service_name": "py-otel-server-svc",
          "__meta_kubernetes_pod_controller_name": "py-otel-server-f89fdbc4f",
          "__meta_kubernetes_pod_labelpresent_app_kubernetes_io_name": "true",
          "__meta_kubernetes_service_label_app_kubernetes_io_name": "py-otel-server",
          "__meta_kubernetes_pod_node_name": "otel-target-allocator-talk-control-plane",
          "__meta_kubernetes_pod_labelpresent_pod_template_hash": "true",
          "__address__": "10.244.0.29:8082",
          "__meta_kubernetes_service_labelpresent_app_kubernetes_io_name": "true",
          "__meta_kubernetes_pod_label_app_kubernetes_io_name": "py-otel-server",
          "__meta_kubernetes_pod_container_port_protocol": "TCP",
          "__meta_kubernetes_pod_labelpresent_app": "true",
          "__meta_kubernetes_pod_annotationpresent_instrumentation_opentelemetry_io_inject_python": "true",
          "__meta_kubernetes_endpointslice_port": "8082",
          "__meta_kubernetes_pod_phase": "Running",
          "__meta_kubernetes_endpointslice_name": "py-otel-server-svc-t2wgv",
          "__meta_kubernetes_endpointslice_label_app_kubernetes_io_name": "py-otel-server",
          "__meta_kubernetes_endpointslice_port_protocol": "TCP",
          "__meta_kubernetes_service_annotationpresent_kubectl_kubernetes_io_last_applied_configuration": "true",
          "__meta_kubernetes_pod_ready": "true",
          "__meta_kubernetes_pod_controller_kind": "ReplicaSet",
          "__meta_kubernetes_pod_name": "py-otel-server-f89fdbc4f-lbb7z",
          "__meta_kubernetes_service_label_app": "my-app",
          "__meta_kubernetes_endpointslice_port_name": "py-server-port",
          "__meta_kubernetes_endpointslice_label_app": "my-app"
        }
      }
    ]
  }
}