Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

new readme #70

Merged
merged 10 commits into from
Dec 18, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
222 changes: 190 additions & 32 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,48 +1,206 @@
## OpenTelemetry Acceptance Tests (OATs)
# OpenTelemetry Acceptance Tests (OATs)

### Goals
OpenTelemetry Acceptance Tests (OATs), or OATs for short, is a test framework for OpenTelemetry.

1. Flexibility to support qualification of changes to the [OpenTelemetry Collector][], and [Tempo][]
1. Ability to support OpenTelemetry SDK functionality such as [sampling][]
1. Highlight the use of [Ginkgo][], and [Gomega][]
1. Have a cute name
- Declarative tests written in YAML
- Supported signals: traces, logs, metrics
- Full round-trip testing: from the application to the observability stack
- Data is stored in the LGTM stack ([Loki], [Grafana], [Tempo], [Prometheus], [OpenTelemetry Collector])
- Data is queried using LogQL, PromQL, and TraceQL
- All data is sent to the observability stack via OTLP - so OATs can also be used with other observability stacks
- End-to-end testing
- Docker Compose with the [docker-otel-lgtm] image
- Kubernetes with the [docker-otel-lgtm] and [k3d]

[Tempo]: https://github.com/grafana/tempo
[OpenTelemetry Collector]: https://github.com/open-telemetry/opentelemetry-collector
[Prometheus]: https://github.com/prometheus/prometheus
[dockertest]: https://github.com/ory/dockertest
[sampling]: https://opentelemetry.io/docs/instrumentation/go/sampling/
[Ginkgo]: https://onsi.github.io/ginkgo/
[Gomega]: https://onsi.github.io/gomega/
Under the hood, OATs uses [Ginkgo] and [Gomega] to run the tests.

## Getting Started

> You can use the test cases in [prom_client_java](https://github.com/prometheus/client_java/tree/main/examples/example-exporter-opentelemetry/oats-tests) as a reference.
> The [GitHub action](https://github.com/prometheus/client_java/blob/main/.github/workflows/acceptance-tests.yml)
> uses a [script](https://github.com/prometheus/client_java/blob/main/scripts/run-acceptance-tests.sh) to run the tests.

### Getting Started
1. Create a folder `oats-tests` for the following files
2. Create `Dockerfile` to build the application you want to test
```Dockerfile
FROM eclipse-temurin:21-jre
COPY target/example-exporter-opentelemetry.jar ./app.jar
ENTRYPOINT [ "java", "-jar", "./app.jar" ]
```
3. Create `docker-compose.yaml` to start the application and any dependencies
```yaml
version: '3.4'

services:
java:
build:
dockerfile: Dockerfile
environment:
OTEL_SERVICE_NAME: "rolldice"
OTEL_EXPORTER_OTLP_ENDPOINT: http://lgtm:4318
OTEL_EXPORTER_OTLP_PROTOCOL: http/protobuf
OTEL_METRIC_EXPORT_INTERVAL: "5000" # so we don't have to wait 60s for metrics
```
4. Create `oats.yaml` with the test cases
```yaml
# OATs is an acceptance testing framework for OpenTelemetry - https://github.com/grafana/oats
docker-compose:
files:
- ./docker-compose.yaml
expected:
metrics:
- promql: 'uptime_seconds_total{}'
value: '>= 0'
```
5. `cd /path/to/oats/yaml`
6. `go install github.com/onsi/ginkgo/v2/ginkgo`
7. `TESTCASE_BASE_PATH=/path/to/oats-tests ginkgo -v`

1. Install [Go][]
1. Install [Docker][] ([Podman][] also works provided it is listening on the expected Docker Unix socket)
1. Clone the repository
1. Ensure that `${GOBIN}` is on your `${PATH}`
1. From within the repository directory, install [Ginkgo][]
## Test Case Syntax

> You can use any file name that matches `oats*.yaml` (e.g. `oats-test.yaml`), that doesn't end in `-template.yaml`.
> `oats-template.yaml` is reserved for template files, which are used in the `include` section.

The syntax is a bit similar to https://github.com/kubeshop/tracetest

This is an example:

```yaml
include:
- ../oats-template.yaml
docker-compose:
file: ../docker-compose.yaml
input:
- url: http://localhost:8080/stock
interval: 500ms # interval between requests to the input URL
expected:
traces:
- traceql: '{ name =~ "SELECT .*product"}'
spans:
- name: 'regex:SELECT .*'
attributes:
db.system: h2
logs:
- logql: '{exporter = "OTLP"}'
contains:
- 'hello LGTM'
metrics:
- promql: 'db_client_connections_max{pool_name="HikariPool-1"}'
value: "== 10"
dashboards: # Grafana dashboards
- path: ../jdbc-dashboard.json
panels:
- title: Connection pool waiting requests
value: "== 0"
- title: Connection pool utilization
value: "> 0"
```
go install github.com/onsi/ginkgo/v2/ginkgo

### Query traces

Each entry in the `traces` array is a test case for traces.

```yaml
expected:
traces:
- traceql: '{ name =~ "SELECT .*product"}'
spans:
- name: 'regex:SELECT .*' # regex match
attributes:
db.system: h2
allow-duplicates: true # allow multiple spans with the same attributes
```

1. Run the specs
### Query logs

Each entry in the `logs` array is a test case for logs.

```yaml
expected:
logs:
- logql: '{service_name="rolldice"} |~ `Anonymous player is rolling the dice.*`'
equals: 'Anonymous player is rolling the dice'
attributes:
service_name: rolldice
attribute-regexp:
container_id: ".*"
no-extra-attributes: true # fail if there are extra attributes
- logql: '{service_name="rolldice"} |~ `Anonymous player is rolling the dice.*`'
regexp: 'Anonymous player is .*'
```
ginkgo -r (or ginkgo ./...)

### Query metrics

```yaml
expected:
metrics:
- promql: 'db_client_connections_max{pool_name="HikariPool-1"}'
value: "== 10"
dashboards: # Useful if you populate Grafana dashboards from JSON
- path: ../jdbc-dashboard.json
panels:
- title: Connection pool waiting requests
value: "== 0"
- title: Connection pool utilization
value: "> 0"
```

1. Browse the [example][]
## Docker Compose

Describes the docker-compose file(s) to use for the test.
The files typically define the instrumented application you want to test and optionally some dependencies,
e.g. a database server to send requests to.
You don't need (and shouldn't have) to define the observability stack (e.g. Prometheus, Grafana, etc.),
because this is provided by the test framework (and may test different versions of the observability stack,
e.g. OTel Collector and Grafana Alloy).

This docker-compose file is relative to the `oats.yaml` file.

## Kubernetes

[Go]: https://go.dev/
[Docker]: https://www.docker.com/
[Podman]: https://podman.io/
[example]: examples/dockertest/send_simple_trace_test.go
A local Kubernetes cluster can be used to test the application in a Kubernetes environment rather than in docker-compose.
This is useful to test the application in a more realistic environment - and when you want to test Kubernetes specific features.

### Writing Specs
Describes the Kubernetes manifest(s) to use for the test.

```yaml
kubernetes:
dir: k8s
app-service: dice
app-docker-file: Dockerfile
app-docker-context: ..
app-docker-tag: dice:1.1-SNAPSHOT
app-docker-port: 8080
```

## Debugging

If you want to run a single test case, you can use the `--focus` option:

```sh
TESTCASE_BASE_PATH=/path/to/project ginkgo -v --focus="jdbc"
```

You can increase the timeout, which is useful if you want to inspect the telemetry data manually
in Grafana at http://localhost:3000

```sh
TESTCASE_TIMEOUT=1h TESTCASE_BASE_PATH=/path/to/project ginkgo -v
```

You can keep the container running without executing the tests - which is useful to debug in Grafana manually:

```sh
TESTCASE_MANUAL_DEBUG=true TESTCASE_BASE_PATH=/path/to/project ginkgo -v
```

[Ginkgo]: https://onsi.github.io/ginkgo/
[Gomega]: https://onsi.github.io/gomega/
[Tempo]: https://github.com/grafana/tempo
[OpenTelemetry Collector]: https://opentelemetry.io/docs/collector/
[Prometheus]: https://prometheus.io/
[Grafana]: https://grafana.com/
[Loki]: https://github.com/grafana/loki
[docker-otel-lgtm]: https://github.com/grafana/docker-otel-lgtm/
[k3d]: https://k3d.io/

1. Decide whether to use the `testhelpers/observability` package, individual packages such as
`testhelpers/tempo`, or only support externally provisioned endpoints
1. Write the specs using [Ginkgo][], and [Gomega][]
1. Profit
Loading