Skip to content

Latest commit

 

History

History
55 lines (43 loc) · 3.32 KB

README.md

File metadata and controls

55 lines (43 loc) · 3.32 KB

data-prepper

What is Data Prepper

Data Prepper is an open source utility service. Data Prepper is a server side data collector with abilities to filter, enrich, transform, normalize and aggregate data for downstream analytics and visualization. The broader vision for Data Prepper is to enable an end-to-end data analysis life cycle from gathering raw logs to facilitating sophisticated and actionable interactive ad-hoc analyses on the data.

What is Data Prepper Integration

Data Prepper integration is concerned with the following aspects

  • Allow simple and automatic generation of all schematic structured

    • traces ( including specific fields mapping to map to SS4O schema)
    • services ( adding support for specific service mapping category)
    • metrics (using the standard SS4O schema)
  • Add Dashboard Assets for correlation between traces-services-metrics

  • Add correlation queries to investigate traces based metrics

Data - Prepper Trace Fields

Data Prepper uses the following Traces mapping file The next fields are used:


- traceId - A unique identifier for a trace. All spans from the same trace share the same traceId.
- spanId - A unique identifier for a span within a trace, assigned when the span is created.
- traceState - Conveys information about request position in multiple distributed tracing graphs.
- parentSpanId - The spanId of this span's parent span. If this is a root span, then this field must be empty.
- name - A description of the span's operation.
- kind - The type of span. See OpenTelemetry - SpanKind.
- startTime - The start time of the span.
- endTime - The end time of the span.
- durationInNanos - Difference in nanoseconds between startTime and endTime.
- serviceName - Currently derived from the opentelemetry.proto.resource.v1.Resource associated with the span, the resource from the span originates.
- events - A list of events. See OpenTelemetry - Events.
- links - A list of linked spans. See OpenTelemetry - Links.
- droppedAttributesCount - The number of attributes that were discarded.
- droppedEventsCount - The number of events that were discarded.
- droppedLinksCount - The number of links that were dropped.
- span.attributes.* - All span attributes are split into a list of keywords.
- resource.attributes.* - All resource attributes are split into a list of keywords.
- status.code - The status of the span. See OpenTelemetry - Status.

There are some additional trace.group related fields which are not part of the OTEL spec for traces

- traceGroup - A derived field, the name of the trace's root span.
- traceGroupFields.endTime - A derived field, the endTime of the trace's root span.
- traceGroupFields.statusCode - A derived field, the status.code of the trace's root span.
- traceGroupFields.durationInNanos - A derived field, the durationInNanos of the trace's root span.

Metrics from Traces Processors

Adding new processors for creating metrics for logs and traces that pass through Data Prepper