Tracing SDK
Status: Stable
Tracer Provider
Tracer Creation
It SHOULD only be possible to create Tracer
instances through a TracerProvider
(see API).
The TracerProvider
MUST implement the Get a Tracer API.
The input provided by the user MUST be used to create
an InstrumentationScope
instance which
is stored on the created Tracer
.
Configuration
Configuration (i.e., SpanProcessors, IdGenerator,
SpanLimits and Sampler
) MUST be owned by the
the TracerProvider
. The configuration MAY be applied at the time of TracerProvider
creation if appropriate.
The TracerProvider MAY provide methods to update the configuration. If
configuration is updated (e.g., adding a SpanProcessor
),
the updated configuration MUST also apply to all already returned Tracers
(i.e. it MUST NOT matter whether a Tracer
was obtained from the
TracerProvider
before or after the configuration change).
Note: Implementation-wise, this could mean that Tracer
instances have a
reference to their TracerProvider
and access configuration only via this
reference.
Shutdown
This method provides a way for provider to do any cleanup required.
Shutdown
MUST be called only once for each TracerProvider
instance. After
the call to Shutdown
, subsequent attempts to get a Tracer
are not allowed. SDKs
SHOULD return a valid no-op Tracer for these calls, if possible.
Shutdown
SHOULD provide a way to let the caller know whether it succeeded,
failed or timed out.
Shutdown
SHOULD complete or abort within some timeout. Shutdown
can be
implemented as a blocking API or an asynchronous API which notifies the caller
via a callback or an event. OpenTelemetry client authors can decide if they want to
make the shutdown timeout configurable.
Shutdown
MUST be implemented at least by invoking Shutdown
within all internal processors.
ForceFlush
This method provides a way for provider to immediately export all spans that have not yet been exported for all the internal processors.
ForceFlush
SHOULD provide a way to let the caller know whether it succeeded,
failed or timed out.
ForceFlush
SHOULD complete or abort within some timeout. ForceFlush
can be
implemented as a blocking API or an asynchronous API which notifies the caller
via a callback or an event. OpenTelemetry client authors can decide if they want to
make the flush timeout configurable.
ForceFlush
MUST invoke ForceFlush
on all registered SpanProcessors
.
Additional Span Interfaces
The API-level definition for Span’s interface
only defines write-only access to the span.
This is good because instrumentations and applications are not meant to use the data
stored in a span for application logic.
However, the SDK needs to eventually read back the data in some locations.
Thus, the SDK specification defines sets of possible requirements for
Span
-like parameters:
Readable span: A function receiving this as argument MUST be able to access all information that was added to the span, as listed in the API spec for Span. Note: Below, a few particular properties are called out for clarity, but for the complete list of required properties, the Span API spec is authoritative.
A function receiving this as argument MUST be able to access the
InstrumentationScope
[since 1.10.0] andResource
information (implicitly) associated with the span. For backwards compatibility it MUST also be able to access theInstrumentationLibrary
[deprecated since 1.10.0] having the same name and version values as theInstrumentationScope
.A function receiving this as argument MUST must be able to reliably determine whether the Span has ended (some languages might implement this by having an end timestamp of
null
, others might have an explicithasEnded
boolean).Counts for attributes, events and links dropped due to collection limits MUST be available for exporters to report as described in the exporters specification.
As an exception to the authoritative set of span properties defined in the API spec, implementations MAY choose not to expose (and store) the full parent Context of the Span but they MUST expose at least the full parent SpanContext.
A function receiving this as argument might not be able to modify the Span.
Note: Typically this will be implemented with a new interface or (immutable) value type. In some languages SpanProcessors may have a different readable span type than exporters (e.g. a
SpanData
type might contain an immutable snapshot and aReadableSpan
interface might read information directly from the same underlying data structure that theSpan
interface manipulates).Read/write span: A function receiving this as argument must have access to both the full span API as defined in the API-level definition for span’s interface and additionally must be able to retrieve all information that was added to the span (as with readable span).
It MUST be possible for functions being called with this to somehow obtain the same
Span
instance and type that the span creation API returned (or will return) to the user (for example, theSpan
could be one of the parameters passed to such a function, or a getter could be provided).
Sampling
Sampling is a mechanism to control the noise and overhead introduced by OpenTelemetry by reducing the number of samples of traces collected and sent to the backend.
Sampling may be implemented on different stages of a trace collection. The earliest sampling could happen before the trace is actually created, and the latest sampling could happen on the Collector which is out of process.
The OpenTelemetry API has two properties responsible for the data collection:
IsRecording
field of aSpan
. Iffalse
the currentSpan
discards all tracing data (attributes, events, status, etc.). Users can use this property to determine if collecting expensive trace data can be avoided. Span Processor MUST receive only those spans which have this field set totrue
. However, Span Exporter SHOULD NOT receive them unless theSampled
flag was also set.Sampled
flag inTraceFlags
onSpanContext
. This flag is propagated via theSpanContext
to child Spans. For more details see the W3C Trace Context specification. This flag indicates that theSpan
has beensampled
and will be exported. Span Exporters MUST receive those spans which haveSampled
flag set to true and they SHOULD NOT receive the ones that do not.
The flag combination SampledFlag == false
and IsRecording == true
means that the current Span
does record information, but most likely the child
Span
will not.
The flag combination SampledFlag == true
and IsRecording == false
could cause gaps in the distributed trace, and because of this the OpenTelemetry SDK
MUST NOT allow this combination.
Recording Sampled reaction table
The following table summarizes the expected behavior for each combination of
IsRecording
and SampledFlag
.
IsRecording | Sampled Flag | Span Processor receives Span? | Span Exporter receives Span? |
---|---|---|---|
true | true | true | true |
true | false | true | false |
false | true | Not allowed | Not allowed |
false | false | false | false |
The SDK defines the interface Sampler
as well as a set of
built-in samplers and associates a Sampler
with each [TracerProvider
].
SDK Span creation
When asked to create a Span, the SDK MUST act as if doing the following in order:
- If there is a valid parent trace ID, use it. Otherwise generate a new trace ID
(note: this must be done before calling
ShouldSample
, because it expects a valid trace ID as input). - Query the
Sampler
’sShouldSample
method (Note that the built-inParentBasedSampler
can be used to use the sampling decision of the parent, translating a set SampledFlag to RECORD and an unset one to DROP). - Generate a new span ID for the
Span
, independently of the sampling decision. This is done so other components (such as logs or exception handling) can rely on a unique span ID, even if theSpan
is a non-recording instance. - Create a span depending on the decision returned by
ShouldSample
: see description ofShouldSample
’s return value below for how to setIsRecording
andSampled
on the Span, and the table above on whether to pass theSpan
toSpanProcessor
s. A non-recording span MAY be implemented using the same mechanism as when aSpan
is created without an SDK installed or as described in wrapping a SpanContext in a Span.
Sampler
Sampler
interface allows users to create custom samplers which will return a
sampling SamplingResult
based on information that is typically available just
before the Span
was created.
ShouldSample
Returns the sampling Decision for a Span
to be created.
Required arguments:
Context
with parentSpan
. The Span’s SpanContext may be invalid to indicate a root span.TraceId
of theSpan
to be created. If the parentSpanContext
contains a validTraceId
, they MUST always match.- Name of the
Span
to be created. SpanKind
of theSpan
to be created.- Initial set of
Attributes
of theSpan
to be created. - Collection of links that will be associated with the
Span
to be created. Typically useful for batch operations, see Links Between Spans.
Note: Implementations may “bundle” all or several arguments together in a single object.
Return value:
It produces an output called SamplingResult
which contains:
- A sampling
Decision
. One of the following enum values:DROP
-IsRecording
will befalse
, theSpan
will not be recorded and all events and attributes will be dropped.RECORD_ONLY
-IsRecording
will betrue
, but theSampled
flag MUST NOT be set.RECORD_AND_SAMPLE
-IsRecording
will betrue
and theSampled
flag MUST be set.
- A set of span Attributes that will also be added to the
Span
. The returned object must be immutable (multiple calls may return different immutable objects). - A
Tracestate
that will be associated with theSpan
through the newSpanContext
. If the sampler returns an emptyTracestate
here, theTracestate
will be cleared, so samplers SHOULD normally return the passed-inTracestate
if they do not intend to change it.
GetDescription
Returns the sampler name or short description with the configuration. This may
be displayed on debug pages or in the logs. Example:
"TraceIdRatioBased{0.000100}"
.
Description MUST NOT change over time and caller can cache the returned value.
Built-in samplers
OpenTelemetry supports a number of built-in samplers to choose from.
The default sampler is ParentBased(root=AlwaysOn)
.
AlwaysOn
- Returns
RECORD_AND_SAMPLE
always. - Description MUST be
AlwaysOnSampler
.
AlwaysOff
- Returns
DROP
always. - Description MUST be
AlwaysOffSampler
.
TraceIdRatioBased
- The
TraceIdRatioBased
MUST ignore the parentSampledFlag
. To respect the parentSampledFlag
, theTraceIdRatioBased
should be used as a delegate of theParentBased
sampler specified below. - Description MUST return a string of the form
"TraceIdRatioBased{RATIO}"
withRATIO
replaced with the Sampler instance’s trace sampling ratio represented as a decimal number. The precision of the number SHOULD follow implementation language standards and SHOULD be high enough to identify when Samplers have different ratios. For example, if a TraceIdRatioBased Sampler had a sampling ratio of 1 to every 10,000 spans it COULD return"TraceIdRatioBased{0.000100}"
as its description.
TODO: Add details about how the TraceIdRatioBased
is implemented as a function
of the TraceID
. #1413
Requirements for TraceIdRatioBased
sampler algorithm
- The sampling algorithm MUST be deterministic. A trace identified by a given
TraceId
is sampled or not independent of language, time, etc. To achieve this, implementations MUST use a deterministic hash of theTraceId
when computing the sampling decision. By ensuring this, running the sampler on any childSpan
will produce the same decision. - A
TraceIdRatioBased
sampler with a given sampling rate MUST also sample all traces that anyTraceIdRatioBased
sampler with a lower sampling rate would sample. This is important when a backend system may want to run with a higher sampling rate than the frontend system, this way all frontend traces will still be sampled and extra traces will be sampled on the backend only. - WARNING: Since the exact algorithm is not specified yet (see TODO above),
there will probably be changes to it in any language SDK once it is, which
would break code that relies on the algorithm results.
Only the configuration and creation APIs can be considered stable.
It is recommended to use this sampler algorithm only for root spans
(in combination with
ParentBased
) because different language SDKs or even different versions of the same language SDKs may produce inconsistent results for the same input.
ParentBased
- This is a sampler decorator.
ParentBased
helps distinguish between the following cases:- No parent (root span).
- Remote parent (
SpanContext.IsRemote() == true
) withSampledFlag
set - Remote parent (
SpanContext.IsRemote() == true
) withSampledFlag
not set - Local parent (
SpanContext.IsRemote() == false
) withSampledFlag
set - Local parent (
SpanContext.IsRemote() == false
) withSampledFlag
not set
Required parameters:
root(Sampler)
- Sampler called for spans with no parent (root spans)
Optional parameters:
remoteParentSampled(Sampler)
(default: AlwaysOn)remoteParentNotSampled(Sampler)
(default: AlwaysOff)localParentSampled(Sampler)
(default: AlwaysOn)localParentNotSampled(Sampler)
(default: AlwaysOff)
Parent | parent.isRemote() | parent.IsSampled() | Invoke sampler |
---|---|---|---|
absent | n/a | n/a | root() |
present | true | true | remoteParentSampled() |
present | true | false | remoteParentNotSampled() |
present | false | true | localParentSampled() |
present | false | false | localParentNotSampled() |
JaegerRemoteSampler
Jaeger remote sampler allows remotely controlling the sampling configuration for the SDKs. The sampling configuration is periodically loaded from the backend (see Remote Sampling API), where it can be managed by operators via configuration files or even automatically calculated (see Adaptive Sampling). The sampling configuration retrieved by the remote sampler can instruct it to use either a single sampling method for the whole service (e.g., TraceIdRatioBased
), or different methods for different endpoints (span names), for example, sample /product
endpoint at 10%, /admin
endpoint at 100%, and never sample /metrics
endpoint.
The full Protobuf definition can be found at jaegertracing/jaeger-idl/api_v2/sampling.proto.
The following configuration properties should be available when creating the sampler:
- endpoint - address of a service that implements the Remote Sampling API, such as Jaeger Collector or OpenTelemetry Collector.
- polling interval - polling interval for getting configuration from remote
- initial sampler - initial sampler that is used before the first configuration is fetched
Span Limits
Span attributes MUST adhere to the common rules of attribute limits.
SDK Spans MAY also discard links and events that would increase the number of elements of each collection beyond the configured limit.
If the SDK implements the limits above it MUST provide a way to change these limits, via a configuration to the TracerProvider, by allowing users to configure individual limits like in the Java example bellow.
The name of the configuration options SHOULD be EventCountLimit
and LinkCountLimit
. The options MAY be bundled in a class,
which then SHOULD be called SpanLimits
. Implementations MAY provide additional
configuration such as AttributePerEventCountLimit
and AttributePerLinkCountLimit
.
public final class SpanLimits {
SpanLimits(int attributeCountLimit, int linkCountLimit, int eventCountLimit);
public int getAttributeCountLimit();
public int getAttributeCountPerEventLimit();
public int getAttributeCountPerLinkLimit();
public int getEventCountLimit();
public int getLinkCountLimit();
}
Configurable parameters:
- all common options applicable to attributes
EventCountLimit
(Default=128) - Maximum allowed span event count;LinkCountLimit
(Default=128) - Maximum allowed span link count;AttributePerEventCountLimit
(Default=128) - Maximum allowed attribute per span event count;AttributePerLinkCountLimit
(Default=128) - Maximum allowed attribute per span link count;
There SHOULD be a message printed in the SDK’s log to indicate to the user that an attribute was discarded due to such a limit. To prevent excessive logging, the message MUST be printed at most once per span (i.e., not per discarded attribute, event, or link).
Id Generators
The SDK MUST by default randomly generate both the TraceId
and the SpanId
.
The SDK MUST provide a mechanism for customizing the way IDs are generated for
both the TraceId
and the SpanId
.
The SDK MAY provide this functionality by allowing custom implementations of
an interface like the java example below (name of the interface MAY be
IdGenerator
, name of the methods MUST be consistent with
SpanContext), which provides
extension points for two methods, one to generate a SpanId
and one for TraceId
.
public interface IdGenerator {
byte[] generateSpanIdBytes();
byte[] generateTraceIdBytes();
}
Additional IdGenerator
implementing vendor-specific protocols such as AWS
X-Ray trace id generator MUST NOT be maintained or distributed as part of the
Core OpenTelemetry repositories.
Span processor
Span processor is an interface which allows hooks for span start and end method
invocations. The span processors are invoked only when
IsRecording
is true.
Built-in span processors are responsible for batching and conversion of spans to exportable representation and passing batches to exporters.
Span processors can be registered directly on SDK TracerProvider
and they are
invoked in the same order as they were registered.
Each processor registered on TracerProvider
is a start of pipeline that consist
of span processor and optional exporter. SDK MUST allow to end each pipeline with
individual exporter.
SDK MUST allow users to implement and configure custom processors.
The following diagram shows SpanProcessor
’s relationship to other components
in the SDK:
+-----+--------------+ +-------------------------+ +----------------+
| | | | | | |
| | | | Batching Span Processor | | SpanExporter |
| | +---> Simple Span Processor +---> (OTLPExporter) |
| | | | | | |
| SDK | Span.start() | +-------------------------+ +----------------+
| | Span.end() |
| | |
| | |
| | |
| | |
+-----+--------------+
Interface definition
OnStart
OnStart
is called when a span is started. This method is called synchronously
on the thread that started the span, therefore it should not block or throw
exceptions.
Parameters:
span
- a read/write span object for the started span. It SHOULD be possible to keep a reference to this span object and updates to the span SHOULD be reflected in it. For example, this is useful for creating a SpanProcessor that periodically evaluates/prints information about all active span from a background thread.parentContext
- the parentContext
of the span that the SDK determined (the explicitly passedContext
, the currentContext
or an emptyContext
if that was explicitly requested).
Returns: Void
OnEnd(Span)
OnEnd
is called after a span is ended (i.e., the end timestamp is already set).
This method MUST be called synchronously within the Span.End()
API,
therefore it should not block or throw an exception.
Parameters:
Span
- a readable span object for the ended span. Note: Even if the passed Span may be technically writable, since it’s already ended at this point, modifying it is not allowed.
Returns: Void
Shutdown()
Shuts down the processor. Called when SDK is shut down. This is an opportunity for processor to do any cleanup required.
Shutdown
SHOULD be called only once for each SpanProcessor
instance. After
the call to Shutdown
, subsequent calls to OnStart
, OnEnd
, or ForceFlush
are not allowed. SDKs SHOULD ignore these calls gracefully, if possible.
Shutdown
SHOULD provide a way to let the caller know whether it succeeded,
failed or timed out.
Shutdown
MUST include the effects of ForceFlush
.
Shutdown
SHOULD complete or abort within some timeout. Shutdown
can be
implemented as a blocking API or an asynchronous API which notifies the caller
via a callback or an event. OpenTelemetry client authors can decide if they want to
make the shutdown timeout configurable.
ForceFlush()
This is a hint to ensure that any tasks associated with Spans
for which the
SpanProcessor
had already received events prior to the call to ForceFlush
SHOULD
be completed as soon as possible, preferably before returning from this method.
In particular, if any SpanProcessor
has any associated exporter, it SHOULD
try to call the exporter’s Export
with all spans for which this was not
already done and then invoke ForceFlush
on it.
The built-in SpanProcessors MUST do so.
If a timeout is specified (see below), the SpanProcessor MUST prioritize honoring the timeout over
finishing all calls. It MAY skip or abort some or all Export or ForceFlush
calls it has made to achieve this goal.
ForceFlush
SHOULD provide a way to let the caller know whether it succeeded,
failed or timed out.
ForceFlush
SHOULD only be called in cases where it is absolutely necessary,
such as when using some FaaS providers that may suspend the process after an
invocation, but before the SpanProcessor
exports the completed spans.
ForceFlush
SHOULD complete or abort within some timeout. ForceFlush
can be
implemented as a blocking API or an asynchronous API which notifies the caller
via a callback or an event. OpenTelemetry client authors can decide if they want to
make the flush timeout configurable.
Built-in span processors
The standard OpenTelemetry SDK MUST implement both simple and batch processors, as described below. Other common processing scenarios should be first considered for implementation out-of-process in OpenTelemetry Collector.
Simple processor
This is an implementation of SpanProcessor
which passes finished spans
and passes the export-friendly span data representation to the configured
SpanExporter
, as soon as they are finished.
Configurable parameters:
exporter
- the exporter where the spans are pushed.
Batching processor
This is an implementation of the SpanProcessor
which create batches of finished
spans and passes the export-friendly span data representations to the
configured SpanExporter
.
The processor SHOULD export a batch when any of the following happens AND
SpanProcessor#Export()
has not yet returned (for additional concurrency
details see the Export() specification):
scheduledDelayMillis
after the processor is constructed OR the first span is received by the span processor.scheduledDelayMillis
after the previous export timer ends, OR the previous export completes, OR the first span is added to the queue after the previous export timer ends or previous batch completes.- The queue contains
maxExportBatchSize
or more spans. ForceFlush
is called.
If any of the above events occurs before Export()
returns, the span processor
should wait until Export()
returns before exporting the next batch.
If the queue is empty when an export is triggered, the processor MAY export
an empty batch OR skip the export and consider it to be completed immediately.
Configurable parameters:
exporter
- the exporter where the spans are pushed.maxQueueSize
- the maximum queue size. After the size is reached spans are dropped. The default value is2048
.scheduledDelayMillis
- the maximum delay interval in milliseconds between two consecutive exports. The default value is5000
.exportTimeoutMillis
- how long the export can run before it is cancelled. The default value is30000
.maxExportBatchSize
- the maximum batch size of every export. It must be smaller or equal tomaxQueueSize
. If the queue reachesmaxExportBatchSize
a batch will be exported even ifscheduledDelayMillis
milliseconds have not elapsed. The default value is512
.
Span Exporter
Span Exporter
defines the interface that protocol-specific exporters must
implement so that they can be plugged into OpenTelemetry SDK and support sending
of telemetry data.
The goal of the interface is to minimize burden of implementation for protocol-dependent telemetry exporters. The protocol exporter is expected to be primarily a simple telemetry data encoder and transmitter.
Interface Definition
The exporter MUST support three functions: Export, Shutdown, and ForceFlush.
In strongly typed languages typically there will be one separate Exporter
interface per signal (SpanExporter
, …).
Export(batch)
Exports a batch of readable spans. Protocol exporters that will implement this function are typically expected to serialize and transmit the data to the destination.
Export() will never be called concurrently for the same exporter instance. Depending on the implementation the result of the export may be returned to the Processor not in the return value of the call to Export() but in a language specific way for signaling completion of an asynchronous task. This means that while an instance of an exporter will never have its Export() called concurrently it does not mean that the task of exporting can not be done concurrently. How this is done is outside the scope of this specification. Each implementation MUST document the concurrency characteristics the SDK requires of the exporter.
Export() MUST NOT block indefinitely, there MUST be a reasonable upper limit
after which the call must time out with an error result (Failure
).
Concurrent requests and retry logic is the responsibility of the exporter. The default SDK’s Span Processors SHOULD NOT implement retry logic, as the required logic is likely to depend heavily on the specific protocol and backend the spans are being sent to. For example, the OpenTelemetry Protocol (OTLP) specification defines logic for both sending concurrent requests and retrying requests.
Parameters:
batch - a batch of readable spans. The exact data type of the batch is language
specific, typically it is some kind of list,
e.g. for spans in Java it will be typically Collection<SpanData>
.
Returns: ExportResult:
The return of Export() is implementation specific. In what is idiomatic for the
language the Exporter must send an ExportResult
to the Processor.
ExportResult
has values of either Success
or Failure
:
Success
- The batch has been successfully exported. For protocol exporters this typically means that the data is sent over the wire and delivered to the destination server.Failure
- exporting failed. The batch must be dropped. For example, this can happen when the batch contains bad data and cannot be serialized.
For example, in Java the return of Export() would be a Future which when
completed returns the ExportResult
object. While in Erlang the Exporter sends
a message to the Processor with the ExportResult
for a particular batch of
spans.
Shutdown()
Shuts down the exporter. Called when SDK is shut down. This is an opportunity for exporter to do any cleanup required.
Shutdown
should be called only once for each Exporter
instance. After the
call to Shutdown
subsequent calls to Export
are not allowed and should
return a Failure
result.
Shutdown
should not block indefinitely (e.g. if it attempts to flush the data
and the destination is unavailable). OpenTelemetry client authors can decide if they
want to make the shutdown timeout configurable.
ForceFlush()
This is a hint to ensure that the export of any Spans
the exporter has received prior to the
call to ForceFlush
SHOULD be completed as soon as possible, preferably before
returning from this method.
ForceFlush
SHOULD provide a way to let the caller know whether it succeeded,
failed or timed out.
ForceFlush
SHOULD only be called in cases where it is absolutely necessary,
such as when using some FaaS providers that may suspend the process after an
invocation, but before the exporter exports the completed spans.
ForceFlush
SHOULD complete or abort within some timeout. ForceFlush
can be
implemented as a blocking API or an asynchronous API which notifies the caller
via a callback or an event. OpenTelemetry client authors can decide if they want to
make the flush timeout configurable.
Further Language Specialization
Based on the generic interface definition laid out above library authors must define the exact interface for the particular language.
Authors are encouraged to use efficient data structures on the interface boundary that are well suited for fast serialization to wire formats by protocol exporters and minimize the pressure on memory managers. The latter typically requires understanding of how to optimize the rapidly-generated, short-lived telemetry data structures to make life easier for the memory manager of the specific language. General recommendation is to minimize the number of allocations and use allocation arenas where possible, thus avoiding explosion of allocation/deallocation/collection operations in the presence of high rate of telemetry data generation.
Examples
These are examples on what the Exporter
interface can look like in specific
languages. Examples are for illustration purposes only. OpenTelemetry client authors
are free to deviate from these provided that their design remain true to the
spirit of Exporter
concept.
Go SpanExporter Interface
type SpanExporter interface {
Export(batch []ExportableSpan) ExportResult
Shutdown()
}
type ExportResult struct {
Code ExportResultCode
WrappedError error
}
type ExportResultCode int
const (
Success ExportResultCode = iota
Failure
)
Java SpanExporter Interface
public interface SpanExporter {
public enum ResultCode {
Success, Failure
}
ResultCode export(Collection<ExportableSpan> batch);
void shutdown();
}