# Error transmission through a batching processor with concurrency

Establish normative guidelines for components that batch telemetry, so that the batchprocessor and the exporterhelper-based batch_sender behave in similar ways with good defaults.

Third, to establish how batching processors should handle incoming context deadlines.

## Explanation

Here, "batching process" refers to specifically the batch processor
Here, "batching process" refers to specifically the batch processor
(i.e., a processor component) or the exporterhelper's batch_sender
(i.e., part of an exporter component).
Both are core components, and it is an explicit goal that these two
(i.e., part of an exporter component).
Both are core componentry, and it is an explicit goal that these two
options for batching cover a wide range of requirements.
We prefer to improve the core batching-process components instead
of introduce new components, in order to achieve desired batching
behavior.

We use the term "request" as a signal-independent descriptor of the
concrete data types used in Collector APIs (e.g., a `plog.Logs`,
`pmetric.Metrics`, or `ptrace.Traces`).

Today, both batching processes restrict size configuration to an item
count, [however the desire to use request count and byte count (in
some encoding) is well recognized](https://github.com/open-telemetry/opentelemetry-collector/issues/9462).

### Batch processor logic: existing

The batch processor operates over Pipeline data for both its input
and output. It takes advantage of the top-level `repeated` portion
of the OpenTelemetry data model to combine many requests into one
request. Here is the logical sequence of events that
takes place as a request makes its way through a batch processor:

A. The request arrives when the preceding component calls `Consume()`
on this component with Context and data.
B. The request is placed into a channel.
C. The request is removed from a channel by a background thread and
entered into the pending state.
D. The batching process calls export one or more times containing data
from the original request, receiving responses possibly with errors
which are logged.
E. The `Consume()` call returns control to the caller.

In the batch processor, the request producer performs steps A and B,
and then it skips to step E. Since the processor returns before the export
completes, it always returns success. We refer to this behavior as
"error suppression". The background thread, independently, performs steps
C and D, after which errors (if any) are logged.

The batch processor performs steps C and D repeatedly, with
never more than one export at a time. Effective concurrency is limited to
1 within the component; however, this is usually alleviated by the use of
a "queue_sender" later in the pipeline. When the exporterhelper's queue sender
is enabled and the queue has space, it immediately returns success (a
form of error suppression), which allows the batch processor to issue multiple
batches at a time.

The batch processor does not consult the incoming request's Context deadline
or allow request context cancellation to interrupt step B. Step D is executed
without a context deadline.
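
The following minimal Go sketch illustrates this flow. The names
(`batcher`, `item`) and the size-only trigger are hypothetical
simplifications; the real processor also has a timer-based trigger and
shutdown handling:

```go
package sketch

import (
	"context"
	"log"
)

// item stands in for one request's data (e.g., a plog.Logs); hypothetical.
type item struct{ data any }

// batcher sketches the existing batch processor flow.
type batcher struct {
	ch     chan item
	export func(context.Context, []item) error
}

// Consume performs steps A and B, then skips to step E: errors are suppressed.
func (b *batcher) Consume(ctx context.Context, it item) error {
	b.ch <- it // step B: the incoming ctx is not consulted here
	return nil // step E: success, before any export has happened
}

// run performs steps C and D on the background thread.
func (b *batcher) run() {
	var pending []item
	for it := range b.ch { // step C: remove from the channel
		pending = append(pending, it)
		if len(pending) >= 10 { // size trigger; the timer trigger is omitted
			// Step D: export without a context deadline; errors are only logged.
			if err := b.export(context.Background(), pending); err != nil {
				log.Printf("export failed, data dropped: %v", err)
			}
			pending = nil
		}
	}
}
```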

Trace context is interrupted. By default, incoming gRPC metadata is not propagated.
To address the loss of metadata, the batch processor has been extended with a
`metadata_keys` configuration; with this option, independent batch processes
are constructed for each distinct combination of metadata key values. Per-group
metadata values are placed into the outgoing context.
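
A sketch of how such a per-group key might be derived from the incoming
Context using the Collector's `client` package; the key format shown here
is illustrative, not the actual batchprocessor implementation:

```go
package sketch

import (
	"context"
	"strings"

	"go.opentelemetry.io/collector/client"
)

// groupKey builds one key per distinct combination of metadata values, so
// that an independent batching process can be kept per group.
func groupKey(ctx context.Context, metadataKeys []string) string {
	info := client.FromContext(ctx)
	parts := make([]string, 0, len(metadataKeys))
	for _, k := range metadataKeys {
		// Metadata.Get returns every value recorded for the key.
		parts = append(parts, k+"="+strings.Join(info.Metadata.Get(k), ","))
	}
	return strings.Join(parts, "|")
}
```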

### Batch sender logic: existing

The batch sender is a feature of the exporterhelper; it is an optional
sub-component situated after the queue sender, and it is used to compute
batches in the intended encoding used by the exporter. It follows a different
sequence of events compared to the processor:

A. Check if there is a pending batch.
B. If there is no pending batch, it creates a new one and starts a timer.
C. Add the incoming data to the pending batch.
D. Send the batch to the exporter.
E. Wait for the batch-error.
F. Each caller returns the batch-error.

Unlike the batch processor, errors are propagated, not suppressed.

Trace context is interrupted. Outgoing requests have empty `client.Metadata`.

The context deadline of the caller is not considered in step E. In step D, the
export is made without a context deadline; a subsequent timeout sender typically
configures a timeout for the export.
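
A sketch of steps E and F, in which every caller that contributed to a
batch waits for, and receives, the same batch-error; the `pendingBatch`
type is hypothetical:

```go
package sketch

// pendingBatch sketches steps E and F: every caller whose data entered the
// batch blocks until the export completes, then returns the same batch-error.
type pendingBatch struct {
	done chan struct{} // closed exactly once, when the export finishes
	err  error         // the batch-error, written before done is closed
}

// wait blocks the caller; note that the existing batch sender does not
// consider the caller's context deadline at this point.
func (p *pendingBatch) wait() error {
	<-p.done
	return p.err
}

// complete is called by the goroutine that performed step D.
func (p *pendingBatch) complete(err error) {
	p.err = err
	close(p.done)
}
```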

The pending batch is managed through a Golang interface, making it possible
to accumulate protocol-specific intermediate data. There are two specific
interfaces an exporter component provides:

- Merge(): when request batches have no upper bound. In this case, the interface
  produces a single output.
- MergeSplit(): when there is a maximum size imposed. In this case, the interface
produces potentially more than one output request.
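
A simplified sketch of these two operations expressed as a single Go
interface; the actual exporterhelper API differs (it is expressed in terms
of exporter-provided request types), so treat the names and signatures here
as illustrative:

```go
package sketch

import "context"

// request stands in for an exporter's protocol-specific batch state.
type request interface {
	// Merge is used when request batches have no upper bound: two
	// requests combine into a single output.
	Merge(ctx context.Context, other request) (request, error)

	// MergeSplit is used when a maximum size is imposed: combining two
	// requests may produce more than one output request.
	MergeSplit(ctx context.Context, maxItems int, other request) ([]request, error)
}
```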

Concurrency behavior varies. When `MergeSplit()` is used, a single incoming
request can produce multiple outgoing requests. In this case,
steps D through F are executed repeatedly while there are more requests, meaning:

1. Exports are synchronous and sequential.
2. An error causes aborting of subsequent parts of the request.

### Queue sender logic: existing

The queue sender provides key functionality that determines the overall behavior
of both batching components. When enabled, the queue sender will return success
to the caller as soon as the request is enqueued. In the background, it concurrently
exports requests in the queue using a configurable number of threads.

It is worth evaluating the behavior of the queue sender with a persistent queue
and with an in-memory queue:

- Persistent queue: In this case, the queue stores the request before returning
  success. There is no chance of data loss.
- In-memory queue: In this case, the queue acts as a form of error suppression.
Callers do not wait for the export to return, so there is a chance of data loss
in this configuration.

The queue sender does not consider the caller's context deadline when it attempts
to enqueue the request. If the queue is full, the queue sender returns a queue-full
error immediately.

### Feature matrix

| Support area | Batch processor | Batch sender | Explanation |
| -----| -- | -- | -- |
| Merges requests | Yes | Yes | Does it merge smaller into larger batches? |
| Splits requests | Yes | Yes, however sequential | Does it split larger into smaller batches? |
| Cancellation | No | No | Does it respect caller cancellation? |
| Deadline | No | No | Does it set an outgoing deadline? |
| Metadata | Yes | No | Can it batch by metadata key value(s)? |
| Tracing | No | No | Instrumented for tracing? |
| Error transmission | No | Yes | Are export errors returned to callers? |
| Concurrency allowed | No | Yes | Does the component allow concurrent exports? |
| Independence | Yes | No | Are all data exported independently? |

### Change proposal

#### Batch processor: required

The batch processor MUST be modified to achieve the following
outcomes:

- Allow concurrent exports. When the processor has a batch of
data available to send, it will send the data immediately.
- Transmit errors back to callers. Callers will be blocked
while one or more requests are issued and wait on responses.
- Respect context cancellation. Callers will return control
  to the pipeline when their context is cancelled (see the
  sketch after this list).
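
A sketch of the error-transmission and cancellation requirements,
extending the earlier wait sketch: the caller blocks for the batch-error
but also returns early when its own context is cancelled. Names are
hypothetical:

```go
package sketch

import "context"

// batchResult is shared by all callers whose data entered the same batch.
type batchResult struct {
	done chan struct{} // closed when the batch export completes
	err  error
}

// Consume blocks until the batch completes or the caller's own context is
// cancelled, transmitting the export error back to the caller.
func (b *batchResult) Consume(ctx context.Context) error {
	select {
	case <-b.done:
		return b.err // error transmission: callers see the export result
	case <-ctx.Done():
		return ctx.Err() // cancellation: control returns to the pipeline
	}
}
```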

#### Batch sender: required

The batch sender MUST be modified to achieve the following
outcomes:

- Allow concurrent splitting. When multiple full-size requests
  are produced from an input, they are exported concurrently and
  independently (see the sketch after this list).
- Recognize partial splitting. When a request is split, leaving
part of a request that is not full, it remains in the active
request (i.e., still pending).
- Respect key metadata. Implement the `metadata_keys` feature
supported by the batch processor.
- Respect context cancellation. Callers will return control
to the pipeline when their context is cancelled.
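
A sketch of the concurrent-splitting requirement, assuming a hypothetical
`exportFn`; a plain `errgroup.Group` is used (rather than
`errgroup.WithContext`) so that one failure does not cancel the other
exports, keeping them independent while the first error is still reported:

```go
package sketch

import (
	"context"

	"golang.org/x/sync/errgroup"
)

// exportRequest stands in for one full-size request produced by splitting.
type exportRequest any

// exportSplits exports every full-size request concurrently and
// independently.
func exportSplits(ctx context.Context, full []exportRequest,
	exportFn func(context.Context, exportRequest) error) error {
	var g errgroup.Group
	for _, req := range full {
		req := req // capture the loop variable (pre-Go 1.22 idiom)
		g.Go(func() error { return exportFn(ctx, req) })
	}
	return g.Wait() // reports the first error, after all exports finish
}
```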

### Open questions

#### Batch trace context

Should outgoing requests be instrumented by a trace Span linked to the incoming trace contexts? This document proposes yes, in
one of two ways:

1. When an export corresponds with data for a single incoming
request, the request's original context is used as the parent.
2. When an export corresponds with data from multiple incoming
requests, the incoming trace contexts are linked with the new
root span.
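
A sketch of both cases using the OpenTelemetry Go API; the tracer name and
span name are illustrative:

```go
package sketch

import (
	"context"

	"go.opentelemetry.io/otel"
	"go.opentelemetry.io/otel/trace"
)

// startExportSpan starts the span for one outgoing export request.
func startExportSpan(batchCtxs []context.Context) (context.Context, trace.Span) {
	tracer := otel.Tracer("batching-process")
	if len(batchCtxs) == 1 {
		// Case 1: a single incoming request; its context is the parent.
		return tracer.Start(batchCtxs[0], "export")
	}
	// Case 2: multiple incoming requests; link each incoming trace
	// context to a new root span.
	links := make([]trace.Link, 0, len(batchCtxs))
	for _, c := range batchCtxs {
		links = append(links, trace.LinkFromContext(c))
	}
	return tracer.Start(context.Background(), "export", trace.WithLinks(links...))
}
```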

#### Empty request handling

How should a batching process handle requests that contain no
concrete items of data? [These requests may be seen as empty
containers](https://github.com/open-telemetry/opentelemetry-proto/issues/598),
for example, tracing requests with no spans, metric requests with
no metric data points, and logs requests with no log records.

For a batching process, these requests can be problematic. If
request size is measured in item count, these "empty" requests
leave batch size unchanged, and could cause unbounded memory
growth.

This document proposes a consistent treatment for empty requests:
batching processes should return immediate success, which is the
current behavior of the batch processor.
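
A sketch of this treatment for the logs signal; the surrounding component
code is omitted:

```go
package sketch

import (
	"context"

	"go.opentelemetry.io/collector/pdata/plog"
)

// ConsumeLogs returns immediate success for an empty request, so that empty
// containers never occupy space in a pending batch.
func ConsumeLogs(ctx context.Context, ld plog.Logs) error {
	if ld.LogRecordCount() == 0 {
		return nil // an "empty" request: no log records to batch or export
	}
	// ... the normal batching path follows (omitted).
	return nil
}
```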

#### Outgoing context deadline

Should the batching process set an outgoing context deadline
to convey the maximum amount of time to consider processing
the request?

Neither existing component uses an outgoing context deadline.
This could lead to resource exhaustion, in some cases, by
allowing requests to remain pending indefinitely.

On the one hand, this support may not be necessary, since in
most cases the batching process is followed by an exporter, which
includes a timeout sender option, capable of ensuring a default
timeout.

On the other hand, the batching process knows the callers'
actual deadlines, and it could even use this information to
form batches.

This proposal makes no specific recommendation.
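
Purely as an illustration of the second possibility (not a
recommendation), a batching process could derive the outgoing deadline
from the earliest caller deadline:

```go
package sketch

import (
	"context"
	"time"
)

// exportContext gives the outgoing export the earliest deadline among the
// callers whose data is in the batch, so a batch never stays pending longer
// than any of its callers is willing to wait.
func exportContext(callerCtxs []context.Context) (context.Context, context.CancelFunc) {
	var earliest time.Time
	for _, c := range callerCtxs {
		if d, ok := c.Deadline(); ok && (earliest.IsZero() || d.Before(earliest)) {
			earliest = d
		}
	}
	if earliest.IsZero() {
		// No caller had a deadline: export without one, as both
		// components do today.
		return context.WithCancel(context.Background())
	}
	return context.WithDeadline(context.Background(), earliest)
}
```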
