Metrics

Metrics or traces?

A metric is a numeric representation of data measured over intervals of time. Metrics are malleable to statistical transformations such as sampling, aggregation and correlation, which make them suited to report the overall health of a system.

A trace is a representation of a series of causally related distributed events that encode information about the end-to-end request flow through a distributed system. Traces are used to identify the amount of work done at each layer in an application while preserving causality.

The main difference between metrics and traces is therefore that metrics are system-centric and traces are request-centric: metrics give you insight into how a particular system is doing, while traces help teams identify the path of requests through various services.

For most things, you likely want a metric, except for two scenarios:

For contributors, traces are a good profiling tool
For end-users that run complicated infrastructure, traces in the RPC component makes sense

How to add a metric

To add metrics use the metrics crate.

Add the code emitting the metric.
Add the metrics description in the crate's metrics describer module, e.g.: network metrics describer.
Document the metric in this file.

Metric anatomy

There are three types of metrics:

Counters: Represent (ideally) monotonically increasing values, e.g. the number of errors that have occurred, the number of blocks processed, etc.
Gauges: Represent metrics that can go up or down arbitrarily over time. Usually they are used to measure things like resource usage (memory, CPU, ...) and throughput.
Histograms: Used to store an arbitrary number of observations of a specific measurement, and provides statistical analysis over the observed values. A typical use case is latency of some operation (writing to disk, responding to a request, ...).

Each metric is identified by a Key, which itself is composed of a KeyName and an arbitrary number of Labels.

The KeyName represents the actual metric name, and the labels are used to further drill down into the metric.

For example, a metric that represents stage progress would have a key name of stage_progress and a stage label that can be used to get the progress of individual stages.

There will only ever exist one description per metric KeyName; it is not possible to add a description for a label, or a KeyName/Label group.

Creating metrics

The metrics crate provides three macros per metric variant: register_<metric>!, <metric>!, and describe_<metric>!. Prefer to use these where possible, since they generate the code necessary to register and update metrics under various conditions.

The register_<metric>! macro simply creates the metric and returns a handle to it (e.g. a Counter). These metric structs are thread-safe and cheap to clone.
The <metric>! macro registers the metric if it does not exist, and updates it's value.
The describe_<metric>! macro adds an end-user description for the metric.

How the metrics are exposed to the end-user is determined by the CLI.

Metric best practices

Use . to namespace metrics
- The top-level namespace should NOT be reth¹
Metric names should not contain spaces
Add a unit to the metric where appropriate
- Use the Prometheus base units
Do not add rate-metrics
- Rates can be calculated by e.g. Prometheus on the fly
Avoid duplicate metrics
- An example would be adding two metrics for connections: reth.p2p.connections for current connections and reth.p2p.connections.total for total connections. One of these metrics can be used to infer the other.

Current metrics

This list may be non-exhaustive.

Stage: Headers

stages.headers.counter: Number of headers successfully retrieved
stages.headers.timeout_error: Number of timeout errors while requesting headers
stages.headers.validation_errors: Number of validation errors while requesting headers
stages.headers.unexpected_errors: Number of unexpected errors while requesting headers
stages.headers.request_time: Elapsed time of successful header requests

Component: Transaction Pool

transaction_pool.inserted_transactions: Number of transactions inserted in the pool
transaction_pool.invalid_transactions: Number of invalid transactions
transaction_pool.removed_transactions: Number of removed transactions from the pool
transaction_pool.pending_pool_transactions: Number of transactions in the pending sub-pool
transaction_pool.pending_pool_size_bytes: Total amount of memory used by the transactions in the pending sub-pool in bytes
transaction_pool.basefee_pool_transactions: Number of transactions in the basefee sub-pool
transaction_pool.basefee_pool_size_bytes: Total amount of memory used by the transactions in the basefee sub-pool in bytes
transaction_pool.queued_pool_transactions: Number of transactions in the queued sub-pool
transaction_pool.queued_pool_size_bytes: Total amount of memory used by the transactions in the queued sub-pool in bytes
transaction_pool.blob_pool_transactions: Number of transactions in the blob sub-pool
transaction_pool.blob_pool_size_bytes: Total amount of memory used by the transactions in the blob sub-pool in bytes

Component: Network

network.connected_peers: Number of currently connected peers
network.tracked_peers: Number of peers known to the node
network.pending_session_failures: Cumulative number of failures of pending sessions
network.closed_sessions: Total number of sessions closed
network.incoming_connections: Number of active incoming connections
network.outgoing_connections: Number of active outgoing connections
network.total_incoming_connections: Total number of incoming connections handled
network.total_outgoing_connections: Total number of outgoing connections established
network.invalid_messages_received: Number of invalid/malformed messages received from peers
network.propagated_transactions: Total number of propagated transactions

Footnotes

The top-level namespace is added by the CLI using metrics_util::layers::PrefixLayer. ↩

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

metrics.md

metrics.md

Metrics

Metrics or traces?

How to add a metric

Metric anatomy

Creating metrics

Metric best practices

Current metrics

Stage: Headers

Component: Transaction Pool

Component: Network

Files

metrics.md

Latest commit

History

metrics.md

File metadata and controls

Metrics

Metrics or traces?

How to add a metric

Metric anatomy

Creating metrics

Metric best practices

Current metrics

Stage: Headers

Component: Transaction Pool

Component: Network

Footnotes