diff --git a/docs/overview.md b/docs/overview.md new file mode 100644 index 00000000000..47bf9ab8077 --- /dev/null +++ b/docs/overview.md @@ -0,0 +1,101 @@ +# Cumulus Overview + +This document provides high-level documentation for Cumulus. + +## Runtime + +Each Substrate blockchain provides a runtime. The runtime is the state transition function of the +blockchain. Cumulus provides interfaces and extensions to convert a Substrate FRAME runtime into a +Parachain runtime. Polkadot expects each runtime exposes an interface for validating a +Parachain's state transition and also provides interfaces for the Parachain to send and receive +messages of other Parachains. + +To convert a Substrate runtime into a Parachain runtime, the following code needs to be added to the +runtime: +```rust +cumulus_runtime::register_validate_block!(Block, Executive); +``` + +This macro call expects the `Block` and `Executive` type. It generates the `validate_block` function +that is expected by Polkadot to validate the state transition. + +When compiling a runtime that uses Cumulus, a WASM binary is generated that contains the *full* code +of the Parachain runtime plus the `validate_block` functionality. This binary is required to +register a Parachain on the relay chain. + +When the Parachain validator calls the `validate_block` function, it passes the PovBlock (See [Block +building](#block-building) for more information) and the parent header of the Parachain that is +stored on the relay chain. From the PovBlock witness data, Cumulus reconstructs the partial trie. +This partial trie is used as storage while executing the block. Cumulus also redirects all storage +related host functions to use the witness data storage. After the setup is done, Cumulus calls +`execute_block` with the transactions and the header stored in the PovBlock. On success, the new +Parachain header is returned as part of the `validate_block` result. + +## Node + +Parachains support light-clients, full nodes, and authority nodes. Authority nodes are called +Collators in the Polkadot ecosystem. Cumulus provides the consensus implementation for a +Parachain and the block production logic. + +The Parachain consensus will follow the relay chain to get notified about which Parachain blocks are +included in the relay-chain and which are finalized. Each block that is built by a Collator is sent +to a validator that is assigned to the particular Parachain. Cumulus provides the block production +logic that notifies each Collator of the Parachain to build a Parachain block. The +notification is triggered on a relay-chain block import by the Collator. This means that every +Collator of the Parachain can send a block to the Parachain validators. For more sophisticated +authoring logic, the Parachain will be able to use Aura, BABE, etc. (Not supported at the moment) + +A Parachain Collator will join the Parachain network and the relay-chain network. The Parachain +network will be used to gossip Parachain blocks and to gossip transactions. Collators will only +gossip blocks to the Parachain network that have a high chance of being included in the relay +chain. To prove that a block is probably going to be included, the Collator will send along side +the notification the so-called candidate message. This candidate message is issued by a Parachain +validator after approving a block. This proof of possible inclusion prevents spamming other collators +of the network with useless blocks. +The Collator joins the relay-chain network for two reasons. First, the Collator uses it to send the +Parachain blocks to the Parachain validators. Secondly, the Collator participates as light/full-node +of the relay chain to be informed of new relay-chain blocks. This information will be used for the +consensus and the block production logic. + +## Block building + +Polkadot requires that a Parachain block is transmitted in a fixed format. These blocks sent by a +Parachain to the Parachain validators are called proof-of-validity blocks (PovBlock). Such a +PovBlock contains the header and the transactions of the Parachain as opaque blobs (`Vec`). They +are opaque, because Polkadot can not and should not support all kinds of possible Parachain block +formats. Besides the header and the transactions, it also contains the witness data and the outgoing +messages. + +A Parachain validator needs to validate a given PoVBlock, but without requiring the full state of +the Parachain. To still make it possible to validate the Parachain block, the PovBlock contains the +witness data. The witness data is a proof that is collected while building the block. The proof will +contain all trie nodes that are read during the block production. Cumulus uses the witness data to +reconstruct a partial trie and uses this a storage when executing the block. + +The outgoing messages are also collected at block production. These are messages from the Parachain +the block is built for to other Parachains or to the relay chain itself. + +## Runtime Upgrade + +Every Substrate blockchain supports runtime upgrades. Runtime upgrades enable a blockchain to update +its state transition function without requiring any client update. Such a runtime upgrade is applied +by a special transaction in a Substrate runtime. Polkadot and Cumulus provide support for these +runtime upgrades, but updating a Parachain runtime is not as easy as updating a standalone +blockchain runtime. In a standalone blockchain, the special transaction needs to be included in a +block and the runtime is updated. + +A Parachain will follow the same paradigm, but the relay chain needs to be informed before +the update. Cumulus will provide functionality to notify the relay chain about the runtime update. The +update will not be enacted directly; instead it takes `X` relay blocks (a value that is configured +by the relay chain) before the relay chain allows the update to be applied. The first Parachain +block that will be included after `X` relay chain blocks needs to apply the upgrade. +If the update is applied before the waiting period is finished, the relay chain will reject the +Parachain block for inclusion. The Cumulus runtime pallet will provide the functionality to +register the runtime upgrade and will also make sure that the update is applied at the correct block. + +After updating the Parachain runtime, a Parachain needs to wait a certain amount of time `Y` +(configured by the relay chain) before another update can be applied. + +The WASM blob update not only contains the Parachain runtime, but also the `validate_block` +function provided by Cumulus. So, updating a Parachain runtime on the relay chain involves a +complete update of the validation WASM blob.