History modes

History modes allow a node to require less disk storage. Indeed, depending on the chosen history mode, some parts of the complete chain history can be deleted as they are not required anymore.

Three history modes are provided:

  • Full (default mode with 1 additional cycle)

    The node stores the minimal data since the genesis required to reconstruct (or ‘replay’) the complete chain’s ledger state.

    • Upsides:

      • Can synchronize using a snapshot.

      • Keep all necessary information in order to reconstruct all the chain’s ledger state (balances, contracts, etc..) since the genesis block.

      • Requires little disk storage.

      • Suitable for bakers as you can still query any block information or operation at any level.

      • Help other nodes to bootstrap and synchronize with the chain.

    • Downsides:

      • The node is not able to query the balances or staking rights before the current checkpoint.

      • Disk storage slowly increases as the node keeps the history.

    See how to set up a full node.

  • Rolling

    This is the lightest mode as it only maintains a minimal rolling fragment of the chain data so the node can still validate new blocks and synchronize with the head.

    • Upsides

      • Only requires a minimal and bounded disk storage.

      • Can run on low resources architectures.

      • Can be bootstrapped within minutes, thanks to a rolling snapshot import.

    • Downsides

      • The node is not able to query block information of balances and staking rights before the current checkpoint.

      • The node does not help other nodes to bootstrap as it is not able to send the whole chain history.

    See how to set up a rolling node.

  • Archive

    This is the heaviest mode as it keeps the whole chain data to be able to query any information stored on the chain since the genesis. It is particularly suitable for indexers or block explorers.

    • Upsides

      • The whole chain data is available.

    • Downsides

      • Consume an increasing and rather large amount and data storage.

    See how to set up an archive node.

History modes in a nutshell

Storage amount

Suitable for bakers

Operations history

Archive

High

Yes

Complete

Full

Limited

Yes

Complete

Rolling

Low

Restricted*

Restricted*

(*) Suitable for delegation services if the number of additional kept cycles is high enough to allow the computation of rewards. See Keeping additional cycles for details.

History modes use some markers which are used to describe the state of the storage:

  • checkpoint: the last allowed fork level of the chain (as defined in the Mavryk position paper),

  • savepoint: the last known block which contains metadata,

  • caboose: the last known block.

For more details about what data are stored in each mode, refer to The storage layer.

Setting up a node in full mode

To run a full node you can either use the command line arguments:

mavkit-node run --history-mode full

or use your configuration file as described in here:

{ "shell": {
    "history_mode": "full"
}}

Note that, since the full mode is the default one, this configuration is optional.

You can then verify that your history mode is set to full by using the checkpoint RPC.

mavkit-client rpc get /chains/main/checkpoint
{ "block": { "some": "data" },
   "savepoint": 4096, "caboose": 0, "history_mode": "full" }

In full mode, the savepoint is the last block which contains its metadata. The caboose is the last known block which is pruned (that contains partial data).

Setting up a node in rolling mode

To run a rolling node you can either use the command line arguments:

mavkit-node run --history-mode rolling

or use your configuration file as described in here:

{ "shell": {
    "history_mode": "rolling"
}}

In rolling mode, the caboose is the genesis at its early state, and then, it is updated to the last known block of the rolling window. The savepoint is moved in accordance to the number of configured additional cycles.

$ tezos rpc get /chains/main/checkpoint

Setting up a node in archive mode

To run an archive node you can use the command line arguments: $ mavkit-node run --history-mode archive

Or the configuration file: { "shell": {"history_mode": "archive"} }

If you want to start an archive node, it is now mandatory to pass this argument the first time you launch your node. Indeed, there are some restrictions when switching from one mode to another.

In archive mode, both the savepoint and caboose are located down to the genesis.

Keeping additional cycles

When running a node in full or rolling mode, you have a full access to the block information in a sliding window of history. Indeed, at each new cycle, a garbage collection phase removes the ledger state and the block metadata (operation receipts, rewards updates, etc.) of blocks outside the offset of this sliding window. Depending on the network, a minimum number of cycles are kept. These cycles correspond to the ones above the last allowed fork level, containing blocks subjects to a potential chain reorganization (this minimal number of cycles is currently given by the preserved_cycles protocol parameter, which on mainnet is currently set to 5 cycles). However, the node is able to keep an additional number of cycles that is configurable.

By default, 1 additional cycle is kept for both full and rolling nodes. It is possible to increase this parameter to keep more history or, on the contrary, decrease it to reduce the storage size. For example, it is possible to run a node with 5 additional cycles. On mainnet, this would total 10 cycles of complete history (approximately four weeks), as we keep 5 cycles beyond the minimal number of cycles, that is 5 + 5 = 10.

When initializing your node on an empty storage, you may specify the history mode and number of additional cycles using --history-mode <HISTORY_MODE>:<NB_CYCLES> when running it. For example, running a node with --history-mode rolling:5 would allow full RPC queries of the 10 previous cycles.

It is also possible to modify the number of additional cycles kept of a previously configured node, see Switch mode restrictions. When updating the number of additional cycles to keep on an already configured node, one must consider that the change may require time. If the update aims to shrink the number of additional cycles to keep, it is just a matter of deleting some data, and the operation is performed instantaneously. If the update aims to increase the number of additional cycles to keep, one must consider that the switch will be complete only after waiting for the end of the aforementioned number of cycles. Indeed, the “cycles window” will be expanded as blocks/cycles are processed by the node and it won’t fetch blocks prior to the previous limit. For example, switching from 5 to 7 cycles requires to wait 2 complete cycles to reach the target number of cycles to keep. On the contrary, switching from 7 to 5 is instantaneous.

Switching between node’s modes

It is possible to switch between history modes and/or to modify the number of additional cycles. To do so, it is necessary to restart the node with the desired history mode and add the flag --force-history-mode-switch. This flag is required to prevent erroneous history switches. Indeed, changing from one history mode to an other can irremediably remove data from the storage. The history mode switches must be manipulated with care.

However, as the different modes rely on different storage schemes, there are some restrictions when switching from one mode to another.

From/to

Archive

Full

Rolling

Archive

X

Yes

Yes

Full

Yes*

Yes

Yes

Rolling

No

No

Yes

(*) Switching from a full node to an archive one is possible using the reconstruct feature. To do so, run mavkit-node reconstruct on your node. Note that the storage reconstruction is a long process that, on the main network, may require more than a week to complete. Reconstruction also requires a machine with at least 16GB of memory (for the flattening of the context storage operation) and takes up about 1TB of storage once completed.