Block header pruning

shamil-gadelshin · April 3, 2024, 10:45am

The block header pruning seems like a logical continuation for the state and block pruning and appears like a nice feature to have.

How to implement block header pruning correctly, though?

Our initial experiments show that it required several steps:

change fork calculation logic (the previous algorithm worked on the premise of always existing block headers)
add the block header pruning by:
- removing data from DB using columns::KEY_LOOKUP, columns::HEADER, and block hash.
- removing data from block header in-memory cache
- removing data from header metadata in-memory cache (the last two using remove_header_metadata)
fix the metadata saving algorithm: prune_blocks_on_finalize_and_reorg test marks several blocks in a row as finalized and commits the transaction later in contrast with prune_blocks_on_finalize which finalized every block in a separate transaction. If we modify prune_blocks_on_finalize_and_reorg test to query a header from the pruned fork we will get one, however, the database won’t contain it. It seems that the cache gets polluted by dirty reads originating in meta updates because meta updates are saved separately and after the transaction.

I would appreciate comments.

shamil-gadelshin · April 4, 2024, 8:05am

Topic		Replies	Views
Polkadot state sync Tech Talk	0	307	July 28, 2023
Archive RPC-V2 Methods Tech Talk	0	35	December 12, 2024
Altering Polkadot's fork-choice to reduce DA Load Tech Talk	12	826	July 17, 2023
Polkadot Summit 24' - PoV-Reclaim & Elastic Scaling Ecosystem polkadot-summit	0	219	April 4, 2024
Polkadot Release Analysis v0.9.36 Ecosystem release-analysis	4	3234	January 5, 2023