
Block Database #4027

Open · wants to merge 11 commits into master

Conversation

Contributor

@DracoLi DracoLi commented Jun 23, 2025

Why this should be merged

This PR introduces BlockDB, a specialized database optimized for block storage.

Avalanche VMs currently store blocks in a key-value database (LevelDB or PebbleDB). This approach is not optimal for block storage: large blocks trigger frequent compactions, and the resulting write amplification degrades performance as the database grows. In addition, KV databases are designed for random key-value access rather than the sequential access patterns typical of blockchain operations.

For how BlockDB works, see README.md.
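To give a sense of the intended usage, a minimal hypothetical sketch follows; blockdb.New, DefaultConfig, WriteBlock, ReadBlock, and Close are illustrative names and may not match the final API:

// Open (or create) a BlockDB, write a block at a height, and read it back.
db, err := blockdb.New(blockdb.DefaultConfig(), dir)
if err != nil {
    return err
}
defer db.Close()

// Blocks are keyed by height rather than by hash.
if err := db.WriteBlock(height, blockBytes); err != nil {
    return err
}
got, err := db.ReadBlock(height)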

Changes

  • Added blockdb to x/.
  • Updated our LRU cache to support an onEvict callback. blockdb uses this to cache open file descriptors for the data files (see the sketch below).
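A rough illustration of the onEvict idea; the constructor name and generic cache type here are assumptions for illustration, not the repo's actual API:

// Hypothetical sketch: an LRU whose eviction callback closes the evicted
// file handle so descriptors are not leaked once an entry falls out of cache.
onEvict := func(_ int, f *os.File) {
    if f != nil {
        _ = f.Close()
    }
}
fileCache := lru.NewCacheWithOnEvict[int, *os.File](maxOpenDataFiles, onEvict)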

How this was tested

Unit tests for now.

Todos

  • Split data across multiple files when MaxDataFileSize is reached
  • Compress data files to reduce storage size (will be done in a follow-up PR)
  • Add performance benchmarks (will be done in a follow-up PR)

Contributor

@Copilot Copilot AI left a comment

Pull Request Overview

This PR introduces BlockDB, a specialized on-disk database optimized for blockchain block storage with improved write performance and automatic recovery.

  • Implements dedicated tests for writing, reading, concurrency, and error cases.
  • Introduces recovery logic and index management for efficient block lookups, along with detailed documentation in the README.

Reviewed Changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 1 comment.

Summary per file:

  • x/blockdb/writeblock_test.go: Adds comprehensive tests verifying block writes, error conditions, and concurrency scenarios.
  • x/blockdb/recovery.go: Introduces recovery logic to reconcile the data and index file contents after crashes.
  • x/blockdb/readblock_test.go: Provides test coverage for reading full blocks, headers, and bodies in various conditions.
  • x/blockdb/index.go: Implements fixed-size index entries and header serialization/deserialization.
  • x/blockdb/database.go: Sets up file handling, header initialization, recovery trigger, and connection closure.
  • x/blockdb/block.go: Implements block header serialization, writing/reading blocks, and ensuring data integrity.
  • x/blockdb/config.go: Defines default and custom configuration options for the BlockDB.
  • x/blockdb/README.md: Documents design, file formats, recovery, and usage of BlockDB.

Contributor

@rkuris rkuris left a comment

Still unreviewed: recovery code and some of the block allocation logic, but there is enough here to get started with some changes.

│ Min Block Height      │ 8 bytes  │
│ Max Contiguous Height │ 8 bytes  │
│ Data File Size        │ 8 bytes  │
│ Reserved              │ 24 bytes │
Contributor

Why do we need a reserved area here?

Contributor Author

@DracoLi DracoLi Jun 25, 2025

I wanted to account for the possibility that future versions add features requiring more data in the header. If that happens, we can use this reserved space without needing to reindex.
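As a sketch of the layout quoted above (only the quoted fields are shown, and the Go types are assumed for illustration), the reserved bytes simply pad the fixed-size header so later versions can add fields in place:

// Fixed-size index file header fields from the quoted hunk. The trailing
// Reserved bytes keep the header size stable so future versions can add
// fields here without a reindex.
type indexFileHeader struct {
    MinBlockHeight      uint64   // 8 bytes
    MaxContiguousHeight uint64   // 8 bytes
    DataFileSize        uint64   // 8 bytes
    Reserved            [24]byte // unused today, available to future versions
}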

}
}

if s.nextDataWriteOffset.CompareAndSwap(currentOffset, newOffset) {
Contributor

Nice way of doing this! Presumably this is faster in the non-contention case than a mutex?

Contributor Author

@DracoLi DracoLi Jun 25, 2025

Yeah, this should be more lightweight.
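For readers unfamiliar with the pattern, here is a minimal sketch of the compare-and-swap reservation loop; the field name mirrors the hunk above, while the standalone function and its surroundings are assumed:

import "sync/atomic"

// reserveSpace atomically claims size bytes starting at the current write
// offset, retrying when another goroutine advances the offset first.
func reserveSpace(next *atomic.Uint64, size uint64) uint64 {
    for {
        cur := next.Load()
        if next.CompareAndSwap(cur, cur+size) {
            return cur // this goroutine now owns [cur, cur+size)
        }
        // CAS lost the race; reload the offset and try again.
    }
}

In the uncontended case this is a single atomic load and CAS, with no lock acquisition or goroutine parking, which is why it tends to beat a mutex there.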

Comment on lines 333 to 339
fileIndex := int(currentOffset / maxDataFileSize)
localOffset := currentOffset % maxDataFileSize

if localOffset+totalSize > maxDataFileSize {
writeOffset = (uint64(fileIndex) + 1) * maxDataFileSize
newOffset = writeOffset + totalSize
}
Contributor

I think this means that files other than the first one will not contain a header. Is this intentional? If so, it means the first file is always going to be opened and can never be deleted, which should be mentioned in the README.

Contributor Author

@DracoLi DracoLi Jun 25, 2025

Not every block will contain the header (blockSize includes the metadata header + block). We are only splitting the data files here, not the index file. This is just calculating the global next write offset if the current data file cannot fit the block.

Contributor Author

@DracoLi DracoLi Jun 26, 2025

The variable names could have been better. I updated this method to be clearer about what it's doing.
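For context, a sketch of the mapping from a global write offset to the data file and offset actually used, spilling to the next file when the entry does not fit; variable names follow the hunk above, and the enclosing method is assumed:

// placeWrite returns where the entry will be written (writeOffset) and the
// next global write offset (newOffset), starting a new data file when the
// entry would cross the MaxDataFileSize boundary.
func placeWrite(currentOffset, totalSize, maxDataFileSize uint64) (writeOffset, newOffset uint64) {
    fileIndex := currentOffset / maxDataFileSize
    localOffset := currentOffset % maxDataFileSize
    writeOffset = currentOffset
    if localOffset+totalSize > maxDataFileSize {
        // The current data file cannot fit this entry; start it at the
        // beginning of the next file.
        writeOffset = (fileIndex + 1) * maxDataFileSize
    }
    newOffset = writeOffset + totalSize
    return writeOffset, newOffset
}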

}

func (s *Database) getOrOpenDataFile(fileIndex int) (*os.File, error) {
if handle, ok := s.fileCache.Load(fileIndex); ok {
Contributor

I think we need some limit on the fileCache size, otherwise we could run out of file descriptors if the maxFileSize is pretty small and/or blocks are really big.

Contributor Author

@DracoLi DracoLi Jun 25, 2025

Good idea. I can set a 10k limit.
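A rough sketch of the bounded cache idea (the cache's Get/Put methods and the openDataFile helper are assumptions for illustration; only the 10k cap comes from this thread):

// Cap the number of cached descriptors so a small MaxDataFileSize or very
// large blocks cannot exhaust the process's file descriptor limit.
const maxCachedDataFiles = 10_000

func (s *Database) getOrOpenDataFile(fileIndex int) (*os.File, error) {
    if f, ok := s.fileCache.Get(fileIndex); ok {
        return f, nil
    }
    f, err := s.openDataFile(fileIndex) // hypothetical helper opening data file #fileIndex
    if err != nil {
        return nil, err
    }
    // Put may evict the least recently used entry; the cache's onEvict
    // callback closes that descriptor.
    s.fileCache.Put(fileIndex, f)
    return f, nil
}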

@DracoLi DracoLi requested a review from rkuris July 2, 2025 18:42
@DracoLi DracoLi changed the title from "[Draft] BlockDB" to "Block Database" Jul 2, 2025
@DracoLi DracoLi marked this pull request as ready for review July 2, 2025 21:12
@DracoLi DracoLi requested a review from yacovm July 2, 2025 21:17