Skip to content

Blockdaemon/agave-snapshot-gossip-client

Repository files navigation

Solana Snapshot Delivery Network (SSDN) Gossip Client

Overview

The SSDN Gossip Client is part of a solution to decouple snapshot serving from validator operations in the Solana network. It enables scalable snapshot distribution without impacting validator performance.

Problem Statement

Current snapshot distribution has several limitations:

  • Everyone (generally) uses the same 4 recommended KVs from the recommended command lines, effectively centralizing the network
  • Those same validators stop serving snapshots effectively once there is more than 2 or 3 concurrent requests
  • Existing validators and RPC providers are overwhelmed with snapshot requests, and they are limited in number due to the added performance overhead
  • Snapshot serving impacts critical validator operations
  • Snapshot serving impacts critical RPC provider operations
  • No scalability during high-demand periods (e.g., network restarts)
  • No dedicated infrastructure for snapshot distribution

Solution

This project, along with agave-snapshot-uploader, provides a scalable snapshot delivery network by:

  1. Decoupling snapshot distribution from validator/RPC operations
  2. Reusing the existing discovery mechanism for snapshot sources
  3. Maintaining compatibility with existing validator clients
  4. Supporting flexible deployment options

Key Features

  • Gossip network integration for discovery
  • Can be used as an entrypoint for the gossip network
  • Minimum required JSONRPC endpoint compatibility for snapshot distribution (getSlot, getVersion, getGenesisHash)
  • Additional non-standard JSONRPC methods (getNumPeers, getShredVersion, getPublicKey)
  • Native Agave support for ip_echo public IP and shred version discovery (client and server)
  • Health endpoint on http://localhost:8899/health
  • STUN support for public IP detection
  • UPnP support for NAT traversal

Architecture

The system supports three deployment models:

  1. Independent Gossip Client

    • Separate gossip client (or RPC server only)
    • Uploader sidecar on validator
  2. Independent Client + Reverse Proxy

    • Separate gossip client (or RPC server only)
    • Reverse proxy
    • Uploader sidecar on validator
  3. Simplified Legacy Setup - No Gossip Client

    • No separate gossip client
    • All components on single host
    • Reverse proxy
    • Uploader sidecar

Architecture Diagram

Configuration

Quick Start (if not using the Debian package)

  1. Generate a keypair (if needed):

    solana-keygen new -o keypair.json
  2. Create a config.toml file (optional, see example-config.toml)

  • The only required setting is the storage_path to the snapshot location. The default is storage in the current working directory (/var/www in Debian packages), and not a remote host.
  • To use a different network, simply set the network field to "devnet", "testnet", or "mainnet". The correct entrypoints and expected genesis hash will be automatically configured.
  • For advanced users, you can still manually configure entrypoints and expected_genesis_hash if needed.
  1. Run the client:
    cargo run -r
    or for minimal logging:
    RUST_LOG=warn cargo run -r
    or for more verbose logging:
    RUST_LOG=h2=off,hyper_util=off,solana_metrics=off,solana_gossip::cluster_info=off,debug cargo run -r

See the Installation Guide file for more information on installing the Debian package or using docker.

Configuration Options

Use --config <path> to specify a custom config file location. Default is config.toml in the current working directory.

Option Default/Debian default Description
network testnet "devnet", "testnet", or "mainnet"
entrypoints Follow network Gossip entrypoint override
expected_genesis_hash Follow network Expected genesis hash override
shred_version None Expected shred version override
keypair_path keypair.json Path to keypair file
listen_ip 0.0.0.0 Local bind/listen IP
public_ip Auto (STUN) Public IP address
enable_stun false Use STUN to discover public IP instead of ip_echo
stun_server stun.l.google.com:3478 STUN server address
disable_gossip false/true Disable gossip client. See RPC-Only Mode.
gossip_port 8001 Gossip listen port
rpc_port 8899 RPC listen port
enable_upnp false Enable UPnP port forwarding
storage_path storage//var/www Redirect/proxy target dir or URL
enable_proxy false Reverse proxy GET requests instead of redirecting

shred_version is used when joining the gossip network. If you have issues connecting to gossip, try setting it to the correct network value. If not specified, the gossip client will attempt to autodetect it, but that is unreliable.

expected_genesis_hash is used to verify the given storage_path is valid for the network. If not specified, no checking will be done.

See Solana Cluster Information for the correct values.

Hardware Requirements (gossip enabled, mainnet)

  • 4GB to 8GB RAM
  • 4 to 8 CPU Cores

Network Requirements

  • 100Mbps to 200Mbps connection
  • 50Mbps sustained bi-directional bandwidth (16TB per month)
  • Static public IP address
  • Ingress allow/forward list:
    • TCP/UDP port 8001 (gossip) - if you need to be a publicly reachable gossip entrypoint (optional)
    • TCP port 8899 (RPC) - A publicly reachable RPC endpoint is required for validators to accept snapshots from you

Note: STUN-based IP detection and UPnP port forwarding are not recommended for production. Configure port firewall/forwarding rules manually. IP detection will be done via ip_echo to each entrypoint by default.

IP resolution preference order:

  1. public_ip from user config (if provided)
  2. IP echo result (if public_ip not provided)
  3. STUN result (only if IP echo fails and STUN is enabled)

Shred version resolution:

  1. If both configured and discovered versions exist and differ, it's an error
  2. Use whichever version is available (either configured or discovered)
  3. Return None if neither version is available

Note that even when public_ip is configured, IP echo is still attempted to get the shred version.

Explicit public_ip and shred_version configuration is always checked against ip_echo results.

Known Issues

  • --debug builds may be unstable and have significantly higher memory and CPU usage. Use -r or --release to avoid this.
  • Large crate dependency footprint (900+).
  • Excessive memory and CPU usage for large gossip networks (Issue 53).
  • We do not periodically renew the UPnP port mappings, so if the router expires it, you may lose connectivity if you rely on on it (issue #11).
  • For a detailed analysis of the benefits, limitations, and production considerations of the SSDN implementation, please see Design Tradeoffs.

License

Apache 2.0

About

Solana Snapshot Delivery Network (SSDN) lightweight gossip client and proxy

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors 2

  •  
  •