
perf: txpool parallel stateless validation and pre-read state before promotion#9

Open
beamuu wants to merge 5 commits into kub-chain:main from beamuu:main

Conversation

@beamuu (Contributor)

@beamuu beamuu commented Apr 1, 2026

TxPool Optimization

Background

The transaction pool (core/tx_pool.go) processes incoming transactions from the network. Under distributed traffic — many unique sender addresses each sending one transaction — two bottlenecks emerged:

  1. Stateless checks run under the write lock. Checks such as transaction size, transaction type, gas price floor, and fee cap validation do not touch shared pool state, yet they execute sequentially inside pool.mu.Lock() along with every other validation step.

  2. Cold statedb reads hold the write lock during promotion. promoteExecutables calls GetNonce and GetBalance for every queued account while pool.mu is held. For N unique senders with uncached state this means N sequential disk/trie reads inside the critical section, blocking all concurrent transaction submissions for the full duration.


Changes

Change A — Parallel stateless validation (validateTxStateless)

File: core/tx_pool.go

A new validateTxStateless function extracts all checks from validateTx that do not require pool.currentState:

  • Transaction type support (EIP-2718 / EIP-1559 fork flags)
  • Maximum transaction size
  • Negative value guard
  • Gas limit vs pool.currentMaxGas
  • Fee cap / tip sanity (bit length, tip ≤ feecap)
  • Gas price floor vs pool.gasPrice
  • Sender recovery (already concurrently cached by senderCacher)

addTxs now runs validateTxStateless concurrently across all incoming transactions before acquiring pool.mu. Transactions that fail are rejected early. Only survivors reach the locked phase where stateful checks (nonce ordering, balance sufficiency) and insertion happen.

Before:
  lock → [stateless + stateful + insert] × N  (sequential)

After:
  [stateless] × N  (concurrent, no lock)
  → filter
  lock → [stateful + insert] × survivors  (sequential)

Change B — Pre-read account state before promotion (runReorg)

File: core/tx_pool.go

runReorg now reads nonces and balances for all queued addresses outside pool.mu before calling promoteExecutables. The results are passed as a pre-fetched map, and promoteExecutables uses those values instead of calling statedb directly.

Before:
  pool.mu.Lock()
    for addr in N accounts:
      GetNonce(addr)    ← disk/trie, serialized under write lock
      GetBalance(addr)  ← disk/trie, serialized under write lock
      promote(addr)
  pool.mu.Unlock()

After:
  pool.mu.RLock()
  addrs := keys(pool.queue)
  pool.mu.RUnlock()

  for addr in addrs:             ← I/O outside write lock
    prefetch[addr] = {nonce, balance}

  pool.mu.Lock()
    for addr in accounts:
      use prefetch[addr]         ← map lookup, nanoseconds
      promote(addr)
  pool.mu.Unlock()

The total number of statedb reads is unchanged. The write lock hold time drops from O(N × disk_latency) to O(N × map_lookup).


What Was Not Changed

  • txNoncer caching logic
  • txPricedList heap operations
  • Signature recovery (already batched and concurrent)
  • Queue / pending data structures
  • All consensus and validation semantics

Benchmark

Running the benchmarks

# Run all three scenarios
go test ./core/ -run "TestTxPoolBenchmark" -v -count=1

# Run a specific scenario
go test ./core/ -run "TestTxPoolBenchmark_ManyUniqueSenders" -v -count=1

Scenarios

| Scenario | Senders | Txs per sender | Total txs | What it measures |
|---|---|---|---|---|
| SingleSender | 1 | 200 | 200 | Hot/warm baseline — state cached after first read |
| ManyUniqueSenders | 200 | 1 | 200 | Cold path — every sender is a fresh address |
| MixedLoad | 50 | 4 | 200 | Realistic blend of new and returning accounts |

Comparing before and after

# On the benchmark commit (original code):
git checkout <benchmark-commit>
go test ./core/ -run "TestTxPoolBenchmark" -v -count=3 2>&1 | grep BENCH

# On the optimization commit:
git checkout <optimization-commit>
go test ./core/ -run "TestTxPoolBenchmark" -v -count=3 2>&1 | grep BENCH

The ManyUniqueSenders scenario shows the largest improvement because it maximises cold statedb reads — exactly the path targeted by both optimizations.

@beamuu beamuu requested review from SakuBoyz and kongrath April 1, 2026 10:31
@beamuu beamuu self-assigned this Apr 1, 2026
