Q1: What happens from when a new block arrives to when it’s persisted on disk?
Overview
```
Block arrives (network / Engine API)
  │
  ▼
InsertChain()
  ├─ Contiguity check (sequential block numbers? parent hashes match?)
  ├─ Acquire write lock (chainmu)
  │
  │ Three things start concurrently:
  │ ├─ Background: ECDSA signature recovery (most CPU-intensive)
  │ ├─ Background: engine.VerifyHeaders() (consensus rule checks)
  │ └─ Foreground: process blocks one by one ↓
  │
  ▼
ProcessBlock() (for each block)
  ├─ ① Create StateDB from parent's state root
  ├─ ② processor.Process() — execute all transactions (Chapter 6)
  ├─ ③ validator.ValidateState() — verify gas, receipt root, state root
  └─ ④ Write to disk ↓
  │
  ▼
writeBlockAndSetHead()
  ├─ writeBlockWithState() — persist block + state
  ├─ reorg() (if needed) — switch canonical chain
  ├─ writeHeadBlock() — update head pointers
  └─ Emit events (ChainEvent, ChainHeadEvent)
```
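The contiguity check at the top of InsertChain() rejects out-of-order batches before any heavy work starts. A minimal sketch of the idea, assuming `chain` is the `[]*types.Block` argument (illustrative, not geth's exact code):

```go
// Sanity check: every block must directly extend its predecessor.
for i := 1; i < len(chain); i++ {
	prev, cur := chain[i-1], chain[i]
	if cur.NumberU64() != prev.NumberU64()+1 || cur.ParentHash() != prev.Hash() {
		return 0, fmt.Errorf("non-contiguous insert: block %d does not extend block %d",
			cur.NumberU64(), prev.NumberU64())
	}
}
```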
Parallelization design

Three things happen concurrently to maximize throughput:
Signature recovery runs in background goroutines, performing ECDSA recovery on all transactions. This is the most CPU-intensive part of validation — starting early means results are ready by the time execution needs them.
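A hedged sketch of the idea: types.MakeSigner and types.Sender are real go-ethereum APIs (geth actually routes this through an internal sender cacher), and the goroutine wrapper here is illustrative:

```go
// Warm the per-transaction sender cache in the background.
signer := types.MakeSigner(bc.chainConfig, block.Number(), block.Time())
go func() {
	for _, tx := range block.Transactions() {
		// ECDSA recovery; the result is memoized on the tx, so the
		// later lookup during execution is a cheap cache hit.
		types.Sender(signer, tx)
	}
}()
```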
Header verification also runs in parallel. The consensus engine checks each header’s fields (timestamp, gas limit, base fee, difficulty…). Results arrive via channel to the main loop.
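The interface is channel-based. VerifyHeaders() hands back an abort channel plus a results channel (that shape matches geth's consensus.Engine); the consuming loop below is a simplified illustration:

```go
abort, results := bc.engine.VerifyHeaders(bc, headers)
defer close(abort) // cancel outstanding verifications if we bail out early

for i := range headers {
	// Read block i's verdict just before executing block i, so later
	// headers keep verifying in the background during execution.
	if err := <-results; err != nil {
		return i, err
	}
	// ... execute block i ...
}
```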
Block execution must be sequential — each block’s state depends on the previous block’s result, so it cannot be parallelized.
Three layers of persistence
writeBlockWithState() writes across three layers:
```
Layer 1: Block data (atomic batch)
  ├─ rawdb.WriteBlock()     — header + body
  ├─ rawdb.WriteReceipts()  — receipts
  └─ rawdb.WritePreimages() — preimage mappings
  → batch.Write() (all succeed or all fail)

Layer 2: State commit
  └─ statedb.Commit() — dirty state → trie database

Layer 3: Trie GC
  ├─ Archive node: flush to disk every block
  └─ Full node: keep last 128 blocks' tries in memory, older ones eligible for GC
```
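Layer 1's all-or-nothing behavior comes from the database batch: every helper stages its writes into the same batch, and a single Write() flushes them. A condensed sketch using the real rawdb helpers (error handling trimmed to the essential):

```go
batch := bc.db.NewBatch()
rawdb.WriteBlock(batch, block)                                        // header + body
rawdb.WriteReceipts(batch, block.Hash(), block.NumberU64(), receipts) // receipts
rawdb.WritePreimages(batch, statedb.Preimages())                      // preimage mappings
if err := batch.Write(); err != nil {
	log.Crit("Failed to write block into disk", "err", err) // unrecoverable
}
```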
Updating the chain head

writeHeadBlock() atomically writes five markers:
```go
batch := bc.db.NewBatch()
rawdb.WriteHeadHeaderHash(batch, block.Hash())     // head header
rawdb.WriteHeadFastBlockHash(batch, block.Hash())  // snap sync head
rawdb.WriteCanonicalHash(batch, block.Hash(), num) // block number → hash mapping
rawdb.WriteTxLookupEntriesByBlock(batch, block)    // tx hash → block number
rawdb.WriteHeadBlockHash(batch, block.Hash())      // head block
batch.Write()

// Then update the in-memory atomic pointer
bc.currentBlock.Store(block.Header())
```

Because the pointers are atomic.Pointer, readers (RPC, tx pool, etc.) immediately see the new head without waiting for locks.
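For illustration, the read side is a single atomic load (CurrentBlock() is a real BlockChain accessor; body simplified):

```go
// CurrentBlock returns the head of the canonical chain without locking.
func (bc *BlockChain) CurrentBlock() *types.Header {
	return bc.currentBlock.Load()
}
```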
Engine API’s two-step path
The Engine API uses a slightly different path:
```
NewPayload        → InsertBlockWithoutSetHead() — validate + store, don't move head
ForkchoiceUpdated → SetCanonical()              — move head to specified block
```

Why two steps? Because the CL may ask you to validate multiple blocks before telling you which one is canonical. Block validation and chain head selection are decoupled.
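Sketched in code, the decoupling looks roughly like this (the BlockChain method names are geth's, but the handler bodies are heavily simplified and the exact signatures vary between versions):

```go
// engine_newPayload: validate and persist, but leave the head alone.
func (api *ConsensusAPI) newPayload(block *types.Block) error {
	return api.eth.BlockChain().InsertBlockWithoutSetHead(block)
}

// engine_forkchoiceUpdated: the CL names the canonical head explicitly.
func (api *ConsensusAPI) forkchoiceUpdated(head *types.Block) error {
	_, err := api.eth.BlockChain().SetCanonical(head)
	return err
}
```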
Q2: How does chain reorganization (reorg) work?
When does a reorg happen?
When a new block’s parent is not the current chain head, a reorg is needed. For example:
Current canonical chain: 1 → 2 → 3 → A4 → A5 (current head)
New block B5 arrives, parent is B4, B4's parent is 3:
```
             ┌→ A4 → A5  (old head)
1 → 2 → 3 ──┤
             └→ B4 → B5  (new head)
```

Geth needs to switch the canonical chain from fork A to fork B.
Algorithm: finding the common ancestor
```
Step 1: Bring both chains to the same height
  Old chain: A5 (height 5)
  New chain: B5 (height 5)
  → Already equal, skip

Step 2: Walk both back simultaneously until the hashes match
  A5 vs B5 → different, continue
  A4 vs B4 → different, continue
  3  vs 3  → match! Common ancestor = block 3

Result:
  Old chain only: [A4, A5]
  New chain only: [B4, B5]
```

If the two chains have different heights, the longer one is walked back to match the shorter one first, then both are walked back in lockstep.
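A minimal sketch of that search, assuming a hypothetical parent() helper that loads a header's parent (the real reorg() interleaves this walk with collecting the affected blocks, transactions, and logs):

```go
func findCommonAncestor(oldHead, newHead *types.Header) *types.Header {
	// Step 1: bring both chains to the same height.
	for oldHead.Number.Uint64() > newHead.Number.Uint64() {
		oldHead = parent(oldHead)
	}
	for newHead.Number.Uint64() > oldHead.Number.Uint64() {
		newHead = parent(newHead)
	}
	// Step 2: walk both back in lockstep until the hashes match.
	for oldHead.Hash() != newHead.Hash() {
		oldHead = parent(oldHead)
		newHead = parent(newHead)
	}
	return oldHead // the common ancestor
}
```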
The switch process
After finding the common ancestor, five steps execute the switch:
1. Collect logs from old chain blocks → send RemovedLogsEvent (notify subscribers these logs are no longer valid)
2. Collect transaction hashes from old chain blocks → deletion list
3. Iterate new chain blocks in forward order, for each:
   ├─ writeHeadBlock() — update canonical hash mapping and tx lookup index
   └─ Collect reborn logs → send to logsFeed
4. Clean up tx index: Deleted txs = old chain txs - new chain txs (some txs may exist in both chains — don't accidentally delete those)
5. Clear tx lookup LRU cache (may hold stale data)

Concrete example
```
Old chain: block A4 contains tx1, tx2; block A5 contains tx3
New chain: block B4 contains tx1, tx4; block B5 contains tx5

Old chain txs: {tx1, tx2, tx3}
New chain txs: {tx1, tx4, tx5}

Need to delete index for: {tx2, tx3} (old-chain-only)
tx1 exists in both chains → keep
tx4, tx5 are new → create index
```
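Applied to this example, step 4's set difference boils down to the following (hedged sketch; rawdb.DeleteTxLookupEntry is the real helper, the surrounding bookkeeping is simplified):

```go
deletedTxs := make(map[common.Hash]struct{})
for _, tx := range oldChainTxs { // tx1, tx2, tx3
	deletedTxs[tx.Hash()] = struct{}{}
}
for _, tx := range newChainTxs { // tx1, tx4, tx5
	delete(deletedTxs, tx.Hash()) // tx1 appears in both chains → keep its index
}
for hash := range deletedTxs { // tx2, tx3 remain
	rawdb.DeleteTxLookupEntry(batch, hash)
}
```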
Interaction with the transaction pool

After reorg completes, ChainHeadEvent is sent. The transaction pool receives it and executes Reset() (covered in Chapter 8):
```
TxPool Reset:
  1. Find common ancestor (similar logic to reorg)
  2. Transactions from old-chain-only blocks → re-inject into pool
  3. promoteExecutables()  → promote executable transactions
  4. demoteUnexecutables() → demote invalid transactions
```

So tx2 and tx3 from A4/A5 don't vanish — they return to the transaction pool, waiting to be re-included in future blocks.
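A hedged sketch of step 2, the reinjection (pool.add and the slice names are hypothetical stand-ins for the legacy pool's internal reset helpers):

```go
var reinject types.Transactions
included := make(map[common.Hash]struct{})
for _, b := range newChainOnlyBlocks { // blocks applied by the reorg
	for _, tx := range b.Transactions() {
		included[tx.Hash()] = struct{}{}
	}
}
for _, b := range oldChainOnlyBlocks { // blocks unwound by the reorg
	for _, tx := range b.Transactions() {
		if _, ok := included[tx.Hash()]; !ok {
			reinject = append(reinject, tx) // e.g. tx2, tx3
		}
	}
}
pool.add(reinject) // hypothetical: back into the pending/queued sets
```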
Large reorg warning
Reorgs deeper than 63 blocks trigger log.Warn("Large chain reorg detected") to alert operators. Under normal conditions, reorgs are only 1-2 blocks deep; excessive depth usually indicates network issues or consensus failures.
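The check in geth's reorg() looks roughly like this (condensed from core/blockchain.go; some log fields omitted):

```go
logFn, msg := log.Info, "Chain reorg detected"
if len(oldChain) > 63 {
	logFn, msg = log.Warn, "Large chain reorg detected"
}
logFn(msg, "number", commonBlock.Number(), "hash", commonBlock.Hash(),
	"drop", len(oldChain), "add", len(newChain))
```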