155 changes: 155 additions & 0 deletions .github/workflows/nightly_tests.yaml
@@ -14,8 +14,163 @@ permissions:
  contents: read
  pull-requests: read
  id-token: write
  packages: write

jobs:
  # Chain Integrity Test with 3 block generators
  chainintegrity:
    name: Chain Integrity Test
    runs-on: teranode-runner-16-core
    timeout-minutes: 30
    steps:
      - name: Checkout code
        uses: actions/checkout@08eba0b27e820071cde6df949e0beb9ba4906955 # v4

      - name: Set up Go
        uses: actions/setup-go@d35c59abb061a4a6fb18e82ac0862c26744d6ab5 # v5
        with:
          go-version: ${{ env.GO_VERSION }}
          cache: false

      - name: Set up Docker Buildx
        uses: docker/setup-buildx-action@e468171a9de216ec08956ac3ada2f0791b6bd435 # v3

      - name: Login to GitHub Container Registry
        uses: docker/login-action@5e57cd118135c172c3672efd75eb46360885c0ef # v3
        with:
          registry: ghcr.io
          username: ${{ github.actor }}
          password: ${{ secrets.GITHUB_TOKEN }}

      - name: Build and load Docker image locally
        uses: docker/build-push-action@4f58ea79222b3b9dc2c8bbdd6debcef730109a75 # v6
        with:
          context: .
          load: true
          tags: teranode:latest
          cache-from: type=gha
          cache-to: type=gha,mode=max

      - name: Remove old data and recreate directories
        run: |
          rm -rf data
          mkdir -p data/postgres

      - name: Start Teranode nodes with 3 block generators (docker compose up)
        run: docker compose -f compose/docker-compose-3blasters.yml up -d

Contributor

Comprehensive nightly test workflow

The new workflow adds an important long-running integration test with 3 concurrent block generators. Key features:

  1. Error monitoring: Checks container logs for ERROR messages at 5-second intervals with timestamp tracking
  2. Height consensus: Waits for all 3 nodes to reach height 120+ before declaring success
  3. Chain integrity validation: Uses the chainintegrity tool to verify log consistency across nodes
  4. Proper cleanup: Always runs docker compose down and uploads artifacts on failure

Minor suggestions:

  • Consider making REQUIRED_HEIGHT configurable via a workflow input for flexibility (a rough sketch follows below)
  • The 10-minute timeout (MAX_ATTEMPTS=120 * SLEEP=5s) may be tight on slower runners; consider documenting the expected runtime
  • Line 66: leaving since_param empty on the first check is clever, but it deserves a comment explaining why
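
A rough sketch of the first suggestion, assuming a workflow_dispatch trigger is added alongside the workflow's existing schedule; the input name and default below are illustrative, not part of this PR:

    on:
      # added alongside the existing nightly schedule trigger
      workflow_dispatch:
        inputs:
          required_height:
            description: "Minimum block height every node must reach"
            required: false
            default: "120"

    # in the "Wait for mining to complete" step, fall back to 120 on scheduled runs:
    # REQUIRED_HEIGHT="${{ inputs.required_height || '120' }}"

On a schedule-triggered run inputs.required_height is empty, so the || '120' fallback keeps today's behaviour.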

      - name: Wait for mining to complete (all nodes at height 120+ and in sync)
        run: |
          set -e
          REQUIRED_HEIGHT=120
          MAX_ATTEMPTS=120 # 10 minutes with 5s sleep
          SLEEP=5

          # Function to check for errors in all teranode container logs at once
          check_errors() {
            # Get current time for this check
            local current_time
            current_time=$(date -u +"%Y-%m-%dT%H:%M:%SZ")

            # Check for errors - if last_check_time is empty, it will check all logs
            local since_param=""
            if [ ! -z "$last_check_time" ]; then
              since_param="--since=$last_check_time"
            fi

            # Single command pattern that works for both initial and subsequent checks
            local errors
            errors=$(docker compose -f compose/docker-compose-3blasters.yml logs --no-color $since_param teranode1 teranode2 teranode3 | grep -i "| ERROR |" || true)

            # Update timestamp for next check
            last_check_time=$current_time

            if [[ ! -z "$errors" ]]; then
              echo "ERROR: Found error logs in teranode containers:"
              echo "$errors"
              return 1
            fi
            return 0
          }

          # Initialize empty for first check to get all logs
          last_check_time=""

          for ((i=1; i<=MAX_ATTEMPTS; i++)); do
            h1=$(curl -s http://localhost:18090/api/v1/bestblockheader/json | jq -r .height)
            h2=$(curl -s http://localhost:28090/api/v1/bestblockheader/json | jq -r .height)
            h3=$(curl -s http://localhost:38090/api/v1/bestblockheader/json | jq -r .height)
            echo "Attempt $i: heights: $h1 $h2 $h3"

            # Check for errors in all teranode containers
            if ! check_errors; then
              echo "Errors found in container logs. Exiting."
              exit 1
            fi

            if [[ -z "$h1" || -z "$h2" || -z "$h3" ]]; then
              if [[ $i -gt 10 ]]; then
                echo "Error: One or more nodes are not responding after 10 attempts. Exiting."
                exit 1
              else
                echo "Warning: One or more nodes are not responding. Continuing..."
              fi
            fi
            if [[ "$h1" =~ ^[0-9]+$ && "$h2" =~ ^[0-9]+$ && "$h3" =~ ^[0-9]+$ ]]; then
              if [[ $h1 -ge $REQUIRED_HEIGHT && $h2 -ge $REQUIRED_HEIGHT && $h3 -ge $REQUIRED_HEIGHT ]]; then
                echo "All nodes have reached height $REQUIRED_HEIGHT or greater."
                exit 0
              fi
            fi
            sleep $SLEEP
          done
          echo "Timeout waiting for all nodes to reach height $REQUIRED_HEIGHT."
          exit 1

      - name: Collect Docker container logs
        if: failure()
        run: |
          mkdir -p container-logs
          containers=$(docker ps -a --format "{{.Names}}")
          for container in $containers; do
            echo "Collecting logs for container: $container"
            docker logs "$container" > "container-logs/$container.log" 2>&1 || true
          done

      - name: Upload container logs
        if: failure()
        uses: actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02 # v4
        with:
          name: container-logs
          path: container-logs/

      - name: Stop Teranode nodes (docker compose down for teranode-1/2/3)
        run: docker compose -f compose/docker-compose-3blasters.yml down teranode1 teranode2 teranode3

      - name: Build chainintegrity binary
        run: make build-chainintegrity

      - name: Run chainintegrity test
        run: ./chainintegrity.run --logfile=chainintegrity --debug | tee chainintegrity_output.log

      - name: Check for hash mismatch and fail if found
        run: |
          if grep -q "All filtered log file hashes differ! No majority consensus among nodes." chainintegrity_output.log; then
            echo "Chain integrity test failed: all log file hashes differ, no majority consensus."
            exit 1
          fi

      - name: Upload chainintegrity logs
        uses: actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02 # v4
        with:
          name: chainintegrity-logs
          path: |
            chainintegrity*.log
            chainintegrity*.filtered.log

      - name: Cleanup (docker compose down)
        if: always()
        run: docker compose -f compose/docker-compose-3blasters.yml down

  run-daemon-tests:
    runs-on: teranode-runner-16-core
    steps:
53 changes: 30 additions & 23 deletions services/blockassembly/BlockAssembler.go
@@ -1422,6 +1422,10 @@ func (b *BlockAssembler) getMiningCandidate() (*model.MiningCandidate, []*subtre
  // Returns:
  // - error: Any error encountered during reorganization
  func (b *BlockAssembler) handleReorg(ctx context.Context, header *model.BlockHeader, height uint32) error {
+     if err := ctx.Err(); err != nil {
+         return errors.NewProcessingError("context", err)
+     }
Contributor

Context cancellation check added but reorg work continues

The early context cancellation check at line 1425 is good defensive programming. However, if the context is cancelled during reorg processing (e.g., in the middle of fetching block headers or updating subtrees), the cleanup operations may be left incomplete.

Consider: Does the reorg logic handle partial state correctly if cancelled midway? The error handling returns early, but are there any state mutations that should be rolled back?
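
A minimal, hypothetical sketch of the concern (the type and the two phase methods are illustrative stand-ins, not the actual BlockAssembler API): a cancellation that lands between the move-back and move-forward phases leaves the first mutation applied with nothing compensating for it.

    package main

    import (
        "context"
        "fmt"
    )

    // assembler stands in for the real block assembler; movedBack marks the
    // state mutation performed by the first phase.
    type assembler struct{ movedBack bool }

    func (a *assembler) moveBackBlocks(ctx context.Context) error    { a.movedBack = true; return nil }
    func (a *assembler) moveForwardBlocks(ctx context.Context) error { return nil }

    // reorg mirrors the shape under discussion: an early ctx.Err() check,
    // then two state-mutating phases with no rollback in between.
    func (a *assembler) reorg(ctx context.Context) error {
        if err := ctx.Err(); err != nil {
            return err // cheap early exit, nothing mutated yet
        }
        if err := a.moveBackBlocks(ctx); err != nil { // phase 1 mutates state
            return err
        }
        if err := ctx.Err(); err != nil {
            // Phase 1 already ran; returning here leaves the assembler
            // "between chains" unless the caller rolls back or retries.
            return fmt.Errorf("reorg interrupted after move-back: %w", err)
        }
        return a.moveForwardBlocks(ctx) // phase 2
    }

    func main() {
        ctx, cancel := context.WithCancel(context.Background())
        cancel() // simulate cancellation arriving before the reorg starts
        fmt.Println((&assembler{}).reorg(ctx)) // prints "context canceled"
    }

Whether the real handleReorg needs a rollback depends on whether its move-back and move-forward updates are safe to retry, which is exactly the question raised above.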

Contributor

Why was this added? Should this not be taken care of by the for/select loop where handleReorg is called?
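
For reference, a hypothetical caller of the kind this comment alludes to (the channel, event type, and wiring are illustrative, not the actual blockassembly code); the select already exits on ctx.Done(), so an in-function ctx.Err() check mainly covers a cancellation that races the channel receive:

    package main

    import (
        "context"
        "log"
    )

    type blockEvent struct{ height uint32 }

    // run is the kind of for/select loop the comment refers to: cancellation is
    // handled at the caller, and handleReorg is invoked per event.
    func run(ctx context.Context, events <-chan blockEvent, handleReorg func(context.Context, blockEvent) error) {
        for {
            select {
            case <-ctx.Done():
                return // caller-level cancellation handling
            case ev := <-events:
                if err := handleReorg(ctx, ev); err != nil {
                    log.Printf("reorg at height %d aborted: %v", ev.height, err)
                }
            }
        }
    }

    func main() {
        ctx, cancel := context.WithCancel(context.Background())
        events := make(chan blockEvent, 1)
        events <- blockEvent{height: 42}

        run(ctx, events, func(c context.Context, ev blockEvent) error {
            defer cancel() // stop the loop after the first event
            return c.Err() // the early check added in this PR, replicated here
        })
    }

Even with such a loop, the in-function check is cheap insurance if handleReorg is ever reachable from other call paths.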


      startTime := time.Now()

      prometheusBlockAssemblerReorg.Inc()
@@ -1567,44 +1571,47 @@ func (b *BlockAssembler) getReorgBlockHeaders(ctx context.Context, header *model
      // are necessarily going to be on the same height

      baBestBlockHeader, baBestBlockHeight := b.CurrentBlock()
-     startingHeight := height
      if baBestBlockHeader == nil {
          return nil, nil, errors.NewProcessingError("best block header is nil, reorg not possible")
      }

-     if height > baBestBlockHeight {
-         startingHeight = baBestBlockHeight
+     startingHeight := baBestBlockHeight
+     if height < startingHeight {
+         startingHeight = height
      }

      // Get block locator for current chain
      currentChainLocator, err := b.blockchainClient.GetBlockLocator(ctx, baBestBlockHeader.Hash(), startingHeight)
      if err != nil {
-         return nil, nil, errors.NewServiceError("error getting block locator for current chain", err)
+         return nil, nil, errors.NewServiceError("error getting current chain block locator", err)
      }

      // Get block locator for the new best block
      newChainLocator, err := b.blockchainClient.GetBlockLocator(ctx, header.Hash(), startingHeight)
      if err != nil {
-         return nil, nil, errors.NewServiceError("error getting block locator for new chain", err)
+         return nil, nil, errors.NewServiceError("error getting new chain block locator", err)
      }

      // Find common ancestor using locators
-     var (
-         commonAncestor *model.BlockHeader
-         commonAncestorMeta *model.BlockHeaderMeta
-     )
+     newChainLocatorSet := make(map[chainhash.Hash]struct{}, len(newChainLocator))
+     for _, h := range newChainLocator {
+         newChainLocatorSet[*h] = struct{}{}
+     }

+     var commonAncestorHash *chainhash.Hash
      for _, currentHash := range currentChainLocator {
-         for _, newHash := range newChainLocator {
-             if currentHash.IsEqual(newHash) {
-                 commonAncestor, commonAncestorMeta, err = b.blockchainClient.GetBlockHeader(ctx, currentHash)
-                 if err != nil {
-                     return nil, nil, errors.NewServiceError("error getting common ancestor header", err)
-                 }
-
-                 goto FoundAncestor
-             }
+         if _, ok := newChainLocatorSet[*currentHash]; ok {
+             commonAncestorHash = currentHash
+             break
          }
      }

- FoundAncestor:
+     if commonAncestorHash == nil {
+         return nil, nil, errors.NewProcessingError("common ancestor not found, reorg not possible")
+     }
+
+     commonAncestor, commonAncestorMeta, err := b.blockchainClient.GetBlockHeader(ctx, commonAncestorHash)
+     if err != nil {
+         return nil, nil, errors.NewServiceError("error getting common ancestor header", err)
+     }
+
      if commonAncestor == nil || commonAncestorMeta == nil {
          return nil, nil, errors.NewProcessingError("common ancestor not found, reorg not possible")
      }
@@ -1654,7 +1661,7 @@ FoundAncestor:
      maxGetReorgHashes := b.settings.BlockAssembly.MaxGetReorgHashes
      if len(filteredMoveBack) > maxGetReorgHashes {
          currentHeader, currentHeight := b.CurrentBlock()
-         b.logger.Errorf("reorg is too big, max block reorg: current hash: %s, current height: %d, new hash: %s, new height: %d, common ancestor hash: %s, common ancestor height: %d, move down block count: %d, move up block count: %d, current locator: %v, new block locator: %v", currentHeader.Hash(), currentHeight, header.Hash(), height, commonAncestor.Hash(), commonAncestorMeta.Height, len(filteredMoveBack), len(moveForwardBlockHeaders), currentChainLocator, newChainLocator)
+         b.logger.Errorf("reorg is too big, max block reorg: current hash: %s, current height: %d, new hash: %s, new height: %d, common ancestor hash: %s, common ancestor height: %d, move down block count: %d, move up block count: %d", currentHeader.Hash(), currentHeight, header.Hash(), height, commonAncestor.Hash(), commonAncestorMeta.Height, len(filteredMoveBack), len(moveForwardBlockHeaders))
          return nil, nil, errors.NewProcessingError("reorg is too big, max block reorg: %d", maxGetReorgHashes)
      }
