add pollHealthChecker interface for optional RPC health checks #83
Krish-vemula wants to merge 5 commits into main from
Conversation
Add optional interface for chain-specific RPC clients to run extra health checks during alive-loop polling. Failures count toward poll failure threshold. Enables chain integrations to detect issues like missing historical state.
…r finalized state availability with configurable threshold and regex-based error classification.
multinode/config/config.go
Outdated
	return c.MultiNode.FinalizedStateCheckEnabled != nil && *c.MultiNode.FinalizedStateCheckEnabled
}

func (c *MultiNodeConfig) FinalizedStateCheckAddress() string {
If properly implemented, this should never panic because of the nil value. If we see a panic, it's an early signal that config overrides are not working as expected.
I agree that, in general, we should be cautious and check for nils, but in this case we should follow the common config structure to keep things consistent and spot issues early.
lggr.Tracew("Pinging RPC", "nodeState", n.State(), "pollFailures", pollFailures)
pollCtx, cancel := context.WithTimeout(ctx, pollInterval)
version, pingErr := n.RPC().ClientVersion(pollCtx)
if pingErr == nil {
This is redundant with the new logic, no?
multinode/node_lifecycle.go
Outdated
	finalizedStateFailures++
}
lggr.Warnw("Finalized state not available", "err", stateErr, "failures", finalizedStateFailures, "threshold", finalizedStateCheckFailureThreshold)
if finalizedStateCheckFailureThreshold > 0 && finalizedStateFailures >= finalizedStateCheckFailureThreshold {
IMO the `finalizedStateCheckFailureThreshold > 0` check is redundant, since we already control the health check via `finalizedStateCheckEnabled`.
multinode/node_lifecycle.go
Outdated
}
lggr.Warnw("Finalized state not available", "err", stateErr, "failures", finalizedStateFailures, "threshold", finalizedStateCheckFailureThreshold)
if finalizedStateCheckFailureThreshold > 0 && finalizedStateFailures >= finalizedStateCheckFailureThreshold {
	lggr.Errorw("RPC node cannot serve finalized state after consecutive failures", "failures", finalizedStateFailures)
Let's introduce a metric similar to PollsFailed to have better visibility into the failure rate.
multinode/node_lifecycle.go
Outdated
case <-time.After(dialRetryBackoff.Duration()):
	lggr.Tracew("Trying to re-dial RPC node", "nodeState", n.getCachedState())

	err := n.rpc.Dial(ctx)
Use createVerifiedConn and wait for at least one successful poll of CheckFinalizedStateAvailability.
multinode/node_lifecycle.go
Outdated
// isFinalizedStateUnavailableError checks if the error indicates that the RPC cannot serve
// historical state (as opposed to an RPC reachability issue).
// If regexPattern is empty, all errors are treated as state unavailable errors.
func isFinalizedStateUnavailableError(err error, regexPattern string) bool {
Classification should be done in evm
	return nodeStateUnreachable == node.State()
})
})
t.Run("optional poll health check failure counts as poll failure and transitions to unreachable", func(t *testing.T) {
Add a test to verify that an RPC can be marked as nodeStateFinalizedStateNotAvailable and then marked alive again.
Summary
Adds an optional PollHealthCheck method to the RPCClient interface, enabling chain-specific RPC clients to perform additional health checks during node pool polling. Failures from this check count toward the PollFailureThreshold, allowing automatic detection of and failover from unhealthy RPC nodes.

Supports: #352