Conclusion first: the Solana indexer itself is not stuck; the stream in run_stream is.
Observation:
This only happened on node 11 and node 4.
I looked into the Solana indexer hang on Mar 19 on node 11. The Grafana dashboard shows both the number of Solana requests and the block height stuck.
But our GCP logs clearly show that node 11 kept finding Solana requests throughout the time window that Grafana shows as stuck.
In our codebase, we emit the latest block height metric only when a block event is emitted from the stream (code). The number-of-requests metric comes from multichain_sign_request_latency_sec, which is only emitted in record_request_latency, and that is not emitted if a signature generator has not been started.
So my guess is that both metrics froze because the stream in run_stream stopped yielding events, while the Solana indexer itself kept running.
Fix:
I'm not sure yet why the stream gets stuck. Ideas? @volovyks @jakmeier @ChaoticTempest
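Until we know the root cause, one option might be a stall watchdog around the stream consumer, so that a silent stream is at least logged instead of only showing up as flat metrics. Below is a minimal sketch of that idea, not our actual code: it uses a std-only channel as a stand-in for the stream in run_stream (which I assume is async in the real code), and `BlockEvent`, `StreamExit`, and `drain_with_watchdog` are hypothetical names made up for illustration.

```rust
use std::sync::mpsc::{Receiver, RecvTimeoutError};
use std::time::Duration;

/// Why the consumer loop stopped (hypothetical type for this sketch).
#[derive(Debug, PartialEq)]
enum StreamExit {
    /// The producer side was dropped, so the stream ended normally.
    Closed,
    /// No event arrived within the timeout; treat the stream as stuck.
    Stalled,
}

/// Hypothetical stand-in for a block event; only the height matters here.
#[derive(Debug, Clone, Copy, PartialEq)]
struct BlockEvent {
    height: u64,
}

/// Consume events until the channel closes or stays silent for
/// `stall_timeout`. `on_event` is where the block height metric
/// would be updated in the real consumer.
fn drain_with_watchdog(
    events: &Receiver<BlockEvent>,
    stall_timeout: Duration,
    mut on_event: impl FnMut(BlockEvent),
) -> StreamExit {
    loop {
        match events.recv_timeout(stall_timeout) {
            Ok(event) => on_event(event),
            // Producer is still alive but silent: report a stall.
            Err(RecvTimeoutError::Timeout) => return StreamExit::Stalled,
            // Producer was dropped: normal end of stream.
            Err(RecvTimeoutError::Disconnected) => return StreamExit::Closed,
        }
    }
}
```

On `Stalled`, the caller could log an error, bump a dedicated counter, or restart the stream. With that in place, a flat block height in Grafana could be told apart from a stream that silently stopped yielding.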