This repository was archived by the owner on May 18, 2021. It is now read-only.

Description
Operationally, there are some obvious things to measure per flow node. These should be exposed via /metrics if they aren't already:
DB connectivity:
- number of active pool connections (vs. idle)
- sql span histograms for journalling
One upper limit on how many concurrent stage operations we can sustain per second is (max pool connections) / <sql query span>.
Executor connectivity:
- number of active fn invocations the executor is waiting on.
Error counts:
- fn failures
- db errors
- lower-level errors: eg, socket availability (we might conceivably bump into this if we have a naive http/1.1 connection to the fn api).