Dataflow run metrics

Each individual dataflow run shows the following details:

  • Dataflow run start: The time at which the dataflow run started.
  • Processing time: The amount of time that the dataflow run took to process.
  • Records Received: The total number of records received in the dataflow from a source connector.
  • Records Ingested: The total count of records ingested into Data Lake.
  • Records with Warnings: The total count of ingested records that have warnings. All mapper transformation errors are reported as warnings, and rows that are partially ingested are labeled as successful with a warning. Note: Support for ingesting records with warnings is available only for streaming sources.
  • Records Failed: The number of records that were not ingested into Data Lake due to errors in the data.
  • Ingestion Rate: The success rate of records ingested into Data Lake. This metric is applicable when Partial Ingestion is enabled.
  • Status: The state of the dataflow run, either Completed or Processing. Completed means that all records for the corresponding dataflow run were processed within the one-hour period. Processing means that the dataflow run has not yet finished.
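
The relationship between these metrics can be illustrated with a minimal sketch. It assumes that Ingestion Rate is the share of received records that were ingested, expressed as a percentage; the exact formula is not stated here, so treat this as an approximation. The DataflowRunMetrics class and its field names are hypothetical: they mirror the labels above and do not represent an actual API schema.

```python
from dataclasses import dataclass


@dataclass
class DataflowRunMetrics:
    # Hypothetical container: field names mirror the UI labels,
    # not a real API response.
    records_received: int
    records_ingested: int
    records_with_warnings: int
    records_failed: int

    def ingestion_rate(self) -> float:
        """Assumed definition: ingested records as a percentage of received records."""
        if self.records_received == 0:
            # Avoid division by zero for runs that received no records.
            return 0.0
        return 100.0 * self.records_ingested / self.records_received


# Illustrative values only, not taken from a real dataflow run.
run = DataflowRunMetrics(
    records_received=1000,
    records_ingested=950,
    records_with_warnings=20,
    records_failed=50,
)
print(f"Ingestion rate: {run.ingestion_rate():.1f}%")
```

With these illustrative values, 950 of the 1,000 received records were ingested, so the assumed rate is 95.0%.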

The Dataflow run overview page contains additional information on your dataflow, such as its corresponding dataflow run ID, target dataset, and organization ID.

A dataflow run with errors also contains the Dataflow run errors panel, which displays the specific error that caused the run to fail, as well as the total count of records that failed.
