Profile Diff

Overview

Easily compare data profiles across sources to ensure consistency throughout your pipeline or between different environments.

By leveraging intelligent profiling techniques, this approach enables you to confidently manage and synchronize data integrity throughout your entire data pipeline, ensuring that data remains consistent and reliable whether you're comparing production and staging environments, undertaking database migrations, or simply verifying that different data sources align seamlessly with your expected data behavior.

Compare data sources

Select the Profile Diff tab - this can be either from the left side menu or from the tab inside a specific data source.

Once the tab is open, a view allowing you to select two different sources and times appears on the screen.

Profile diff before selecting data sources and times

Choosing Profile Diff from within a specific data source page automatically selects it for comparison.

Use the selection drop downs to choose the data sources that will be compares as well as the time for comparison. The times presented are the times in which a data contract was updated for a specific data source.

Notice - data sources created from pivot fields are also presented in the view and can be compared like any regular data source.

Data source selection including "derived" data sources created by pivots.

Once both data sources and dates are selected, a "diff" view for the two profiles is displayed. This view includes both the metadata metrics (freshness and volume) as well as a smart comparison generated for each of the fields based on the field profiles.

Profile diff between two data source in staging and production

For each of the monitored metrics, a comparison between the two data sources is created. If the profiles for a metric don't match for the two data sources, it is marked in red.

Compared metrics for numeric fields with high cardinality

Clicking on any of the compared metric will open a graph showing the profiles over time with a clear comparison. This comparison can be viewed as a unified graph or as two separate graphs.

Profile comparison for a specific field

Last updated