Profiling Data

Foundation automatically analyzes and profiles your data to provide deep insights into data quality, structure, and characteristics, enabling proactive data quality management and informed decision-making.

How Data Profiling Works

When Data Products are updated, the platform automatically examines each dataset to generate comprehensive statistical profiles:

  • Numeric columns: Min/max values, mean, median, standard deviation, and distribution patterns

  • Text columns: Unique values, frequency distributions, pattern detection, and length analysis

  • Date/time columns: Range boundaries, temporal patterns, and gap detection

  • Completeness metrics: Null counts, missing value patterns, and fill rates

Reading the Data Profiling results through the Data Catalog

  1. Navigate to the Data Catalog page

  2. Search for the Data Product you want to see the profile for

  3. Click on the Profile tab

The Profile tab displays, for every column in the data product:

  • Data type

  • Random example values

  • Unique values

  • Percentage of null values

  • Min

  • Max

  • Other statistics depending on the data type

You can click on the "+19 more" for the example values list to display more examples.

Reading the Data Profiling results through the API

Foundation offers API endpoints to read the Profile of a Data Product:

/api/data/data_product/quality/profiling
/api/data/data_product/

Read the Using the Foundation APIs page to leverage this option in the best way possible.

Last updated