Profiling Data

Foundation automatically analyzes and profiles your data to provide deep insights into data quality, structure, and characteristics, enabling proactive data quality management and informed decision-making.

How Data Profiling Works

When Data Products are updated, the platform automatically examines each dataset to generate comprehensive statistical profiles:

Numeric columns: Min/max values, mean, median, standard deviation, and distribution patterns
Text columns: Unique values, frequency distributions, pattern detection, and length analysis
Date/time columns: Range boundaries, temporal patterns, and gap detection
Completeness metrics: Null counts, missing value patterns, and fill rates

Reading the Data Profiling results through the Data Catalog

Navigate to the Data Catalog page
Search for the Data Product you want to see the profile for
Click on the Profile tab

The Profile tab displays, for every column in the data product:

Data type
Random example values
Unique values
Percentage of null values
Min
Max
Other statistics depending on the data type

You can click on the "+19 more" for the example values list to display more examples.

Reading the Data Profiling results through the API

Foundation offers API endpoints to read the Profile of a Data Product:

/api/data/data_product/quality/profiling

/api/data/data_product/

Read the Using the Foundation APIs page to leverage this option in the best way possible.

PreviousSetting up Custom Data Quality Checks for a Data Product NextExploring Data Lineage

Last updated 4 months ago

hashtagHow Data Profiling Works

hashtagReading the Data Profiling results through the Data Catalog

hashtagReading the Data Profiling results through the API

How Data Profiling Works

Reading the Data Profiling results through the Data Catalog

Reading the Data Profiling results through the API