Data Profile Plan
At a high level, a Data Profile Plan documents which types of profiling are performed and which data sources and datasets are used. Data profiling falls into three categories:
- Structure Discovery: validates that data is consistent and correctly formatted, and performs mathematical checks on the data, such as min or sum, or the percentage of personnel who don’t have an SSN.
- Content Discovery: identifies systemic issues in the data, for example, personnel SSNs missing the last 4 digits.
- Relationship Discovery: discovers how parts of the data are interrelated, for example, key relationships between database tables, referential integrity, and cardinality.
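As a rough illustration of the three categories, the following minimal Python sketch runs one check of each kind against a small in-memory dataset. Everything in it is an assumption made for illustration: the `personnel` and `departments` tables, their column names, the `NNN-NN-NNNN` SSN format, and the function names are hypothetical and not part of any particular profiling tool.

```python
import re

# Hypothetical sample data; table and column names are illustrative only.
personnel = [
    {"id": 1, "ssn": "123-45-6789", "dept_id": 10, "salary": 52000},
    {"id": 2, "ssn": None,          "dept_id": 10, "salary": 61000},
    {"id": 3, "ssn": "987-65",      "dept_id": 20, "salary": 48000},  # last 4 digits missing
    {"id": 4, "ssn": "555-12-3456", "dept_id": 99, "salary": 75000},  # unknown department
]
departments = [{"dept_id": 10}, {"dept_id": 20}, {"dept_id": 30}]

# Assumed SSN layout: three digits, two digits, four digits.
SSN_PATTERN = re.compile(r"\d{3}-\d{2}-\d{4}")


def structure_profile(rows):
    """Structure discovery: basic statistics and completeness of a column."""
    salaries = [r["salary"] for r in rows]
    missing = sum(1 for r in rows if not r["ssn"])
    return {
        "salary_min": min(salaries),
        "salary_sum": sum(salaries),
        "pct_missing_ssn": 100.0 * missing / len(rows),
    }


def content_profile(rows):
    """Content discovery: ids of rows whose SSN is present but malformed."""
    return [
        r["id"]
        for r in rows
        if r["ssn"] and not SSN_PATTERN.fullmatch(r["ssn"])
    ]


def relationship_profile(children, parents, key):
    """Relationship discovery: orphaned foreign keys and child rows per parent."""
    parent_keys = {p[key] for p in parents}
    orphans = sorted({c[key] for c in children} - parent_keys)
    counts = {k: 0 for k in parent_keys}
    for c in children:
        if c[key] in counts:
            counts[c[key]] += 1
    return {"orphans": orphans, "children_per_parent": counts}


if __name__ == "__main__":
    print(structure_profile(personnel))
    print(content_profile(personnel))
    print(relationship_profile(personnel, departments, "dept_id"))
```

In this sketch the orphan list stands in for a referential integrity check, and the per-parent counts give a simple view of cardinality. A real plan would typically run equivalent checks against the actual data sources rather than in-memory lists.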