Data Understanding:
Collect initial data and describe it
Exploratory data analysis
Verify the data quality
Data Preparation may include any or all of the following:
select the data and document rationale for inclusion of exclusion
clean the data
construct attributes (derived) and/or generate records when needed
integrate and merge data sources
reformat as needed
describe the data to be used for modeling
Data Quality Life-Cycle
Data Discovery: Requirement gathering, source application identification, data collection, organization, and data quality report classification
Data Profiling: Initial examination, sample data quality check, rule suggestion, and approval of final data quality rule
Data Rules: Execution of final business rule to examine accuracy of the data, and its fit for purpose
Data Distribution and Remediation: Process of distributing the data quality reports to the responsible parties and start of remediation process
Data Monitoring: Ongoing monitoring of remediation process, and creation of data quality dashboards and score cards