Predictive Modeling Forensics: Identifying Data Problems


Excerpted and modified from Chapters 3 and 4 of Mr. Abbott’s book Applied Predictive Analytics (Wiley, 2014).

The Data Understanding stage of a predictive analytics project is intended to uncover the characteristics of the data available for predictive modeling. One key part of Data Understanding is what we might call a Data Audit, in which every field is summarized. One purpose of a Data Audit is to uncover potential problems with the data, such as missing or implausible values, that should be corrected during data preparation.
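To make the idea of summarizing every field concrete, here is a minimal sketch in plain Python. It is not taken from the book: the function names `audit_field` and `audit`, the set of markers treated as missing, and the sample rows are all assumptions made up for illustration. For each field it reports the record count, the number of missing values, the number of distinct values, and either min/max/mean (when every present value parses as a number) or the most common value.

```python
from collections import Counter
from statistics import mean

# Assumed markers for "missing"; real data may use other codes.
MISSING = {None, "", "NA", "NULL"}

def audit_field(values):
    """Summarize one field: counts, missing values, and basic statistics."""
    present = [v for v in values if v not in MISSING]
    summary = {
        "count": len(values),
        "missing": len(values) - len(present),
        "distinct": len(set(present)),
    }
    # Try to interpret every present value as a number.
    numeric = []
    for v in present:
        try:
            numeric.append(float(v))
        except (TypeError, ValueError):
            break  # at least one non-numeric value: treat field as categorical
    if present and len(numeric) == len(present):
        summary.update(min=min(numeric), max=max(numeric), mean=mean(numeric))
    elif present:
        # Most frequent value and how often it occurs.
        summary["top"] = Counter(present).most_common(1)[0]
    return summary

def audit(rows):
    """Audit every field of a list of dict records (fields taken from the first row)."""
    fields = list(rows[0]) if rows else []
    return {f: audit_field([r.get(f) for r in rows]) for f in fields}

# Made-up sample records, including a blank and a suspicious age of 999.
rows = [
    {"age": "34", "state": "CA"},
    {"age": "", "state": "CA"},
    {"age": "51", "state": "NY"},
    {"age": "999", "state": ""},
]
report = audit(rows)
for field, summary in report.items():
    print(field, summary)
```

In this toy data, the audit surfaces one missing value in each field and a maximum age of 999, the kind of value that might be a placeholder code rather than a real measurement and would merit a closer look during data preparation.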

