In the data audit process, canned routines are programmed that can provide analytics practitioners with their first glimpse regarding data quality. Three canned programming routines look at the following: A random sample of 100 records Data Diagnostics and Meta-data reports that look at missing values, unique values as well as a variety of statistical diagnostics for each field. Frequency distribution reports for each field The end objective of these canned routines is to determine the data quality of each field from each file and to determine whether or not this source data will be useful in future analytics exercises. After
Already receive the Machine Learning Times emails? The Machine Learning Times now requires legacy email subscribers to upgrade their subscription - one time only - in order to attain a password-protected login and gain complete access.