Anomalies vs Outliers Anomaly detection, or finding needles in a haystack, is an important tool in data exploration and unsupervised analytic modeling. Anomaly detection also creates a path to supervised modeling by singling out key examples that an analyst can begin to classify as needles or hay. Those labeled examples are essential for supervised learning, which is much more powerful than unsupervised learning methods like clustering. Though anomaly and outlier are often used interchangeably we’d like to emphasize distinct definitions. As Ravi Parikh describes well in a blog post, “An outlier is a legitimate data point that’s far away from
To view this content
OR subscribe for free
Already receive the Predictive Analytics Times emails?
Click here to complete this one-time subscription upgrade
As of January 2014, the Predictive Analytics Times now requires legacy email subscribers to upgrade their subscription - one time only - in order to attain a password-protected login and gain complete access.