Machine Learning Times
Machine Learning Times
EXCLUSIVE HIGHLIGHTS
Survey: Machine Learning Projects Still Routinely Fail to Deploy
 Originally published in KDnuggets. Eric Siegel highlights the chronic...
Three Best Practices for Unilever’s Global Analytics Initiatives
    This article from Morgan Vawter, Global Vice...
Getting Machine Learning Projects from Idea to Execution
 Originally published in Harvard Business Review Machine learning might...
Eric Siegel on Bloomberg Businessweek
  Listen to Eric Siegel, former Columbia University Professor,...
SHARE THIS:

10 months ago
Don’t Let Yourself Be Fooled By Data Drift

 
Originally published in NannyML, May 31, 2023.

If you search for information on ML monitoring online, there is a good chance that you’ll come across various monitoring approaches advocating for putting data drift at the center of monitoring solutions.

While data drift detection is indeed a key component of a healthy monitoring workflow, we found that it is not the most important one. Data drift and its other siblings’, target, and prediction drift can misrepresent the state of an ML model in production.

The purpose of this blog post is to demonstrate that not all data drift impacts model performance. Making drift methods hard to trust since they tend to produce a large number of false alarms. To illustrate this point, we will train an ML model using a real-world dataset, monitor the distribution of the model’s features in production, and report any data drift that might occur.

After, we will present a new algorithm invented by NannyML that will significantly reduce these false alarms.

So, without further ado, let’s check the dataset used in this post.

Power consumption dataset

We use the Power Consumption of Tetouan City dataset, a real and open-source dataset. This data was collected by the Supervisory Control and Data Acquisition System (SCADA) of Amendis, a public service operator in charge of distributing drinking water and electricity in Morocco.

To continue reading this article, click here.

7 thoughts on “Don’t Let Yourself Be Fooled By Data Drift

  1. Pingback: Don't Be Fooled By Data Drift « Machine Learning Times - AI Consultancy

  2. Pingback: Don’t Let Yourself Be Fooled By Data Drift -

  3. Your writings stick out to me since the content is interesting and simple to understand. Even though I’ve read a lot of websites, I still like yours more. Your essay was interesting to read. I can understand the essay better now that I’ve read it carefully. In the future, I’d like to read more of your writing us map
    .

     
  4. Spend some time playing. I’m interested in finding out more because I have strong views about it. Would you please provide more details to your blog post? We will all actually gain from it. run 3

     

Leave a Reply