Machine Learning Times
Machine Learning Times
EXCLUSIVE HIGHLIGHTS
A University Curriculum Supplement to Teach a Business Framework for ML Deployment
    In 2023, as a visiting analytics professor...
The AI Playbook: Providing Important Reminders to Data Professionals
 Originally published in DATAVERSITY. This article reviews the new...
Decode the Algorithm: Navigate the World of Machine Learning in Business with ‘The AI ​​Playbook’
  This article reviews the new book, The AI Playbook, by...
To Deploy Machine Learning, You Must Manage Operational Change—Here Is How UPS Got It Right
 Originally published in Harvard Data Science Review. For more...
SHARE THIS:

8 months ago
Don’t Let Yourself Be Fooled By Data Drift

 
Originally published in NannyML, May 31, 2023.

If you search for information on ML monitoring online, there is a good chance that you’ll come across various monitoring approaches advocating for putting data drift at the center of monitoring solutions.

While data drift detection is indeed a key component of a healthy monitoring workflow, we found that it is not the most important one. Data drift and its other siblings’, target, and prediction drift can misrepresent the state of an ML model in production.

The purpose of this blog post is to demonstrate that not all data drift impacts model performance. Making drift methods hard to trust since they tend to produce a large number of false alarms. To illustrate this point, we will train an ML model using a real-world dataset, monitor the distribution of the model’s features in production, and report any data drift that might occur.

After, we will present a new algorithm invented by NannyML that will significantly reduce these false alarms.

So, without further ado, let’s check the dataset used in this post.

Power consumption dataset

We use the Power Consumption of Tetouan City dataset, a real and open-source dataset. This data was collected by the Supervisory Control and Data Acquisition System (SCADA) of Amendis, a public service operator in charge of distributing drinking water and electricity in Morocco.

To continue reading this article, click here.

7 thoughts on “Don’t Let Yourself Be Fooled By Data Drift

  1. Pingback: Don't Be Fooled By Data Drift « Machine Learning Times - AI Consultancy

  2. Pingback: Don’t Let Yourself Be Fooled By Data Drift -

  3. Your writings stick out to me since the content is interesting and simple to understand. Even though I’ve read a lot of websites, I still like yours more. Your essay was interesting to read. I can understand the essay better now that I’ve read it carefully. In the future, I’d like to read more of your writing us map
    .

     
  4. Spend some time playing. I’m interested in finding out more because I have strong views about it. Would you please provide more details to your blog post? We will all actually gain from it. run 3

     

Leave a Reply