Home Pages: PAW Business | PAW Financial | PAW Healthcare | PAW Industry 4.0 | PAW Climate | Deep Learning World
Agenda
Predictive Analytics World for Business 2021
May 24-28, 2021 – Livestreamed
To view the full 7-track agenda for the six co-located conferences at Machine Learning Week click here or for the individual conference agendas here: PAW Business, PAW Financial, PAW Healthcare, PAW Industry 4.0, PAW Climate or Deep Learning World.
TRACK TOPICS – The two PAW Business tracks cover these topics:
Operationalization, management, best practices
Track 1
Machine learning methods & advanced topics
Track 2
TRACK TOPICS – The two PAW Business tracks cover these topics:
Operationalization, management, best practices
Track 1
Machine learning methods & advanced topics
Track 2
Session Levels:
Blue circle sessions are for All Levels
Red triangle sessions are Expert/Practitioner Level
Orange square sessions are Practitioner Level
Workshops - Wednesday, May 19th, 2021
Full-day: 7:15am – 2:30pm PDT
This one-day session surveys standard and advanced methods for predictive modeling (aka machine learning).
Full-day: 7:30am – 3:30pm PDT
Gain experience driving R for predictive modeling across real examples and data sets. Survey the pertinent modeling packages.
Full-day: 8:00am – 3:00pm PDT
This one day workshop reviews major big data success stories that have transformed businesses and created new markets.
Workshops - Thursday, May 20th, 2021
Full-day: 8:00am – 3:00pm PDT
This workshop dives into the key ensemble approaches, inc zluding Bagging, Random Forests, and Stochastic Gradient Boosting.
Full-day: 8:00am – 3:00pm PDT
Python leads as a top machine learning solution – thanks largely to its extensive battery of powerful open source machine learning libraries. It’s also one of the most important, powerful programming languages in general.
Full-day: 8:00am – 3:00pm PDT
Machine learning improves operations only when its predictive models are deployed, integrated and acted upon – that is, only when you operationalize it.
Full-day: 8:30am – 3:30pm PDT
Gain the power to extract signals from big data on your own, without relying on data engineers and Hadoop specialists.
Workshops - Friday, May 21st, 2021
Full-day: 7:15am – 2:30pm PDT
This one-day session reveals the subtle mistakes analytics practitioners often make when facing a new challenge (the “deadly dozen”), and clearly explains the advanced methods seasoned experts use to avoid those pitfalls and build accurate and reliable models.
Predictive Analytics World for Business - Virtual - Day 1 - Monday, May 24th, 2021

As companies continue to increase their investment in Machine Learning (ML) and Artificial Intelligence (AI), the focus is increasingly on how to make machine learning pervasive, creating business value by embedding it throughout a company's operations. But what does it take to get out of Proof of Concept purgatory and go from pilots to operational deployments? Do we need more technology, new roles, a different approach or what?
In this session, James will kick off the operational track with a discussion of what it takes to get value at scale from ML. He'll talk about critical issues and how to address them. He'll outline a proven approach that will let non-digital natives and even the most risk-averse companies operationalize and get value from their machine learning investments. Whatever your motivation for using machine learning and no matter how mature (or not) your machine learning team is, you'll learn how to succeed.
Traditionally, successful predictive analytics initiatives have relied on the application of complex algorithms to a lot of numbers and interpreting the output into meaningful language that clients can easily understand. This presentation aims to turn that premise on its head. Starting with a lot of words provided by consumer panelists, NCP analyzed the text provided in their feedback to us and translated those words into numbers. These numbers added up to value for National Consumer Panel. This presentation will share lessons learned from combining text analytics insights with more traditional approaches leveraging quantitative analytics. Attendees can expect to learn how NCP's experience may be extrapolated to provide insights on other business questions.
In this talk Dr. Anzelc will provide an overview of her own career path from particle physics PhD to insurance analytics executive to people analytics leader, providing a view of how she approached her job search and developed her career path from individual contributor to team leader. Practical advice on what to do, resources available, and other non-academic career paths of physicists will also be shared.
With COVID-19 shifting customer behavior the demand for Marriott’s Home and Villas offering has exploded. In this session we will share how deep neural network based Natural Language Generation techniques were used to support Marriott Homes and Villas generating property titles automatically. We will explore different techniques for title generation, provide performance comparisons and lessons learned to activate the model.
Join your fellow practitioners in this interactive session where you can exchange approaches to shared challenges and hear how your peers are tackling similar issues.
Predictive Analytics World for Business - Virtual - Day 2 - Tuesday, May 25th, 2021
Grab your real coffee and share experiences virtually with your peers to explore the new challenges of operating in a largely virtual world. Just like pre-show breakfast in a regular conference you’ll join a “round table” with seven fellow attendees and see where the conversation takes you.
Due diligence prior to model deployment involves identifying bias risks, features associated with group differences, and potential missing information that could mitigate disparities. Stereotyping, or ascribing common traits to all individuals in a group, is a particular risk for machine learning models, which generate predictions based on feature similarities. Here, I explore stereotyping in regression or classification models. I demonstrate that common fairness metrics are unable to distinguish stereotyping from decisions based on arguably reasonable factors. Therefore, Paychex has developed a “due diligence” script to guide data scientists in assessing disparities. This process uses model outputs and metrics to identify areas of risk, then Shapley explainers with custom references identify features driving differences. Additional reasoning is suggested to investigate the underlying causes of inequalities, consider the effects of missing information that can mitigate risk, and the effects of sensitive features. This structured report format is suitable for stakeholders or oversight committees.
A recurring theme among analytics and data science leaders is the concern of not being able to keep up with all of the rapid change taking place – both individually and as a team. In years past, it was possible to stand up an organization largely made up of analytics generalists who would handle initiatives end to end. In today’s world, it is necessary to utilize a range of specialists focused on either specific methods or specific points in the lifecycle of an analytics and data science initiative. This talk will discuss the trends driving the need to evolve your organization’s talent model, new roles you need to consider implementing, and how they all fit together.
With the need to cater to a global audience, there is a growing demand for applications to support speech identification/translation/transliteration from one language to another. This talk starts off with a brief introduction to the topic of Machine Translation (MT), the evolution/application of Machine Translation. The focus will be on Neural Machine Translation (NMT). Then it moves on to introduce some of the typical customer cases and finally ends with how to embed such functionality in your application. The talk will end with a Python based solution demo and introduce the audience to some resources and tools that could help them as they continue to explore the domain.
Join your fellow practitioners in this interactive session where you can exchange approaches to shared challenges and hear how your peers are tackling similar issues.
Predictive Analytics World for Business - Virtual - Day 3 - Wednesday, May 26th, 2021
Grab your real coffee and share experiences virtually with your peers to explore the new challenges of operating in a largely virtual world. Just like pre-show breakfast in a regular conference you’ll join a “round table” with seven fellow attendees and see where the conversation takes you.
Ethics in AI is no longer optional, and profit and human values are not mutually exclusive. As the scale, scope and speed of AI adoption increase, we are seeing more examples of algorithmic discrimination, automated racism, ageism, deepfakes, large-scale manipulation ("nudging"), and other harms. Much of this is unintentional, but were you to deploy such an app, it could severely damage your organization's reputation and top or bottomline. Because no matter what business you are in, we are all in the business of trust. Join this session to lean how to adopt and scale AI ethically and responsibly.
IBM Sales created a machine learning based tool, which provides leaders insights into deals while increasing revenue. Sales Action Quadrants is a novel idea based on machine learning augmented classification technique, which takes into account seller input about opportunity progression in sales cycle then, based on the degree of progression, maps it against an ensemble of machine learning algorithms. The concept that differentiates SAQ from any modeling approach is the sophisticated implementation of ‘ensemble’ approach, based on the ‘No free lunch theorem in machine learning’ which states - ‘there is no one model that works best for every problem’.
Take a break or join your fellow practitioners in this interactive session where you can exchange approaches to shared challenges and hear how your peers are tackling similar issues.
Learn how Barron’s Group is leveraging automation and Machine Learning to produce direct-to-reader content and power dynamic editorial tools. This talk will focus on both the technical infrastructure necessary to automate content, and the innumerable product and editorial decisions made throughout the design, development, and deployment phases.
Over the past year, Barron’s Group has used Wordsmith to manage the text generation component of its automated story systems. Auto-generated stories are templated to summarize relevant stock-market events, and may be triggered within our own system to publish under any conditions specified. Because templates can be designed for any relevant use-case, this system is easily extended to different story topics, languages, and business segments. Barron’s currently produces stories on the state of the market at company close, and plans to expand to publish japanese-language translations, automated flash headlines, and morning stock summaries.
Barron’s Group is also in the process of building a Machine Learning-powered system for editorial assists. This tool will rely on anomaly detection performed with Amazon Sagemaker, and will alert editors when noteworthy events occur. In the short-term, this tool is intended as a time-saver for editors who would otherwise have to manually sift through price and volume data for dozens of stocks. In the long term, this kind of technology could be combined with Wordsmith’s templating functionality to produce entire headlines or articles to send off to editors, for review or publication. Here, we will focus primarily on our current process for automated article publication with Wordsmith, along with our work to this point on the Sagemaker editorial assist tool. We will review the results of our work, and discuss future steps that can be taken to expand upon these systems.
Companies are frequently faced with large amounts of unstructured text data, like forum comments or product reviews. Important trends can emerge in these datasets, but it can be time-consuming to read through comments, and keyword matching frequently misses critical nuances. We'll discuss how we've approached this problem at Google using Natural Language Processing, with examples of the approach applied to open datasets. We'll explore how this fits into the ML project lifecycle, with examples of common pitfalls. Finally, we'll highlight how to use this technology as part of a "human in the loop" approach to supercharge your existing team members.
Predictive Analytics World for Business - Virtual - Day 4 - Thursday, May 27th, 2021
Grab your real coffee and share experiences virtually with your peers to explore the new challenges of operating in a largely virtual world. Just like pre-show breakfast in a regular conference you’ll join a “round table” with seven fellow attendees and see where the conversation takes you.
Join your fellow practitioners in this interactive session where you can exchange approaches to shared challenges and hear how your peers are tackling similar issues.
About 80% of machine learning projects fail – not because the data scientists failed, but because of the many known and unknown unknowns especially of data, but also of business and users. Consequently, the key to success is to “fail early to succeed sooner” i.e., to identify the critical break points and to focus on the use case with high chances of success. The identification of the right use cases is the purpose of a data strategy and data thinking is the process of thinking through the right solution. This interactive session introduces a method for data thinking, called Data Strategy Design. The free Data Strategy Designkit consists of 15 visual tools to collaboratively and interdisciplinary develop data strategies and data products. It is already used by hundreds of companies worldwide and by participating in this deep dive session you will also learn how to apply the method and tools to your projects. For the joint exercise we will use Miro, an online whiteboard platform, and we will work together on an exemplary machine learning project.
While hotels' star ratings are widely recognized and used across the globe, there are no similar rating system exists for the large majority of vacation rentals, making it difficult for guests to search and compare options and hard for vacation rental suppliers to market their product effectively.
In this talk, we will present a Quality Rating System for Vacation Rentals based on Machine Learning, focusing on automated explainable vacation rental quality ratings. This system was successfully deployed and validated through Randomized Controlled Trials at Booking.com, impacting millions of accommodations and guests.
Take a break or join your fellow practitioners in an interactive session where you can exchange approaches to shared challenges and hear how your peers are tackling similar issues.
Modeling the risk of customer churn in large B2B organizations can be tricky. In this session, we will outline our approach to handling that complexity by developing an interpretable model and an interactive front-end tool to highlight risk factors to a team of non-technical stakeholders. We highlight the importance of transparent modeling (as opposed to black-box modeling) in B2B retention, where understanding the risk factors is as important to our customer experience stakeholders as the model itself.
Systemic racism is a complex and multi-dimensional problem that manifests in several different ways. No single narrative or data point can fully capture the pervasive and damaging nature of this crisis. Through our work at Cape Fear Collective, we attempt to decompose the far-reaching impacts of systemic racism into tangible components. The culmination of this work is a Racial Equity Index, which we believe will aid in the development of strategies and solutions to combat systemic racism and forge a more equitable future.
Predictive Analytics World for Business - Virtual - Day 5 - Friday, May 28th, 2021
A global NGO identifies and rehabilitates tens of thousands of survivors of human trafficking and slavery situations. Different situations necessitate different rehabilitation “journeys.” We analyzed case worker notes from check-ins with survivors to help understand how specific programs, and especially “trauma-informed” programs, might be affecting survivors’ emotional wellbeing and likelihood to finish their journeys. We found generally negative associations between survivor sentiment and interactions with the justice system, and generally positive associations with training programs, especially for rebuilding social support networks and economic empowerment. These findings have implications for journey planning as the NGO tries to scale up its worldwide operations.
There are many non-profit organizations trying to help develop and build economic opportunities for families in developing countries around the world by aiding individuals in starting and maintaining small businesses. Vision Fund International is one such organization that uses microfinance loans to jump start business opportunities in these countries. However, these developing countries do not have the credit institutions that countries in the developed world have, which leaves banks restricted on their ability to make loan decisions. Analytical modeling approaches are widely used in major banks in developed countries to efficiently make these decisions. This talk details a project worked on in partnership with Vision Fund International that develops scorecard models for use in loan decisions and the impact it is having on the growth of economies around the world.
Join your fellow practitioners in this interactive session where you can exchange approaches to shared challenges and hear how your peers are tackling similar issues.
Most business leaders claim to be "results driven," an orientation reflected in their embrace of a "pay for performance" philosophy. Many performance measures commonly used are “noisy” or contaminated by situational factors far removed from employees’ control. Hence, they fail to capture the actual value that employees deliver, with unintended, negative consequences for employer and employees.
Using case examples, the presenter will show how advanced analytics can address this problem by adjusting performance measures to remove the effects of random or situational factors. By distinguishing employee “value” from “performance.” organizations can deliver more effective incentives at less cost to shareholders.
Webex Contact Center is a multi-tenanted Cloud Contact Center solution from Cisco, that scales to thousands of concurrent interactions. Improving Agent Performance directly impacts the efficiency and effectiveness of Contact Centers. We are building the next generation AI enabled interaction experiences to create super agents. With the help of AI, agents can now focus on the customer, understand their sentiment and answer questions quickly. AI capabilities that were built include Speech-To-Text transcriptions, sentiment analysis, Question-Answering and Topic Modeling. AI Models by themselves don't become a scalable solution. Serving these models at scale, specifically for a multi-tenanted cloud service, takes additional planning, architectures, capabilities and operations. This session discusses the unique challenges, solutions and takeaways in building and deploying these models and solutions. Key topics addressed are:
- Multi-tenanted Cloud Services: unique considerations
- Model Scaling Challenges for real time
- Architecture options for scaling Models
- Adapting Models to serve at scale
- Tuning Models for Resource & Cost scaling
Take a break or join your fellow practitioners in this interactive session where you can exchange approaches to shared challenges and hear how your peers are tackling similar issues.
Many analytics teams wait too long to start working on deployment. The most common reason they give is that they want to make sure that the model is “good enough” before they start to plan deployment. The truth is that doing so virtually guarantees project failure, or at a minimum, massive delays. Projects don’t fail because of a failure to find patterns in the data. Projects fail because models that fail to align perfectly with organizational priorities will never be embraced by the end-users of the deployed solution.
This session will outline how Google’s Corporate Engineering team is using AI and machine learning to spur innovation within Google. Additionally, Rich will identify the work that his team does (the structure, example use cases etc.), and the research that’s driving the work his team does and the democratization of AI (work in ML Fairness, Privacy, Interpretability and AutoML technologies).