Predictive Analytics Times
Exclusive Highlights
Doppelganger Discovery: How Baseball Sabermetrics Inspires Predictive Analytics
 This author will present at Predictive Analytics World, Oct 29 –...
Wise Practitioner – Predictive Analytics Interview Series: Leslie Barrett at Bloomberg L.P.
 In anticipation of her upcoming conference presentation, Crowd-Sourcing and...
Why Data Science Argues against a Muslim Ban
 From the perspective of data science, a Muslim ban...
Wise Practitioner – Predictive Analytics Interview Series: Andrew Burt at Immuta
 In anticipation of his upcoming conference presentation, Regulating Opacity:...
Wise Practitioner – Predictive Analytics Interview Series: Feyzi Bagirov at Becker College
 In anticipation of his upcoming conference presentation, Acquisition Funnel...
Wise Practitioner – Predictive Analytics Interview Series: Jack Levis at UPS
 In anticipation of his upcoming keynote conference presentation, UPS’...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Richard Semmes at Siemens PLM
 In anticipation of his upcoming Predictive Analytics World Manufacturing Chicago,...
Wise Practitioner – Predictive Analytics Interview Series: Edward Shihadeh at Auspice Analytics, LLC
 In anticipation of his upcoming conference presentation, How to...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Emily Pelosi at CenturyLink
 In anticipation of her upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Holly Lyke-Ho-Gland and Michael Sims at APQC
 In anticipation of their upcoming conference co-presentation, Change Management...
Wise Practitioner – Predictive Analytics Interview Series: Natasha Balac at Data Insight Discovery, Inc.
 In anticipation of her upcoming conference co-presentation, Identifying Unique...
Wise Practitioner – Predictive Analytics Interview Series: Bryan Bennett at Northwestern University
 In anticipation of his upcoming conference presentation, Cross-Enterprise Deployment: ...
Wise Practitioner – Predictive Analytics Interview Series: David Talby at Atigeo
 In anticipation of his upcoming conference presentation, Semantic Natural...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Haig Nalbantian at Mercer
 In anticipation of his upcoming Predictive Analytics World for...
Book Review: Weapons of Math Destruction by Cathy O’Neil
 Originally published in Analytics Magazine Book: Weapons of Math...
Wise Practitioner – Predictive Analytics Interview Series: Angel Evan at Angel Evan, Inc.
In anticipation of his upcoming conference co-presentation, Identifying Unique...
Wise Practitioner – Predictive Analytics Interview Series: Paul Speaker at The Dow Chemical Company
 In anticipation of his upcoming conference presentation, Creating an...
Wise Practitioner – Predictive Analytics Interview Series: George Iordanescu at Microsoft
 In anticipation of his upcoming conference presentation, Predictive Analytics...
Wise Practitioner – Predictive Analytics Interview Series: Afsheen Alam at Allstate Insurance
 In anticipation of her upcoming conference presentation, Our Success...
Wise Practitioner – Predictive Analytics Interview Series: Jennifer Bertero at CA Technologies
 In anticipation of her upcoming conference presentation, Redefining Analytics...
Wise Practitioner – Predictive Analytics Interview Series: Michael Dessauer at The Dow Chemical Company
 In anticipation of his upcoming conference presentation, Listening Down...
Wise Practitioner – Predictive Analytics Interview Series: Steven Ulinski at Health Care Service Corporation
 In anticipation of his upcoming conference presentation, Challenges of...
Wise Practitioner – Predictive Analytics Interview Series: Lauren Haynes at The University of Chicago
 In anticipation of her upcoming conference presentation, Data Science...
Wise Practitioner – Predictive Analytics Interview Series: Daqing Zhao at Macy’s
 In anticipation of his upcoming conference presentation, Macy’s Advanced...
Wise Practitioner – Predictive Analytics Interview Series: Thomas Schleicher at National Consumer Panel
 In anticipation of his upcoming conference presentation, Combining Inferential...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Kevin Zhan at The Advisory Board
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Halim Abbas at Cognoa
 In anticipation of his upcoming conference presentation, Early Screening...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Ben Taylor at HireVue
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Employee Life Time Value and Cost Modeling
 Understanding the Most Expensive Asset Practically every business shares...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Andrew Marritt at OrganizationView GmbH
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Interview with Eric Siegel: Popularizing Predictive Analytics with Song and Dance
  Originally published in l’ADN (in French) Hilarious consultant,...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Sue Lam at Shell
 In anticipation of her upcoming Predictive Analytics World for Workforce conference...
Case Study: Hotel Occupancy Forecasting’s Big Payoff
 This Predictive Analytics story started with a question as...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Mike Rosenbaum at Arena
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Darryl Humphrey at Alberta Blue Cross
 In anticipation of his upcoming conference presentation, Claim Pattern...
The Evolving State of Retail Analytics in CRM
 The Traditional State The world of retail has undergone...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Feyzi Bagirov at 592 LLC and Harrisburg University of Science and Technology
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Craig Soules at Natero
 In anticipation of his upcoming conference presentation, Using Predictive...
Sound Data Science: Avoiding the Most Pernicious Prediction Pitfall
  In this excerpt from the updated edition of...
Wise Practitioner – Predictive Analytics Interview Series: Ashish Bansal and John Schlerf from Capital One
 In anticipation of their upcoming conference co-presentation, The Quest...
Wise Practitioner – Predictive Analytics Interview Series: Kristina Pototska at TriggMine
 In anticipation of her upcoming conference presentation, 7 Examples...
Wise Practitioner – Predictive Analytics Interview Series: Frédérick Guillot at The Co-operators General Insurance Company
 In anticipation of his upcoming conference presentation, Defining Optimal...
Predictive Analytics vs. Prescriptive Analytics
 We have all heard and seen the diagrams that...
Interview with Prof. Dr. Wil van der Aalst, Eindhoven University of Technology
 Exclusive interview with Prof. Wil van der Aalst who...
Data Story Telling: Bringing Life to Your Data
 There is no doubt that a successful Data Scientist...
Contextual Experience Innovation
  [Title Image Abbreviations: CRM – Customer Relationship Management,...
Are Random Variables a Fact of Life in Predictive Models?
 In some of the more recent literature, discussion has...
Managing Shifting Priorities in Exploratory Data Science Projects
 After working with a client’s data for over three...
Breaking into Analytics: 5 “Musts” for your Career Transition
 In our data-rich society, corporations of all types and...
How Predictive Analytics Can Fuel Innovation for Manufacturing
 Industry leaders like to use the term “culture” to...
Rexer Analytics Data Science Survey – Highlights (New)
  White Paper with 2015 survey results available now....
How Can Predictive Analytics Help Your Bank or Fintech Company?
 Predictive analytics encompasses a powerful set of methods that...
The Role of Feature Engineering in a Machine Learning World
 Artificial Intelligence(AI) continues to be the next great topic...
The Expansive Deployment of Predictive Analytics: 22 Examples
  The future is the ultimate unknown. It’s everything...
Nine Bizarre and Surprising Predictive Insights from Data Science
  Data is the world’s most potent, flourishing unnatural...
The Trick to Predictive Analytics: How to Bridge the Quant/Business Culture Gap
  This article is excerpted from Eric Siegel’s foreword...
Wise Practitioner – Predictive Analytics Interview Series: Robin Thottungal at U.S. Environmental Protection Agency
 In anticipation of his upcoming conference keynote presentation, 21st...
How Hillary for America Is (Almost Certainly) Using Uplift Modeling
  In this article, I provide evidence that Hillary...
Wise Practitioner – Predictive Analytics Interview Series: Miguel Castillo at U.S. Commodity Futures Trading Commission
 In anticipation of his upcoming conference co-presentation, Words that...
Wise Practitioner – Predictive Analytics Interview Series: Michael Berry of TripAdvisor Hotel Solutions
 In anticipation of his upcoming keynote co-presentation, Picking the...
Exploring the Toolkits of Predictive Analytics Practitioners — Part 2
 Continuing on our discussion from last month on toolkits...
The Danger of Playing It Safe
 Research shows that people tend to be overly risk...
Manufacturing Operations: Machine Learning to Separate Actionable Trends from False Alarms
 Predictive analytics is increasingly becoming the object of value...
Predictive Analytics Basics: Six Introductory Terms and The Five Effects
 Here are six key definitions—and The Five Effects of...
Wise Practitioner – Predictive Analytics Interview Series: Ken Yale at ActiveHealth Management
 In anticipation of his upcoming keynote co-presentation at Predictive...
Wise Practitioner – Predictive Analytics Interview Series: Frank Fiorille at Paychex, Inc.
 In anticipation of his upcoming conference presentation, Risk Management...
The Real Reason the NSA Wants Your Data: Predictive Law Enforcement
 The NSA can leverage bulk data collection with predictive...
Wise Practitioner – Predictive Analytics Interview Series: Scott Zoldi at FICO
 In anticipation of his upcoming conference keynote presentation, Fraud...
Wise Practitioner – Predictive Analytics Interview Series: Thomas Klein at Miles & More GMbH
 In anticipation of his upcoming conference co-presentation, Using Predictive...
Book Review: Predictive Analytics for Newcomers and Nontechnical Readers
 The book reviewed in the article, Predictive Analytics: The...
Wise Practitioner – Predictive Analytics Interview Series: Meina Zhou at Bitly
 In anticipation of her upcoming conference presentation, Predictive Analytics...
Wise Practitioner – Predictive Analytics Interview Series: Dr. Shantanu Agrawal at Centers for Medicare & Medicaid Services
 In anticipation of his upcoming conference keynote presentation, Implementing...
Infographic – Discover Predictive Analytics World for Business 2016
 Predictive Analytics World continues to grow – take a...
Exploring the Tool kits of Predictive Analytics Practitioners — Part 1
 Tools, tools, and more tools continue to explode in...
The Power of Data Science for Predictive Maintenance is Only Just Being Tapped
 Future of Automotive Servicing and Preventive Maintenance Several months...
Wise Practitioner – Predictive Analytics Interview Series: Madhusudan Raman at Verizon
 In anticipation of his upcoming conference presentation, Best Practices...
Need a Data Scientist? Try Building a ‘DataScienceStein’
 Organizations are finding that hiring qualified Data Scientists is...
Wise Practitioner – Predictive Analytics Interview Series: Sanjay Gupta at PNC Bank
 In anticipation of his upcoming conference co-presentation, Predictive Analytics...
Wise Practitioner – Predictive Analytics Interview Series: Brian Reich, Former Director at The Hive
 In anticipation of his upcoming conference presentation, The Data...
AnalyticOps: A New Organizational Role So Your Company Can Monetize Analytics
 There is no doubt that data science–and predictive analytics–...
Wise Practitioner – Predictive Analytics Interview Series: Gary Neights at Elemica
 In anticipation of his upcoming conference presentation, Predicting Behavior...
Getting Started with Predictive Analytics – an Interview with Eric Siegel
 Data science and predictive analytics are top of mind...
Wise Practitioner – Predictive Analytics Interview Series: Dr. Sarmila Basu at Microsoft Corporation
 In anticipation of her upcoming conference presentation, Predictive &...
Are Pre-hire Talent Assessments Part of a Predictive Talent Acquisition Strategy?
  Over the past 30+ years, businesses have spent...
Wise Practitioner – Predictive Analytics Interview Series: Dae Park and Vijay D’Souza at Government Accountability Office (GAO)
 In anticipation of their upcoming conference co-presentation, Characteristics for...
Wise Practitioner – Predictive Analytics Interview Series: Dean Abbott of SmarterHQ
 In anticipation of his upcoming conference presentation, The Revolution...
Opportunities and Challenges: Predictive Analytics for IoT
 There is a clear sense in the marketplace today...
Feature Engineering Within the Predictive Analytics Process — Part Two
 In the last article, I discussed the concept of...
HBO Teaches You How to Avoid Bad Science
 Do you know what p-hacking is? John Oliver –...
Jim Sterne’s Book Review of “Predictive Analytics” by Eric Siegel
 Book review originally published in the journal Applied Marketing...
The Big Picture: Today’s Data Analytics Stack
 Enterprises are inundated with data from social, mobile, IoT...
Taking Action on Technical Success: A Fable of Data Science and Consequences
 Note: This story is fiction, but it is based...
Analytics is (often) a Faith-Based Business
 If you follow data science topics in various social...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Chris Labbe at Seagate Technology
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Peter Frankwicz at Elmet Technologies
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
Wise Practitioner – Text Analytics Interview Series: Dirk Van Hyfte at InterSystems Corporation
 In anticipation of his upcoming conference co-presentation, Personalized Medicine...
Wise Practitioner – Text Analytics Interview Series: Michael Dessauer and Justin Kauhl at The Dow Chemical Company
 In anticipation of their upcoming conference co-presentation, Understanding our...
Women in Data Science
 The field of Data Science is booming, yet comparatively...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Edward Crowley at The Photizo Group, Inc.
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
Boosting Performance of Machine Learning Models
  People often get stuck when they are asked...
Wise Practitioner – Predictive Analytics Interview Series: Tanay Chowdhury at Zurich North America
 In anticipation of his upcoming conference presentation, Deep Learning...
Feature Engineering within the Predictive Analytics Process — Part One
 What is Feature Engineering One of the growing discussions...
The Executive’s Guide to Employee Attrition
 Much has been written about customer churn – predicting...
Wise Practitioner – Predictive Analytics Interview Series: Lawrence Cowan at Cicero Group
 In anticipation of his upcoming conference presentation, Data Driven...
Wise Practitioner – Text Analytics Interview Series: John Herzer and Pengchu Zhang at Sandia National Laboratories
 In anticipation of their upcoming conference co-presentation, Enhancing search...
Wise Practitioner – Text Analytics Interview Series: Emrah Budur at Garanti Technology
 In anticipation of his upcoming conference presentation, Tips and...
Wise Practitioner – Predictive Analytics Interview Series: Thomas Schleicher at National Consumer Panel
 In anticipation of his upcoming conference presentation, Using Predictive...
Ghosts in the Data, Constructing Data Entities
 Data Entities are seldom discussed concepts that primarily hide...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Dr. Matteo Bellucci at General Electric
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
HR’s First Predictive Project? Pre-hire Candidate Screening
 Corp recruiters have a very important and difficult job....
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Gary Neights at Elemica
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Jeffrey Banks at The Applied Research Laboratory at The Pennsylvania State University
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
Wise Practitioner – Text Analytics Interview Series: Frédérick Guillot at Co-operators General Insurance Company
 In anticipation of his upcoming conference presentation, Leveraging Hands...
Wise Practitioner – Predictive Analytics Interview Series: Alice Chung at Genentech
 In anticipation of her upcoming conference co-presentation, Utilizing Advanced...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Carlos Cunha at Robert Bosch, LLC
 In anticipation of his upcoming Predictive Analytics World for Manufacturing...
5 Common Mistakes Multi-Channel Retailers Make, and How to Avoid Them
  Multi-channel retailers are often finding themselves stuck in...
Three Critical Definitions You Need Before Building Your First Predictive Model
 Portions excerpted from Chapter 2 of his book Applied...
Measurement and Validation: An Often Underrated Aspect within the Predictive Analytics Discipline
 In our Big Data world, software applications and programming...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Haig Nalbantian at Mercer
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Pasha Roberts at Talent Analytics, Corp.
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Improving Word Clouds as Tool for Text Analytics Data Visualization
 Rich Lanza will present Using Letter Analytic Techniques to...
Dr. Data’s Music Video: The Predictive Analytics Rap
 With today’s release of “Predict This!” – the rap...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Geetanjali Gamel from MasterCard
 In anticipation of her upcoming Predictive Analytics World for Workforce conference...
Mid-Life Journey to Data Science
 Data Science has been hailed as the sexiest job...
Wise Practitioner – Predictive Analytics Interview Series: Dr. Patrick Surry of Hopper
 In anticipation of his upcoming keynote conference presentation, Buy...
What are you Predicting in Customer Retention?
 Customer Retention models are arguably the most valuable models...
Wise Practitioner – Predictive Analytics Interview Series: Ken Elliott at Hewlett Packard Enterprise
 In anticipation of his upcoming conference presentation, Operationalizing Analytics:...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Holger Mueller at Constellation Research
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Lawrence Cowan at Cicero Group
 In anticipation of his upcoming conference presentation, Predicting the...
Hey FinTech, What’s Your Strategy for Leveraging Unstructured Data?
 Financial technology has sparked a global wave of startups...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Raffael Devigus at F. Hoffmann-La Roche AG
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Rebecca Pang at CIBC
 In anticipation of her upcoming conference presentation, Driving the...
Employee Engagement – a Tricky Metric for Predictive Analytics
 Our work focuses on using predictive analytics to decrease...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Daniil Shash at Eleks
 In anticipation of his upcoming Predictive Analytics World for...
The Information Age’s Latest Move: Four Predictive Analytics Developments for 2016
 Originally published in Big Think Prediction is in the...
Why Do We Stop Asking Why?
 I’ve lived through this phenomenon first hand. The environment...
Predictive Analytics and the Internet of Things
 As technology continues to empower our ability to conduct...
Wise Practitioner – Predictive Analytics Interview Series: Mario Vinasco at Facebook
 In anticipation of his upcoming conference presentation, Advanced Experimentation...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Vishwa Kolla at John Hancock Insurance
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
In Predictive Analytics, Coefficients are Not the Same as Variable Influence, Part II
 In my last post, “Coefficients are not the same...
Wise Practitioner – Predictive Workforce Analytics Interview Series: John Lee at Equifax Workforce Solutions
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Peter Bull at DrivenData
 In anticipation of his upcoming conference presentation, Predicting Restaurant...
The “Predictive Analytics” FAQ — What’s New in the Updated Edition and Who’s The Book for?
 This is the preface to Eric Siegel’s newly-released Revised...
Wise Practitioner – Predictive Analytics Interview Series: Matt Bentley at CanIRank.com
 In anticipation of his upcoming conference presentation, Predicting Online...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Lisa Disselkamp and Tristan Aubert at Deloitte
 In anticipation of their upcoming Predictive Analytics World for Workforce conference...
The Data Scientist’s Dilemma: Does Skipping Breakfast Kill You?
 Would skipping breakfast kill you? Not necessarily—but confusing correlation and causation...
Predictive Analytics Can Help with the Challenges Facing Manufacturing in the 21st Century
 Historically, data and analytics have been key to the...
Wise Practitioner – Predictive Analytics Interview Series: Nate Watson at Contemporary Analysis
 In anticipation of his upcoming conference presentation, Predictive Sales...
Customer Experience Predictions for 2016
 As we look ahead and see 2016 unfurling in...
Predictive Analytics Book Excerpt: Hands-On Guide—Resources for Further Learning
 Here is the Hands-On Guide that appears at the...
Wise Practitioner – Predictive Analytics Interview Series: Hans Wolters at Microsoft
 In anticipation of his upcoming conference presentation, Predicting User...
Machine Learning: Not Necessarily a New Phenomenon in Predictive Analytics
 One of the more recent topics gaining traction in...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Frank Fiorille at Paychex, Inc.
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Netflix, Dark Knowledge, and Why Simpler Can Be Better
 Weary from an all-night coding effort, and rushed by...
The Case Against Quick Wins in Predictive Analytics Projects
 When beginning a new predictive analytics project, the client...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Jason Noriega at Chevron
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Matthew Pietrzykowski at General Electric
 In anticipation of his upcoming conference co- presentation, Advanced Analytics...
B2B Predictive Analytics: An Untapped Sector
 Much work in predictive analytics and data science has...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Greg Tanaka at Percolata
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Michael Li at The Data Incubator
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Four Ways Data Science Goes Wrong and How Test-Driven Data Analysis Can Help
 If, as Niels Bohr maintained, an expert is a...
In Predictive Analytics, Coefficients are Not the Same as Variable Influence
 When we build predictive models, we often want to...
Oracle’s Ten Enterprise Big Data Predictions for 2016
 Companies big and small are finding new ways to...
Personalities That Are Barriers to Model Deployment (And How to Partner With Them) Part III: The Expert
 So you have gathered your data and completed your...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Kathy Doan at Wells Fargo Bank
 In anticipation of her upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Jonathon Frampton at Baylor Scott & White Health
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Mobile Analytics-Mining the Visit Experience of the Customer
 Mobile technology as part of the Big Data discussion...
The Devil’s Data Dictionary – Making Fun of Big Data
 Buy it on Amazon When Stéphane Hamel coined the...
Wise Practitioner – Predictive Workforce Analytics Interview Series: Ben Waber of Humanyze
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
The Quest for Unicorns
 Will there be enough data scientists in the future?...
Most Swans are White: Living in a Predictive Society
 In anticipation of the forthcoming Revised and Updated, paperback...
Hiring? Approving Mortgages? It’s the Same Thing
 Imagine that Chris wants to buy a house and...
Personalities That Are Barriers to Model Deployment (And How to Partner With Them) Part II: The Skeptic
 So you have gathered your data and completed your...
5 Types of Analytics in Business: One to Go After and One to Avoid
 I have been lucky enough to work in some...
The Beginner’s Guide to Predictive Workforce Analytics
 Human Resources Feels Pressure to Begin Using Predictive Analytics...
Predictive Modeling Forensics: Identifying Data Problems
 Excerpted and modified from Chapters 3 and 4 of...
Five Wins for Retail with Predictive Analytics
 We’ve heard a lot about how big data is...
Faster Credit Scoring Dev With Specialized Binning Code – R Package
 Introduction One of the main concerns in a credit...
Good Predictions != Good Decisions
 A Fateful Tale Ted is having a rough week...
Using Predictive Analytics to Bring Retailers Closer to Their Customers
 Based on the amount of retailers that have been...
Visualization: Panacea for Building Analytics Solutions?
 Data,data,data everywhere and what do I do with it....
Five Challenges in Using Predictive Analytics to Improve Patient Outcomes
 In the increasingly patient-centric world of healthcare, predictive analytics...
Personalities That Are Barriers to Model Deployment (And How to Partner With Them) Part I: The Early Adopter
 So you have gathered your data and completed your...
Five Ways Predictive Analytics Will Shape the Future of Advertising
 Predictive analytics sounds almost mystical, and in a way,...
Five Ways Predictive Analytics Can Improve Patient Outcomes
 The use of analytics in healthcare is gaining momentum...
Wise Practitioner – Predictive Analytics Interview Series: Scott Lancaster at State Street Corp.
 In anticipation of his upcoming conference presentation, Predictive Analytics...
Can Employee Development Lead to Business Mediocrity?
 Our predictive workforce assignments yield staggering results; saving /...
Empathy and Data Science: A Fable of Near-Success
 Editor’s Note: While the story is fiction, the events...
Wise Practitioner – Predictive Analytics Interview Series: Jeff Butler at IRS Research, Analysis, and Statistics organization
 In anticipation of his upcoming conference presentation, The Changing...
A Look at How Big Data is Changing Sports on the Field and in the Press Box
 While major rules rarely change, everything else about professional...
Wise Practitioner – Predictive Analytics Interview Series: Dr. Satyam Priyadarshy at Halliburton
 In anticipation of his upcoming conference presentation, Challenges in...
Automation: Friend or Foe to the Predictive Analytics Practitioner
 Technologies and Big Data continue to bombard our working...
Winning Roles: Moneyball 2.0, for your Hiring and Succession Planning Processes
 Business can learn a lot from sports in terms...
Wise Practitioner – Predictive Analytics Interview Series: Werner Britz at RCS Group
 In anticipation of his upcoming conference presentation, Recoveries: External...
Wise Practitioner – Predictive Analytics Interview Series: Dr. Michael Dulin, Carolinas Healthcare System
 In anticipation of his upcoming keynote conference presentation at...
Wise Practitioner – Predictive Analytics Interview Series: Benjamin Uminsky, Los Angeles County
 In anticipation of his upcoming conference presentation, Mining the...
Wise Practitioner – Predictive Analytics Interview Series: Jessica Taylor of St. Joseph Healthcare
 In anticipation of her upcoming conference co-presentation at Predictive...
Wise Practitioner – Predictive Analytics Interview Series: COL William Saxon, Department of the Army
 In anticipation of his upcoming conference presentation, From Wisdom...
Defensive Data Science: What we can Learn from Software Engineers
  To view this content OR subscribe for free...
Wise Practitioner – Predictive Analytics Interview Series: Patty Larsen, Co-Director, National Insider Threat Task Force
  To view this content OR subscribe for free...
Wise Practitioner – Predictive Analytics Interview Series: Bin Mu at MetLife
 In anticipation of his upcoming conference presentation, Establishing Value:...
Wise Practitioner – Predictive Analytics Interview Series: Michael Berry of TripAdvisor
 In anticipation of his upcoming conference presentation, Picking the...
Wise Practitioner – Predictive Analytics Interview Series: Catherine Templeton, PAWGOV Keynote Speaker
 In anticipation of her upcoming keynote conference presentation, Reforming...
Wise Practitioner – Predictive Analytics Interview Series: William Wood of St. Joseph Healthcare
 In anticipation of his upcoming conference co-presentation at Predictive...
The Key to Modelling Success-The Variable Selection Process (Part 2)
 Last month, I discussed the importance of variable selection...
Wise Practitioner – Predictive Analytics Interview Series: Madhusudan Raman at Verizon
 In anticipation of his upcoming conference presentation, Predicting Behavioral...
Wise Practitioner – Predictive Analytics Interview Series: Scott Jelinsky of Pfizer, Inc.
 In anticipation of his upcoming conference presentation at Predictive...
Wise Practitioner – Predictive Analytics Interview Series: Chris Franciskovich at OSF Healthcare System
  To view this content OR subscribe for free...
Wise Practitioner – Predictive Analytics Interview Series: Philip O’Brien at Paychex
  To view this content OR subscribe for free...
Wise Practitioner – Predictive Analytics for Healthcare Interview Series: Daniel Chertok at NorthShore University HealthSystem
  To view this content OR subscribe for free...
Wise Practitioner – Predictive Analytics Interview Series: Herman Jopia of American Savings Bank
 In anticipation of his upcoming conference presentation, Driving Superior...
Stop Hiring Data Scientists Until You’re Ready for Data Science
  To view this content OR subscribe for free...
How to manage projects in Predictive Analytics
 In the previous five years, the analytical scene has...
Wise Practitioner – Predictive Analytics Interview Series: Lawrence Cowan of Cicero Group
  To view this content OR subscribe for free...
Defining Measures of Success for Cluster Models
  To view this content OR subscribe for free...
Good luck placing Analytics in an org chart
  To view this content OR subscribe for free...
Retail Predictive Analytics Solves the Missing Link in Cross Selling, Up Selling, and Suggestive Selling
  To view this content OR subscribe for free...
Sameer Chopra’s Hotlist of Training Resources for Predictive Analytics
  To view this content OR subscribe for free...
Wise Practitioner – Predictive Analytics Interview Series: John Smits of EMC
  To view this content OR subscribe for free...
Be a Data Detective
  To view this content OR subscribe for free...
Predicting Employee Flight Risk: My Take
  To view this content OR subscribe for free...
The Key to Modelling Success -The Variable Selection Process (Part 1)
  To view this content OR subscribe for free...
Predictive Analytics World in Color [Infographic]
  To view this content OR subscribe for free...
Space Alien Eager to Convey Thoughts on Data Science
  To view this content OR subscribe for free...
Defining Measures of Success for Predictive Models
 Excerpted from Chapters 2 and 9 of his book...
Overstatement of Results in Predictive Analytics
  To view this content OR subscribe for free...
The Biggest Lever to Success in Predictive Analytics
  To view this content OR subscribe for free...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Field Cady at Think Big Analytics
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Jeffrey Thompson of Robert Bosch, LLC
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
Wise Practitioner – Manufacturing Predictive Analytics Interview Series: Kumar Satyam of PricewaterhouseCoopers, LLP
 In anticipation of his upcoming Predictive Analytics World for Manufacturing conference...
White Paper – Immediate Access
Thank you for your interest in the white paper,...
Predicting Rare Events In Insurance
 As we all know, predictive analytics is a discipline,...
Python, Predictive Analytics & Big Data oh my!
 Python has seen significant growth in utilization in the...
Wise Practitioner – Predictive Analytics Interview Series: Thomas Schleicher of National Consumer Panel
  To view this content OR subscribe for free...
Leveraging Open Data: Improve Customer Experience and Drive New Market Opportunities
  To view this content OR subscribe for free...
From Code to Reports with knitr & Markdown
  To view this content OR subscribe for free...
Predictive Analytics Optimizes Prices and Markdowns for Retail
  To view this content OR subscribe for free...
Wise Practitioner – Predictive Analytics Interview Series: Delena D. Spann of US Government
  To view this content OR subscribe for free...
Infographic – PAW SF
5-Minute Recap
 In San Francisco this past March and April, Predictive...
Wise Practitioner – Predictive Analytics Interview Series: Dr. Patrick Surry of Hopper
 In anticipation of his upcoming conference presentation, Buy or...
Wise Practitioner – Predictive Analytics Interview Series: Viswanath Srikanth of Cisco
 In anticipation of his upcoming conference co-presentation, Building a...
Wise Practitioner – Predictive Analytics Interview Series: Jack Levis of UPS
 In anticipation of his upcoming conference keynote presentation, UPS Analytics...
Trust in Analytics Work: Why it’s Needed and How to Build It
 Much has been written about data-driven decision making. Someone...
Guiding Principles to Build a Demand Forecast
 Demand forecasting is one of the most challenging fields...
Wise Practitioner – Predictive Analytics Interview Series: Arcangelo Di Balsamo of IBM
 In anticipation of his upcoming conference presentation, Applied Predictive...
Wise Practitioner – Predictive Analytics Interview Series: Dean Abbott of Smarter Remarketer
 In anticipation of his upcoming conference presentation, The Revolution...
Predictive Analytics in Sports
 The world of sports has seen exponential increases in...
5 Things I Learned at Predictive Analytics World for Workforce
 As a trained researcher, I’ve always been fascinated with...
Predictive Analytics as a Strategic HR Solution
 This interview is the second in a series on...
Visualizations Get Some Snap from R Shiny
 “Numbers have an important story to tell. They rely...
What Programming do Predictive Modelers Need to Know?
 In most lists of the most popular software for...
Wise Practitioner – Workforce Predictive Analytics Interview Series: John Callery at AOL
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Analytic Professionals — Share your views: Participate in the Rexer Analytics 2015 Data Miner Survey
 Data Analysts, Predictive Modelers, Data Scientists, Data Miners, and...
Charlie Batch and the Cost of Obfuscation
 The hot Florida sun shone down on fans, coaches,...
Predictive Analytics for Insurance Risk: A New Level of Data Scrutiny-Part 2-Development and Implementation
 For more on the application of predictive analytics and...
Is Big Data Better?
 Big data is usually defined as lots of records...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Chad Harness at Fifth Third Bank
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Building the Optimal Retail Assortment Plan with Predictive Analytics
 After making a financial plan for the year ahead,...
Using Advanced Clustering Techniques to Better Predict Purchasing Behaviors in Targeted Marketing Campaigns
 Today’s forward-thinking retailers are seeking relevant, agile and intelligent...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Patrick Coolen of ABN-AMRO Bank
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Using Predictive Analytics to Predict and Manage Business Travel Burnout
 For more on the application of predictive analytics for...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Scott Mondore at Strategic Management Decisions, LLC
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
Wise Practitioner – Predictive Analytics Interview Series: Bob Bress of Visible World
 In anticipation of his upcoming conference presentation, TV Audience...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Holger Mueller of Constellation Research, Inc.
 In anticipation of his upcoming Predictive Analytics World for Workforce keynote...
The Imminent Future of Predictive Modeling
 Predictive modeling tools and services are undergoing an inevitable...
Using Statistics and Visualization as Complementary Validation
 I don’t often find myself thinking or saying common...
Predictive Analytics for Insurance: A New Level of Data Scrutiny
 For more on the application of predictive analytics and...
The Three-Legged Stool of an Analytics Project
 Perhaps you have heard rumors going around that analytics...
Defining Measures of Success for Predictive Models
 Excerpted from Chapter 2 of Mr. Abbott’s book Applied...
Effective Framing of Predictive Analytic Projects
 For more from James Taylor, see his presentation on...
Wise Practitioner – Predictive Analytics Interview Series: Richard Boire of Boire Filler Group
 In anticipation of his upcoming conference presentation, Predicting Extreme...
Wise Practitioner – Predictive Analytics Interview Series: Sarah Holder of Duke Energy
 In anticipation of her upcoming conference presentation, It’s Not...
Infographic – Predictive Analytics World by the Numbers
 Predictive Analytics World continues to grow by popular demand....
Wise Practitioner – Predictive Analytics Interview Series: Mohamad Khatib of Nielsen
 In anticipation of his upcoming conference presentation, Pizza Analytics...
A Critical Step Toward Organizational Data Maturity: Thinking in Terms of Distributions!
 For more from the Josh Hemann, see his presentation...
It is a Mistake to…. Answer Every Inquiry
 (Part 9 (of 11) of the Top 10 Data...
Wise Practitioner – Predictive Analytics Interview Series: Dominic Fortin of TD Insurance
 In anticipation of his upcoming conference presentation, A Success...
5 Ways Retail Predictive Analytics helps Fashion Retailers Maximize Gross Margin
 Fashion retailers have one of the most dynamic environments...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Pasha Roberts at Talent Analytics
 In anticipation of his upcoming conference presentation, A Transaction-Based...
Predictive Modeling Skills: Expect to be Surprised
 Excerpted from Chapter 1 of Mr. Abbott’s book Applied...
Big Data: To Analyze or Not to Analyze
 Much has been written about Big Data and how...
Wise Practitioner – Predictive Analytics Interview Series: Bryan Guenther of RightShip
 In anticipation of his upcoming conference presentation, The Impact...
Wise Practitioner – Predictive Analytics Interview Series: Aaron Lanzen of Cisco
 In anticipation of his upcoming conference presentation, Integrating Predictive...
Wise Practitioner – Predictive Analytics Interview Series: David Schey of Digitas
 In anticipation of his upcoming conference presentation, Uplift Modeling...
It is a Mistake to…. Extrapolate
 (Part 8 (of 11) of the Top 10 Data...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Carl Schleyer of 3D Results
 In anticipation of his upcoming Predictive Analytics World for Workforce conference...
The Trouble with Numbers
 Previous discussions in other publications have often revolved around...
Wise Practitioner – Predictive Analytics Interview Series: Josh Hemann of Activision
 In anticipation of his upcoming conference presentation, Cheating Detection in...
Practical Predictive Modeling: Quick Variable Selection
 This post is largely excerpted from Dean Abbott’s book...
It is a Mistake to…. Discount Pesky Cases
 (Part 7 (of 11) of the Top 10 Data...
Wise Practitioner – Workforce Predictive Analytics Interview Series: Scott Gillespie, Managing Partner of tClara
 In anticipation of his upcoming Predictive Analytics World for...
Want to Improve Your Prototype-to-Production Analytics Process? Embrace Thinking Inside the Box
 Anyone in the business of analytics knows that the...
Using Decision Trees in Variable Creation: Minimizing Information Loss-Part 1
 Numerous articles have been written about the use of...
Predictive Analytics turns Multi Channel Retailing into Omni Channel Retailing.
 Multi Channel retailing has existed for a very long...
Eric Siegel Discusses Predictive Analytics and Civil Liberties on KCRW’s Radio Show
 Last week on “To the Point,” an NPR-syndicated radio...
Wise Practitioner – Predictive Analytics Interview Series: Dean Abbott, Smarter Remarketer
 In anticipation of his upcoming conference keynote and workshops...
Wise Practitioner – Predictive Analytics Interview Series: Elpida Ormanidou of Walmart
 In anticipation of Predictive Analytics World for Workforce, March...
Future of Analytics: Big Data Integration, Transforming Organizations and Processes, Providing Speed and Foresight
 With Analytics being a buzzword, most business executives have...
Haystacks and Needles: Anomaly Detection
 Anomalies vs Outliers Anomaly detection, or finding needles in...
Auditing the Data When Deploying Predictive Analytics Solutions
 Much of the discussion in the predictive analytics discipline...
Leveraging Dark Data: Q&A with Melissa McCormack
 Melissa McCormack,Research Manager at predictive analytics research firm Software...
Wise Practitioner – Predictive Analytics Interview Series: Nephi Walton, M.D., Washington University/University of Utah
 In anticipation of his upcoming conference presentation at Predictive...
Using Predictive Modeling Algorithms for Non-Modeling Tasks
 It is obvious what predictive modeling algorithms like decision...
Wise Practitioner – Predictive Analytics Interview Series: Greta Roberts of Talent Analytics
 In anticipation of her upcoming keynote conference presentation at...
Wise Practitioner – Predictive Analytics Interview Series: George Savage, M.D., Proteus Digital
 In anticipation of his upcoming keynote conference presentation at...
From Human Screen to Machine: Predictive Analytics Helps Avoid a Major Point of Hiring Failure
 What is an employer’s most business-critical corporate process? At...
Predictive Analytics in Health Care: Helping to Navigate Uncertainties and Change
 A recent billion-dollar forecasting error in Walgreen’s Medicare-related business...
The Power of Predictive Analytics for Retail Replenishment
 Replenishment is an essential process in the retail supply...
Voice of the HR Profession: “Charts and Graphs are Hard to Follow”
 In early August, our Director of Marketing reached out...
Wise Practitioner – Predictive Analytics Interview Series: John Cromwell, M.D., University of Iowa Hospitals & Clinics
 In anticipation of his upcoming keynote conference presentation at...
Creating the All-important Analytical File-The Key Step in Building Successful Predictive Analytics Solutions
 In the data audit process, canned routines are programmed...
Wise Practitioner – Predictive Analytics Interview Series: Linda Miner, Ph.D., Southern Nazerene University
 In anticipation of her upcoming conference presentation at Predictive...
It is a Mistake to…. Accept Leaks from the Future
 (Part 6 (of 11) of the Top 10 Data...
Wise Practitioner – Predictive Analytics Interview Series: Marty Kohn, M.D. of Jointly Health
 In anticipation of his upcoming conference keynote at Predictive...
What’s the Government’s Role in Big Data Surveillance?
 What if predictive analytics could help prevent acts of...
Wise Practitioner – Predictive Analytics Interview Series: John Foreman of MailChimp
  In anticipation of his upcoming conference keynote at...
Is Predictive Analytics Insidious? National Radio Interview
 Interviewing Eric Siegel on the radio, a Hollywood big...
5 Reasons Predictive Analytics World for Workforce is Different – And Better
  If you follow the workforce analytics space, you...
Should Employee Analytics “Go Fishing” or Solve Business Problems?
 Over the years, our firm has had many discussions...
Wise Practitioner – Predictive Analytics Interview Series: Sameer Chopra of Orbitz
 In anticipation of his upcoming conference keynote at Predictive...
Wise Practitioner – Predictive Analytics Interview Series: Jack Levis of UPS
 In anticipation of his upcoming conference keynote presentation, UPS Analytics...
Connecting the Experts with the Data Scientists
 “Can Machines Think?” was the cover of Time magazine...
Why analysts should master public speaking
 Industry leader and consultant Geert Verstraeten serves as program...
Defining the Target Variable in Predictive Analytics- A Not so Easy Process
 Defining a target variable is one of the preliminary...
Book Review of “Applied Predictive Analytics” by Dean Abbott
  Industry leader and author Dean Abbott will be...
Recognizing and Avoiding Overfitting, Part 1
 In my last two posts I described why overfitting...
Webinar: Towards Solving Employee Attrition: Cost Modeling
 Presented by: Pasha Roberts, Chief Scientist, Talent Analytics, Corp....
It is a Mistake to…. Listen Only to the Data
 (Part 5 of 11 of the Top 10 Data...
The Data Audit Process (Part 1)-The Initial Step in Building Successful Predictive Analytics Solutions
 Building predictive analytics solutions is very much in-vogue for...
10 Practical Actions that Could Improve Your Model
   (adapted from Chapter 13 of the Handbook of...
The Great Analytical Divide: Data Scientist vs. Value Architect
 In the analytics space, it is quite common for...
Employee Churn 202: Good and Bad Churn
 Our prior article on this venue began outlining the...
Why Overfitting is More Dangerous than Just Poor Accuracy, Part II
 In part one, I described one problem with overfitting...
Predictive Analytics is the Answer to Smart Fulfillment and Omni-Channel Retailing
 Over the past 5 years there have been several...
Employee Churn 201: Calculating Employee Value
 Much has been written about customer churn – predicting...
Why Overfitting is More Dangerous than Just Poor Accuracy, Part I
 Arguably, the most important safeguard in building predictive models...
5 Ways to Become Extinct as Big Data Evolves
 The need to adopt sophisticated data analytics has become...
It is a Mistake to…. Ask the Wrong Question
 (Part 4 (of 11) of the Top 10 Data...
What Role can Network Analysis play in Business Intelligence?
 Network analysis is an emerging Business Intelligence technique that’s...
The Data Behind Data Scientists: Top Kaggle Performers
 Kaggle, an online platform that hosts data analytics competitions,...
It’s Predictive Analytics, not Forecasting!
 This is my final article for this year. It’s...
A Good Business Objective Beats a Good Algorithm
 Predictive Modeling competitions, once the arena for a few...
Retail Predictive Analytics for Price Optimization & Markdown Management
 There is no doubt that price is one of...
It is a Mistake to…. Rely on One Technique
 (Part 3 of 11 of the Top 10 Data...
The Musings of a (Young) Data Scientist
I quit my job as a Mathematical Statistician after...
Big Data Continued…
Big Data is not a singular concept but rather...
How to Calculate the Optimal Safety Stock using Retail Predictive Analytics.
In a perfect world, a retailer knows exactly how...
The Role of Analysts After Model Deployment
Last month I made the case for discussing model...
How predictive analytics will power the internet of things
Recently, Nissan Motor announced that they will
Prediction Isn’t Just About Stocks. Predictive Persuasion
Prediction isn’t just for the stock market. Trading is...
The Greatest Power of Big Data: Predictive Analytics
Every day’s a struggle. I’ve faced some tough challenges...
It is a Mistake to… Focus on Training Results
(Part 2 of 11 of the Top 10 Data...
7 Ways Predictive Analytics Helps Retailers Manage Suppliers
One of the most challenging aspects of the retail...
Why Don’t We Talk about Deployment?
The Cross Industry Standard Process for Data Mining...
Understanding Predictive Analytics: A Spotlight Q&A with Eric Siegel, author of Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die
This BeyeNETWORK spotlight features Ron Powell's interview with Eric...
Predictive Analytics: “Freakonomics” Meets Big Data
While writing my book, Predictive Analytics: The Power...
5 Reasons to Not Care About Predictive Analytics
Technology: complex and alienating, or promising and fascinating?...
SHARE THIS:

2 years ago
What Programming Do Predictive Modelers Need to Know?

 

In most lists of the most popular software for doing data analysis, statistics, and predictive modeling, the top software tools are Python and R—command line languages rather than GUI-based modeling packages. There are several reasons for this, perhaps most importantly that they are free, they are robust programming languages supported by a very broad user community, and they have extensive sets of algorithms.

One recent survey of software was published at r4stats.com (http://r4stats.com/articles/popularity/) and contains additional metrics not usually found in software comparisons, including scholarly articles that include software, google scholar hits, and job trends in addition to the more typical summaries by user-identified use of software such as with the Rexer Analytics surveys (http://www.rexeranalytics.com/Data-Miner-Survey-Results-2013.html) and polls on kdnuggets.com and reviews of software by technology research companies such as Gartner (http://pages.alteryx.com/GartnerMQAdvancedAnalyticsNowAvailable-T.html), Forrester (http://global.sap.com/campaign/na/usa/CRM-XU13-BIP-PATDWS/index.html?urlid=CRM-XU13-BIP-PATDWS), and Hurwitz & Associates (http://www.sas.com/content/dam/SAS/en_us/doc/analystreport/hurwitz-advanced-analytics-107212.pdf). Thanks to r4stats for providing these links in their article.

For those interested in getting a job in analytics, the article provides very useful rankings of software by the number of job postings, led by Java, SAS, Python, C/C++/C#, R, SPSS, and Matlab. They also provide a few examples of the trending in the job postings of the tools over the past 7 years—important trending to consider as well. This reveals for example a nearly identical increase in Python and R compared with a decrease in SAS over the past few years (SAS is still #2 overall in job postings though because of the huge SAS install base).

The user interface that appears to have won the day in commercial software for predictive modeling is the workflow-style interface where a user connects icons that represent functions or tasks into a flow of functions. This kind of interface has been in use for decades, and one that I was first introduce to in the software package Khoros / Cantata in the early 90s (http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.22.9854&rep=rep1&type=pdf). Clementine was an early commercial tool using this paradigm (now IBM Modeler), and now most tools, including those that have historically used drop-down “Windows-style” menus, are embracing a workflow interface. And it’s not just commercial tools that are built from the ground up using a workflow-style interface: many open source software like KNIME and RapidMiner embraced this style from their beginnings.

Even though workflow interfaces have won the day, there are several excellent software tools that still use a typical drop-down file menu interface for a variety of reasons: some legacy and some functional. I still use several of them myself.

There are several reasons I like the workflow interface. First, it is self-documenting, much like command-line interfaces. You see exactly what you did in the analysis, at least at a high level. To be fair, some of these nodes have considerable critical customization options set inside the nodes. Second, you can reuse the workflow easily. For example, if you want to run your validation data through the exact same data preparation steps you used in building your models, you merely connect a new data source to the workflow. Third, you can explain what you did to your manager very easily and visually without the manager needing to understand code.

Another way to think of the workflow interface is as a visual programming interface. You string together functional blocks from a list of functions (nodes) made available to you by the software. Command line programming does largely the same thing: you have a set of functions available to you that you string together. For example, you may want to load a csv file, replace missing values with the mean, transform your positively-skewed variables with a log transform, split you data into training and testing subsets, then build a decision tree. Each of these steps can be done with a particular function.

From this perspective, the biggest difference between visual programming and command line programming is that R and Python have a larger set of functional available to you. From an algorithm standpoint, this is primarily a difference in less-used or cutting-edge algorithms that are available in R or Python but not in most commercial. This is one important reason why most visual programming interface tools have added R and Python integration into their software, typically through a node that will run the external code within the workflow itself. The intent isn’t to replace the software, but to enhance it with functions not yet added to the software itself. This is especially the case with leading-edge algorithms that have support in R or Python already because of their ties with the academic community.

Personally, I used to create code regularly for building models, primarily in C and FORTRAN (3rd generation languages) though also in other scripting (4th generation) languages like unix shell programming (sh, csh, ksh, bash), Matlab, Mathematica, and others. But eventually I used commercial software tools because my consulting clients used them, and they contained most of what I needed to do to solve data mining problems. Since they all have limited sets of functions, I would have to sometimes make creative use of the existing functions to accomplish what I needed to do, but it didn’t stop me from being successful with them. And I didn’t have to write and re-write code for each of these clients.

Each of these tools have their own way of performing an analysis. Command line tools also have their own way of performing an analysis. Much of what separates novices and experts in a software tool is not an awareness of the particular functions or building blocks, but an understanding of how best to use the existing building blocks. This is why I recommend analysts learn a tool and learn it well, becoming an expert in the tool so that the tool is used to its fullest potential.

NOTE: as of the posting of this article, the Rexer Analytics 2015 survey is still open. If you haven’t participated in the survey, go to http://www.rexeranalytics.com/Data-Miner-Survey-2015-Intro2.html and use the code 7A9A52 when prompted.

Author Bio:

Dean Abbott is Co-Founder and Chief Data Scientist of Smarter Remarketer, Inc., and President of Abbott Analytics, Inc. in San Diego, California. Mr. Abbott is an internationally recognized data mining and predictive analytics expert with over two decades of experience applying advanced data mining algorithms, data preparation techniques, and data visualization methods to real-world problems, including fraud detection, risk modeling, text mining, personality assessment, response modeling, survey analysis, planned giving, and predictive toxicology.

2 thoughts on “What Programming Do Predictive Modelers Need to Know?

  1. Thank you, Dean, for reminding people to participate in the 2015 Data Miner Survey (http://www.rexeranalytics.com/Data-Miner-Survey-2015-Intro2.html). We will keep it open through May. And I completely agree with you — anyone interested in this topic of programming languages for predictive modeling will want to immediately go read Bob Muenchen’s very thorough analyses at r4stats.com (http://r4stats.com/articles/popularity/).
    — Karl Rexer (President, Rexer Analytics)

     
  2. Pingback: What Programming do Predictive Modelers Need to Know? | Learn programming language-Create a new programLearn programming language-Create a new program

Leave a Reply