Our data center machines, due to all the information we feed them, are getting smarter. How can you use machine learning to your advantage?
Machine learning is a key part of how big data brings operational intelligence into our organizations. But while machine learning algorithms are fascinating, the science gets complex very quickly. We can’t all be data scientists, but IT professionals need to learn about how our machines are learning.
We are increasingly seeing practical and achievable goals for machine learning, such as finding usable patterns in our data and then making predictions. Often, these predictive models are used in operational processes to optimize an ongoing decision-making process, but they can also provide key insight and information to inform strategic decisions.
The basic premise of machine learning is to train an algorithm to predict an output value within some probabilistic bounds when it is given specific input data. Keep in mind that machine learning techniques today are inductive, not deductive — it leads to probabilistic correlations, not definitive conclusions.
The process of building these algorithms is called predictive modeling. Once you have mastered such a model on your data, you can sometimes examine it directly for insight into that original data, and/or apply the model to new data to predict something important. Broadly, a model’s output can be a classification of something, a likely outcome, a hidden relationship or attribute or an estimate of value.
Typically, machine learning techniques predict a value that is categorical, such as a label, color, membership or quality. For example, does this subject belong to a set of customers that we should try to retain, or that will buy something, or that will respond favorably to an offer?
Predictions can also be numerical if we are concerned with estimating quantities or value on a continuous scale. The output type determines the best learning method, and affects the measurements we use to judge the quality of the modeling.
Machine learning methods are either supervised or unsupervised. The difference isn’t whether the algorithms are free to misbehave, but rather whether they learn from training data that has the true outcome available — previously determined and added to the data set to provide supervision — or instead try to discover any natural patterns within a given set of data. Most business use predictive modeling, exploit supervised methods on training data, and usually aim to predict if a given instance — an email, person, company, or transaction — belongs to an interesting category — spam, likely buyer, good for credit, gets follow-up offer.
This excerpt is from TechTarget. To view the whole article click here.
Mike Matchett is a senior analyst and consultant at Taneja Group.