A Better Way to Tackle All That Data


In a world inundated with data, the biggest challenge any organization faces is the time it takes to make informed decisions. We can gather all the data in the world, but if it doesn’t help save a life, allocate resources better, fund the organization, or avoid a crisis, what good is it? The rise of big data is outpacing our ability to conduct research and reach conclusions fast enough, largely due to a shortage of qualified data scientists. At the root of this problem is our concept of what constitutes data. The boundaries of what we can digitize and analyze are expanding daily.
Gartner predicts that the Internet of Things (IoT) will add 50 billion machine voices to today’s 2 billion connected users. This explosion of data means that humans will struggle to manage the process of amassing the correct data and performing the proper analysis. The measure of how long it takes analytics to conclude is often called “time to decision.” If we accept that big data’s holy grail is better, faster decisions, then as data grow in volume, velocity, and variety, making management more complex and potentially slowing decision-making time, something has to give. This problem is crying out for a solution that has long been in development but has only recently become practical and economically feasible enough for widespread adoption — machine learning.
Understanding Machine Learning
Machine learning is a branch of computer science where algorithms learn from and react to data just as humans do. Machine-learning software identifies hidden patterns in data and uses those patterns both to group similar data and to make predictions. Each time new data are added and analyzed, the software gains a clearer view of data patterns and gets closer to making the optimal prediction or reaching a meaningful understanding. It does this by turning the conventional data-mining practice on its head. Rather than scientists beginning with a (possibly biased) hypothesis that they seek to confirm or disprove in a body of data, the machine starts with a definition of an ideal outcome which it uses to decide what data matter and how they should factor into solving problems. If we know the optimal way for something to operate, we can figure out exactly what to change in a suboptimal situation.
For example, a complex system like a commuter train service has targets for the on-time, safe delivery of passengers that present an optimization problem in real-time based on various fluctuating variables, ranging from the weather to load size to even the availability and cost of energy. Machine-learning software onboard the trains themselves can consider all these factors, running hundreds of calculations a second to direct an engineer to operate at the proper speed. The Nest thermostat is a well-known example of machine learning applied to local data. As people turn the dial on the Nest thermostat, it learns their temperature preferences. It begins to automatically manage the heating and cooling, regardless of time and day of the week. The system never stops learning, allowing people to define the optimum continuously.
Machine Learning in Healthcare
Applying machine learning in healthcare is essential to achieving the goal of personalized medicine (the concept that every patient is subtly different and should be treated uniquely). Nowhere is this more easily seen than in cancer treatment, where genomic medicine enables highly customized therapy based on an individual’s type of tumour and myriad other factors. Here machine-learning algorithms help sort the various treatments available to oncologists, classifying them by cost, efficacy, toxicity, etc. As patients are treated, these systems grow in intelligence, learning from outcomes and additional evidence-based guidelines. This leaves oncologists free to optimize treatment plans and share information with their patients.
With the rise of off-the-shelf software, such as LIONsolver, the winner of a recent crowdsourcing contest to find better ways to recognize Parkinson’s disease, machine learning is at last entering the mainstream, available to a wider variety of businesses than the likes of Yahoo, Google, and Facebook that first made big data headlines. More and more companies may now see it as a viable alternative to addressing the rapid proliferation of data, with increasing numbers of data scientists spending more and more time analyzing data.
Machine Learning in Business
Expect to see machine learning used to train supply chain systems, predict the weather, spot fraud, and, especially in customer experience management, help decide what variables and context matter for customer response to marketing. For instance, retail giants like Amazon use machine learning to recommend products to customers based on their browsing and purchase history. This not only enhances the customer experience but also drives sales. Similarly, financial institutions use machine learning to detect fraudulent transactions by identifying unusual patterns in customer behaviour. This ensures that the majority of the day-to-day transaction analysis is handled by the computer, allowing the financial institution to focus on more complex issues.
In the supply chain industry, machine learning can optimize inventory management by predicting demand based on historical data and current trends. This reduces waste and ensures that products are available when and where they are needed. Additionally, machine learning can be used to predict equipment failures, allowing for proactive maintenance and reducing downtime.
Statistics & Tables
To illustrate the impact of machine learning, consider the following statistics:
Healthcare: Machine learning algorithms have been shown to improve diagnostic accuracy by up to 30% in certain cases. For example, a study published in Nature found that machine learning models could detect skin cancer with an accuracy of 95%, compared to 86% for human dermatologists.
Retail: According to a report by McKinsey, retailers using machine learning for personalized recommendations have seen a 20-35% increase in sales.
Finance: A study by the Federal Reserve found that machine learning models could detect fraudulent transactions with an accuracy of 98%, significantly reducing the financial losses associated with fraud.