TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

Think
- Research & Resources
  - TDWI Playbook | Next Generation Data Science: The AI-Driven Data Science Life Cycle
  - TDWI Data Points | The Data Foundation for AI
  - TDWI Best Practices Report | Data Strategies and Foundations for Modern Data Management
  - TDWI Insight Accelerator | Adopting a Platform Approach for Gaining Insights from Unstructured Data
- Webinars
  - Expert Panel: What's Next in Data Integration: Powering the AI-Driven Enterprise August 25, 2025
  - Expert Panel: Improving Data Quality, Accuracy, and Consistency August 27, 2025
  - The State of Self-Service Analytics: Results from TDWI’s Latest Research September 8, 2025
  - Expert Panel: Building an AI-Driven Data Strategy September 15, 2025
- Virtual Summits
  - Virtual Events Keys to Making Your Data AI Ready September 10, 2025
  - Virtual Events Data Quality for BI, Analytics and AI October 22, 2025
  - Virtual Events Modern Data Strategy November 12, 2025
  - Virtual Events What’s Ahead in 2026 for Data & Analytics December 10, 2025
- By Topic
  - By Topic
    
    Explore the Latest AI, Analytics, and Data Research and Training by Topic
  - BI, Analytics, and Data Literacy
  - AI, Data Science, and Machine Learning
  - Data Management and Governance
  - Platforms and Architecture
  - Strategy and Methods
- Speaking of Data Podcast
  
  Current Research Surveys
Train
- In-Person Events
  - Conference TDWI Transform 2025 San Diego August 18, 2025
  - Executive Summit TDWI Modern Data Leader's Summit San Diego: AI in the Enterprise August 18, 2025
  - Conference TDWI Transform 2025 Orlando November 16, 2025
  - Executive Summit TDWI Data & AI Leaders Summit Orlando: Governing Data, Analytics, and AI November 17, 2025
- Virtual Live Seminars
  - Data Governance Week July 30, 2025
  - Platforms & Architecture Week July 30, 2025
  - AI Bootcamp Week July 30, 2025
- Online Learning
- By Topic
  - By Topic
    
    Explore the Latest AI, Analytics, and Data Research and Training by Topic
  - BI, Analytics, and Data Literacy
  - AI, Data Science, and Machine Learning
  - Data Management and Governance
  - Platforms and Architecture
  - Strategy and Methods
- Train Your TeamCustom solutions for training your team
  
  Get CertifiedEarn a professional credential in BI and Analytics, Data Governance, or AI
  
  TDWI MembershipExclusive access to the research, tools, training, and connections
Engage
- Connect
  - Connect and Contribute to Our Vibrant Community of Data Leaders
    
    Subscribe to TDWI Stay up to date on the latest news and events. Sign Up
    
    Become a TDWI Member Gain exclusive access to the research, tools, training, and connections to move your careers, teams, and projects forward. Learn More
    
    Become a Part of the TDWI Research Panel Make a difference in the data and analytics industry and earn incentives by sharing your insights with TDWI. Explore Now
    
    Speak at TDWI Events Share your expertise and build your personal brand as a speaker at a TDWI In-Person or Virtual Event. Submit a Proposal
    
    Become a TDWI Research Fellow Apply to be a member of TDWI’s industry leading research team. Apply Today
    
    Become a Member of the Data & AI Leaders Forum Engage in collaborative discussions, stay ahead of the curve, and stay in the know. Apply Now
    
    Showcase Your Data & AI Solutions Reach and engage with TDWI community through multi-channel marketing programs. Learn More

TDWI Articles

Putting Predictive Analytics into Practice in the Real World (Part 1 of 2)

Predictive analytics doesn't have to be mystifying. Here are the basics you need to know.

By Shanif Dhanani
September 21, 2020

The AI revolution is in full swing as firms in virtually all industries are using AI techniques to boost their bottom lines. An MIT Sloan Management Review survey-based report found that enterprises increased AI spending by 62 percent in 2019, and more organizations are expected to invest in AI in the coming years.

For Further Reading:

AI and Predictive Analytics: Myth, Math, or Magic?

Predictive Analytics and Employee Retention: A Winning Combination

Putting Analytics and AI in Context for Better Outcomes

However, techniques such as predictive analytics can be seen as "rocket science" by nontechnical teams, leaving money on the table. In spite of massive investment, many AI initiatives fail. In fact, MIT Sloan Management Review also reports that "seven out of 10 companies surveyed report minimal or no impact from AI so far." AI is difficult to understand and even harder to implement.

My goal in this article is to demystify predictive analytics. In Part 2, I'll explore how nontechnical teams can implement predictive analytics.

An Intro to Predictive Analytics

Predictive analytics is, as the term implies, a way to predict future outcomes based on historical data. Data that is relevant to the problem is used as the input or the source of new knowledge.

For instance, an HR manager may be interested in applying predictive analytics to employee attrition because talent is the most valuable asset of any company. If you were to guess whether an employee would leave your company, you might look at things like an employee's job satisfaction, performance reports, how many days they take off, or even how far they live from the office.

A predictive analytics model would do the same thing using a mathematical function, as opposed to a gut feeling. There are many types of algorithms used in predictive modeling, but a common one for tasks such as this would be a decision tree.

You've used decision trees in your own life, even if you haven't realized it. A decision tree is simply a set of sequential, hierarchical decisions that lead to some final result. For example, you might be deciding whether to go to the park or to the cinema. It might depend on whether or not it's sunny, whether your friends are available to meet, whether you want to meet with your friends, what movies are showing, and so on.

Predictive analytics uses historical data to make predictions. To build a decision-tree model, a data scientist feeds in historical "training data," which is simply the data relevant to the problem at hand (such as employees' job satisfaction). This data set contains "labels" -- the KPI(s) you're interested in (in the case of employee attrition, whether each employee quit).

The decision tree is created as the training data is divided by various factors (for example, possibly splitting employees between high and low job satisfaction and then dividing each group by length of commute). The tree shape is created from the sequence of these factors and their relationship to the label (e.g., what percentage of low satisfaction/long commute employees quit compared to other possible combinations of factors).

The same principles apply to other predictive analytics use cases, such as analyzing churn to increase customer lifetime value (CLV). For example, a telecom company interested in reducing churn might use a decision tree that relies on data such as the customer's tenure, whether they have multiple telephone lines, their age, and their type of contract.

Using a Predictive Model

After a data scientist creates a predictive model, typically using a programming language such as Python or R, they then deploy it so a user can make predictions.

This can be done with a mix of complex tools such as Kubernetes and Google Cloud Platform, which would each require its own series of articles to explain. Suffice it to say, once you have a deployed model, you can enter data and receive a prediction in return.

Suppose you have a predictive employee attrition model. A manager could enter a current employee's job satisfaction, performance report data, how many days they take off, and so on, and the model will calculate the probability that the employee will quit.

The manager could also make an aggregate prediction and use data from all employees to estimate recruitment costs for the next year.

Again, the same principles apply for any use case. Consider another example: a telecom company that wants to predict churn would enter data from a current customer, such as their tenure, age, and contract type. The telecom could also make aggregate predictions using data from all customers to estimate overall churn and profit.

Where Predictive Analytics Falls Short

Although it's clear that the ability to predict the future is useful for any industry, there are times when predictive analytics falls short.

Because predictive analytics relies on past data, we run into trouble when the data is inaccurate, biased, or of generally low quality.

For instance, if a start-up wants to predict employee attrition, but hasn't conducted many performance reports or surveys, then there isn't much past data to base a prediction on, and it will be difficult to build an accurate model.

Further, an organization may have biased data, which would lead to a biased predictive model. One infamous example is a model built by Amazon that scored job candidates to accelerate hiring. Because the tech industry, including Amazon, has historically been male-dominated, the training data taught the algorithm that male candidates were preferable.

Just as it would be difficult for you to personally predict the weather if you weren't able to look at the sky or wanted to predict the weather in a location you don't know anything about, predictive models have a hard time making accurate predictions if they don't have complete, relevant data.

Learning the tools, such as Python, R, Kubernetes, and GCP, can take years, which is why many companies hire specialized data scientists to handle predictive modeling.

In Part 2 of this series, I'll explore exactly how nontechnical teams can implement predictive analytics -- with years of training no longer required.

About the Author

Shanif Dhanani is the cofounder and CEO of Apteo and a former data scientist and software engineer at Twitter. You can reach Shanif via email, @shanif on Twitter, or Shanif Dhanani on LinkedIn.

TDWI Membership

Accelerate Your Projects,
and Your Career

TDWI Members have access to exclusive research reports, publications, communities and training.

Individual, Student, and Team memberships available.

↑

TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

Research & Resources

Webinars

Virtual Summits

By Topic

In-Person Events

Virtual Live Seminars

Online Learning

By Topic

Connect and Contribute to Our Vibrant Community of Data Leaders

TDWI Articles

Putting Predictive Analytics into Practice in the Real World (Part 1 of 2)

Related Articles

Trending Articles

Breaking Barriers in Conversational BI/AI with a Semantic Layer

AI in 2025: Key Considerations for Technology Leaders

The Tech Blanket: Building a Seamless Tech Ecosystem

What’s Ahead in Generative AI in 2025? (Part Two)

TDWI Membership

Accelerate Your Projects,
and Your Career

TDWI

Engage

Research

Research & Resources

Webinars

Virtual Summits

By Topic

In-Person Events

Virtual Live Seminars

Online Learning

By Topic

Connect and Contribute to Our Vibrant Community of Data Leaders

TDWI Articles

Putting Predictive Analytics into Practice in the Real World (Part 1 of 2)

Related Articles

Trending Articles

Breaking Barriers in Conversational BI/AI with a Semantic Layer

AI in 2025: Key Considerations for Technology Leaders

The Tech Blanket: Building a Seamless Tech Ecosystem

What’s Ahead in Generative AI in 2025? (Part Two)

TDWI Membership

Accelerate Your Projects, and Your Career

TDWI

Engage

Research

Accelerate Your Projects,
and Your Career