TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

RESEARCH & RESOURCES

A Sentiment-al Education: Text Analytics Comes of Age

Text analytics is more than just sentiment analysis. It is also being used to enable churn analysis, fraud detection, risk analysis, warranty analysis, medical research, and other non-traditional use cases.

By Stephen Swoyer
October 22, 2013

Getting started with text analytics can seem daunting, if not mystifying.

A new report from TDWI Research aims to make it less so, describing applicable text analytic use cases as well as strategies for developing and implementing a text analytic program.

This means taking text analytics beyond its core use cases -- sentiment and customer experience analysis. According to Fern Halper, research director for advanced analytics at TDWI Research, text analytics is increasingly used for applications other than these bread-and-butter use cases.

"Text analytics is being used across industries in numerous ways, including customer-focused solutions such as voice of the customer, churn analysis, and fraud detection," writes Halper, author of How to Gain Insight from Text, the latest release in TDWI Research's "Checklist Reports" series.

"Many early adopters have used the technology to better understand customer experience, and this is still one of the most popular use cases," she acknowledges, noting that "text analytics is also being used in other areas such as risk analysis, warranty analysis, and medical research."

With this in mind, how does one get started? The good news is that text analytics depends less on specialized software and expertise than it used to. For one thing, most business intelligence (BI) vendors ship limited text analytic features with their tools. Microsoft Corp., for example, exposes a wizard-driven front end for SQL Server Analysis Services (namely, SQL Server Data Tools, or SSDT) that automates the steps of selecting and preparing data sources -- including semi-structured text sources.

Most other BI platforms -- including SAP BusinessObjects, IBM Cognos, WebFOCUS from Information Builders Inc., MicroStrategy, Oracle Business Intelligence Enterprise Edition, QlikView, SAS, and Tableau -- incorporate (limited) self-service text-analytic features, too. Stepping up to text analytics doesn't have to entail a huge commitment or capital outlay, such as purchasing a solution from a specialty vendor like SAS or IBM and hiring the requisite -- and typically costly -- talent to use it.

There's a difference between stepping up to text analytics -- e.g., by using text in one-off projects or as a component of BI/analytic discovery -- and developing a mature text analytic program.

Halper's report addresses the latter requirement. She outlines a pragmatic approach for developing a text analytic practice and getting started with text analytics. "It generally makes sense to pick an initial problem that has relatively high visibility and where it is fairly easy to get at the data. If possible, it should be a quick win that uses a proof of concept," she writes.

Halper points out that the selection of a high-visibility problem -- preferably one with the promise of tangible ROI -- "will earn a seat at the executive table, which can help to keep momentum high." The POC is important, she explains, because it "ensure[s] that the technology you're using works with your specific data."

Depending on their needs, adopters must distinguish between general-purpose text analytics and built-for-purpose text analytics products, Halper says, citing the surfeit of available sentiment analysis and customer experience improvement offerings.

"Another factor to consider as part of the business case is whether the solution is multi-purpose," she writes. "For example, there are numerous products on the market that use text analytics to gain insight into social media to understand customer opinions and sentiment. It is important to think beyond the first use case and consider your options wisely: i.e., point solutions versus more robust, integrated solutions."

Halper's report addresses a total of nine checklist items, including the importance of proactively determining data access, timeliness, and security requirements; the role of data visualization in text analytics; the use of sentiment analysis; and more advanced uses of text analytics.

She also considers the problem of accurately identifying so-called "text features" for extraction. These consist of entities, such as the names of persons, companies, or products; geographical locations; dates or times; themes, such as important phrases or words/concepts that occur or co-occur with one another; and concepts, such as words or phrases that have semantic significance.
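To make the idea of "themes" as co-occurring words more concrete, here is a minimal Python sketch that counts how often pairs of terms appear in the same document. It is an illustration only, not a method taken from Halper's report: the sample comments, the stopword list, and the function names are all assumptions made for the example.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical stopword list, kept tiny for the example.
STOPWORDS = {"the", "a", "is", "and", "was", "my", "after", "to", "of"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def cooccurrence_counts(documents):
    """Count, for each pair of terms, the number of documents containing both."""
    counts = Counter()
    for doc in documents:
        # Use a sorted set so each pair is counted once per document
        # and always appears in the same (alphabetical) order.
        terms = sorted(set(tokenize(doc)))
        counts.update(combinations(terms, 2))
    return counts


# Made-up customer comments, for illustration only.
docs = [
    "The battery died after a week",
    "Battery replaced under warranty",
    "Warranty claim for a dead battery",
]

pairs = cooccurrence_counts(docs)
print(pairs.most_common(3))
```

Pairs that recur across many documents (here, "battery" together with "warranty") are candidates for a theme. Real products would typically add stemming, phrase detection, and statistical weighting on top of raw counts.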

"The goal is to accurately extract the entities, concepts, themes, and sentiment in which you are interested," Halper writes, explaining that different text analytic tools address this problem in different ways: "A vendor might include only a dictionary, list of names, or synonym list. Another might support hierarchical taxonomies to better organize information. The disadvantage of any purely list-based or taxonomic solution is that you're limited to finding what's in the list."
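The limitation Halper describes for list-based extraction can be shown in a few lines of Python. This is a hedged sketch rather than any vendor's implementation: the product names, the dictionary, and the `extract_listed_entities` helper are hypothetical and exist only for this example.

```python
import re

# Hypothetical lookup list; a real deployment would load this from a
# maintained dictionary or taxonomy.
PRODUCT_DICTIONARY = {"acme router", "acme modem"}


def extract_listed_entities(text, dictionary):
    """Return the dictionary entries that occur as whole phrases in the text."""
    lowered = text.lower()
    found = []
    for entry in sorted(dictionary):
        # Word boundaries prevent matching inside longer words.
        if re.search(r"\b" + re.escape(entry) + r"\b", lowered):
            found.append(entry)
    return found


# Made-up customer comment mentioning one listed and one unlisted product.
comment = "My Acme Router keeps dropping, and the new Acme Hub is no better."

matches = extract_listed_entities(comment, PRODUCT_DICTIONARY)
print(matches)
```

The extractor finds "acme router" because it is in the list, but it misses "Acme Hub" entirely until someone adds that entry by hand, which is the gap that model-based and hybrid approaches aim to close.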

In this respect, she concludes, text analytic technologies are getting both more sophisticated and more usable. "Some vendors now incorporate statistical models based on machine learning into their solutions to help users extract features that were not preconfigured. Vendors that provide models often pre-train them so users don't need to do anything but simply use the model. Some vendors provide hybrid approaches -- statistical and rules-based -- which provide the benefits of collection investigation combined with the specificity that comes from linguistic rules."

You can download a copy of Halper's report here. (A short registration is required if you are downloading a free TDWI report for the first time.)
