TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

Think
- Research & Resources
  - TDWI Playbook | Next Generation Data Science: The AI-Driven Data Science Life Cycle
  - TDWI Data Points | The Data Foundation for AI
  - TDWI Best Practices Report | Data Strategies and Foundations for Modern Data Management
  - TDWI Insight Accelerator | Adopting a Platform Approach for Gaining Insights from Unstructured Data
- Webinars
  - Data Integration for AI: Overcoming Modern Pipeline Challenges July 23, 2025
  - From Silos to Insights: Centralizing Data to Drive AI July 24, 2025
  - Expert Panel: Leveraging AI-Powered Solutions for Data Management July 28, 2025
  - A Generative AI Framework for Credit and Financial Markets July 29, 2025
- Virtual Summits
  - Virtual Events Keys to Making Your Data AI Ready September 10, 2025
  - Virtual Events Data Quality for BI, Analytics and AI October 22, 2025
  - Virtual Events Modern Data Strategy November 12, 2025
  - Virtual Events What’s Ahead in 2026 for Data & Analytics December 10, 2025
- By Topic
  - By Topic
    
    Explore the Latest AI, Analytics, and Data Research and Training by Topic
  - BI, Analytics, and Data Literacy
  - AI, Data Science, and Machine Learning
  - Data Management and Governance
  - Platforms and Architecture
  - Strategy and Methods
- Speaking of Data Podcast
  
  Current Research Surveys
Train
- In-Person Events
  - Conference TDWI Transform 2025 San Diego August 18, 2025
  - Executive Summit TDWI Modern Data Leader's Summit San Diego: AI in the Enterprise August 18, 2025
  - Executive Summit AI Accelerate 2025, Brought to You by AI Boadroom & TDWI August 18, 2025
  - Conference TDWI Transform 2025 Orlando November 16, 2025
- Virtual Live Seminars
  - TDWI Data Governance Principles and Practices: Managing Data as an Asset June 25, 2025
  - Building Your Company’s Data Governance Roadmap June 25, 2025
  - Data Governance: Driving Engagement and Organizational Change June 26, 2025
  - A Framework for Modern Data Governance June 25, 2025
- Online Learning
- By Topic
  - By Topic
    
    Explore the Latest AI, Analytics, and Data Research and Training by Topic
  - BI, Analytics, and Data Literacy
  - AI, Data Science, and Machine Learning
  - Data Management and Governance
  - Platforms and Architecture
  - Strategy and Methods
- Train Your TeamCustom solutions for training your team
  
  Get CertifiedEarn a professional credential in BI and Analytics, Data Governance, or AI
  
  TDWI MembershipExclusive access to the research, tools, training, and connections
Engage
- Connect
  - Connect and Contribute to Our Vibrant Community of Data Leaders
    
    Subscribe to TDWI Stay up to date on the latest news and events. Sign Up
    
    Become a TDWI Member Gain exclusive access to the research, tools, training, and connections to move your careers, teams, and projects forward. Learn More
    
    Become a Part of the TDWI Research Panel Make a difference in the data and analytics industry and earn incentives by sharing your insights with TDWI. Explore Now
    
    Speak at TDWI Events Share your expertise and build your personal brand as a speaker at a TDWI In-Person or Virtual Event. Submit a Proposal
    
    Become a TDWI Research Fellow Apply to be a member of TDWI’s industry leading research team. Apply Today
    
    Become a Member of the Data & AI Leaders Forum Engage in collaborative discussions, stay ahead of the curve, and stay in the know. Apply Now
    
    Showcase Your Data & AI Solutions Reach and engage with TDWI community through multi-channel marketing programs. Learn More

TDWI Articles

Moving the Textual Analytics Cheese

There are plenty of natural obstacles to dealing with text, but those who feel threatened by advances in textual processing are inventing artificial obstacles as well.

By Bill Inmon
May 23, 2016

Text is hard enough as it is. Text has double meanings, innuendo, improper syntax, and slang to deal with. Text has antecedents and precedents, and it comes in many languages. There is enough chaos and complexity found in text to challenge any system of textual analysis. We don’t need any more obstacles, but some people are trying to deny progress in text processing.

For years text has been a great challenge for data analytics. Text exists in the enterprise in vast supply, but text is almost perfectly resistant to any form of computerized analysis.

The computer has proven to be amazingly effective when it comes to structured data, but when it comes to text, the computer shrivels and hides in a corner. The unpredictable and imperfect nature of text remains impervious to most attempts to computerize it effectively, despite the many applications and opportunities for using text.

New technology now exists -- textual disambiguation -- that promises to bring text into the world of normal computer processing. With textual disambiguation, unstructured text can be turned into ordinary structured data so the great and powerful computer can handle it. The computer demands that data be structured, and now it is possible to take unstructured text and turn it into structured data.

What do we find accompanies this turn of events? We find people are uncomfortable with the thought that there has been a sea change in their industry. People resist change, whatever the change might be. People don’t like to have their cheese moved. People think their cheese belongs where it has always has been.

Having text and doing nothing with it was good enough for my father, so it is good enough for me. Don’t bother me with reality and facts and progress. I don’t want to hear it. Leave my cheese where it is!

Years ago when ETL came onto the scene, programmers of the day resisted the idea that one could automate writing transformation code. To the programmer of the day, humans wrote code and that was that. Programmers were threatened by the thought that a machine could write accurate and useful transformation code (and transformation is precisely what ETL does).

Some programmers gave transformations to the vendor that were so complex and so horrendous that humans had to write specialized code to perform the transformation. In doing so, programmers proved to management that ETL could not work. The strange thing was that management believed them, at least at first.

What the programmers didn’t tell management was that ETL solutions could easily write 99 percent of transformations. Of course, there are always some transformations so complex that they cannot be written automatically, but using that 1 percent of complex transformations as proof that ETL would not work was a stupid thing to do. Programmers who misrepresented their transformations just didn’t want someone coming in and moving their cheese.

The same thing is happening with text and text analytics today. The possibilities are changing and some humans are reacting predictably. They don’t like their cheese moved.

The other day I was at a conference and a speaker said he could prove that textual, analytical processing does not work. The gentleman offered a sentence that was very complex and full of ambiguities and said, “See, the computer cannot understand or make sense of this sentence. Therefore textual disambiguation does not work.”

You know what? The gentleman was partially right. There are sentences that are so obscure, so devious, so twisted that no amount of textual disambiguation will ever unravel the sentence. Does that mean that disambiguation does not work? Not at all.

You see, most sentences are straightforward. In normal conversation, it is rare to find sentences that are tortuously complex. Consider writers. Yes, there was William Faulkner and Henry James, both famous for their difficult and obscure form of writing. On the other hand, for every Faulkner and James there is an Ernest Hemingway, Danielle Steele, John Grisham, Mark Twain, and a whole host of other writers who write in an understandable, clear, concise fashion. Most writers actually want you to understand what is being said.

It is disingenuous and artificial for the world to offer examples of strange and uncommon speech and use them to prove textual disambiguation does not work. Time will prove that the textual analytics cheese has been moved despite the artificial arguments offered up by those who would stand in the way.

About the Author

Bill Inmon has written 54 books published in 9 languages. Bill’s company -- Forest Rim Technology -- reads textual narrative and disambiguates the text and places the output in a standard data base. Once in the standard data base, the text can be analyzed using standard analytical tools such as Tableau, Qlikview, Concurrent Technologies, SAS, and many more analytical technologies. His latest book is Data Lake Architecture.

TDWI Membership

Accelerate Your Projects,
and Your Career

TDWI Members have access to exclusive research reports, publications, communities and training.

Individual, Student, and Team memberships available.

↑

TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

Research & Resources

Webinars

Virtual Summits

By Topic

In-Person Events

Virtual Live Seminars

Online Learning

By Topic

Connect and Contribute to Our Vibrant Community of Data Leaders

TDWI Articles

Moving the Textual Analytics Cheese

Related Articles

Trending Articles

Breaking Barriers in Conversational BI/AI with a Semantic Layer

AI in 2025: Key Considerations for Technology Leaders

The Tech Blanket: Building a Seamless Tech Ecosystem

What’s Ahead in Generative AI in 2025? (Part Two)

TDWI Membership

Accelerate Your Projects,
and Your Career

TDWI

Engage

Research

Research & Resources

Webinars

Virtual Summits

By Topic

In-Person Events

Virtual Live Seminars

Online Learning

By Topic

Connect and Contribute to Our Vibrant Community of Data Leaders

TDWI Articles

Moving the Textual Analytics Cheese

Related Articles

Trending Articles

Breaking Barriers in Conversational BI/AI with a Semantic Layer

AI in 2025: Key Considerations for Technology Leaders

The Tech Blanket: Building a Seamless Tech Ecosystem

What’s Ahead in Generative AI in 2025? (Part Two)

TDWI Membership

Accelerate Your Projects, and Your Career

TDWI

Engage

Research

Accelerate Your Projects,
and Your Career