Deriving Value from Unstructured Data for Machine Learning: What You Need to Know
Webinar Speaker: Fern Halper, TDWI VP Research, Senior Research Director for Advanced Analytics
Date: Friday, January 21, 2022
Time: 9:00 a.m. PT, 12:00 p.m. ET
By many industry estimates, the bulk of data generated today (80 to 90 percent) is unstructured data, such as text, images, audio files, and other formats. Unstructured data is everywhere—it comes from internal sources such as emails, company documents, call center notes, claims forms, and surveys, as well as external sources such as Tweets, sensor data, and external media. The collection of certain types of unstructured data, such as text data, is already mainstream according to TDWI research. Organizations want to use this unstructured data to enrich structured data for use cases such as predictive maintenance, customer retention models, image classification—to name a few.
Unstructured data can consume large volumes of storage, and without any way to manage, govern, and analyze this data, it provides no value. As organizations look to use unstructured data for data science use cases, they will need to consider issues such as providing a data foundation, processing, analyzing, and governing the data. For example, how do you know if unstructured data is sensitive? How do you classify it? Who can access it? Once it is accessed, how is it processed? Join TDWI VP of Research Fern Halper as she drills into the world of unstructured data.
Attend this webinar and learn:
- The value proposition for unstructured data
- Tools for analyzing unstructured data
- Governance considerations for unstructured data
- Enriching unstructured data for better analysis
Fern Halper, Ph.D.