Data Catalog: Creating a Single Source of Reference
January 1, 2019
Every business is trying to become data driven, and managing data as a true economic asset is a fundamental part of that transformation. However, today’s analytic environments are rapidly growing, both in the volume of data that they store and the number of self-service analytics users that they support.
Thirty years ago, the goal was to organize all enterprise data in one location to achieve a "single source of truth," but now there are simply too many data sources and too many self-service tools. The machine learning data catalog has emerged as a key technology enabler for a "single source of reference" -- one place for anyone within the organization to find curated data, understand how that data has been used and why it was created, and trust that it is right for the analysis at hand, whether they are a data scientist, an analyst, or even a casual business consumer of data.