As the world of analytics evolves, the roles and participants in data ecosystems must adapt. With the convergence of several influences—big data, self-service analytics, and self-service data preparation—we need to actively manage the inventory of self-service data.
Data curation is both a data inventory management process and a data governance activity. The data curator is responsible for overseeing a collection of data assets and making it available to and findable by anyone who needs data. Cataloging maintains the collection of metadata that is necessary to support browsing, searching, evaluating, accessing, and securing data sets.
Curating and cataloging work together to meet the data needs of business and data analysts, to provide self-service data to complement self-service analytics, and to realize the promise of democratizing data analytics.
You Will Learn
- The concepts, responsibilities, and skills of data curation
- The role of the data curator in data governance and the differences between a data curator and a data steward
- The needs of data seekers and the ways that curating and cataloging help meet them
- The purpose, content, and uses of a data catalog
- The state of data cataloging tools and technology
- Guidelines for getting started with data curating and cataloging
- Business and IT leaders struggling with the paradoxes of modern data management
- Analytics and BI designers and developers who are dependent on fresh and relevant data for every analytics use case
- Data management professionals at all levels from architects to engineers
- Data governance professionals—especially data stewards who need to adapt to the changing world of modern data management