Level: Intermediate to Advanced
Prerequisite: None
Data science is the key to business success in the information economy. This course will teach you about best practices in deploying a data science capability for your organization. Technology is the easy part—the hard part is creating the right organizational and delivery framework in which data science can succeed.
We will discuss the necessary skill sets for a successful data scientist and the environment that will allow them to thrive. We will draw a strong distinction between “data R&D” and “data product” capabilities within an enterprise and speak to the different skill sets, governance, and technologies needed across these areas. We will also explore the use of open data sets and open source software tools to enable best results from data science in large organizations, as well as the many pitfalls and how to avoid them.
You Will Learn
- How to innovate using data science in the age of big data
- The most common mistakes made with big data analytics
- How to deploy a data lake, data R&D, and data product capabilities within your organization