August 6, 2018
2:00 pm - 5:15 pm
Duration: Half Day Course
You may have heard that data scientists spend 80 percent of their time sourcing, cleaning, and preparing data. Whether or not that figure is an exaggeration, data preparation is certainly a large and important part of data science and predictive analytics. Data often does not start out in an ideal format: it may contain bad values, may not be easily accessible, or may need to be transformed before we can start exploring it and building models.
In this session, we will provide an overview of sourcing and preparing data for data science and predictive analytics projects. We will use a motivating example from the speaker’s work and also touch on how Python, SQL, and Hadoop can be used in the data preparation workflow.