Level: Beginner to Intermediate
Data manipulation and cleaning in machine learning is estimated to take more than 50% of the time allotted for a machine learning project. Python is a powerful tool that enables robust manipulation, and cleansing is also leveraged highly in Python. This course will cover topics important in handling structured and unstructured data, as well as scraping data in Python, including using key packages such as Pandas, NumPy, and Matplotlib. Participants will complete exercises to solidify understanding and build skills with the intent of finishing the course with a toolkit that can be leveraged in building Python manipulation and cleaning skills.
You Will Learn
- Programming review
- Data acquisition via APIs
- Overview of scraping and wrangling
- Cleansing for machine learning
- Leveraging Pandas
- Leveraging NumPy and Matplotlib
- Dealing with strings
- Visualizing results
- Data engineers
- Data scientists
- Business and data analysts
- Project managers
- Roles that need insight into best practices and techniques related to data understanding and preparation
In this session, the instructor will teach you principles and practices, show you how to use the tools, and demonstrate with live examples. You will receive installation instructions and take-home workshop materials to complete hands-on exercises on your own, after the live session.
Completion of take-home workshop exercises will require Python and several open-source libraries. Detailed installation instructions will be provided during class.
TDWI LIVE STUDIO AUDIENCE:
This session will be recorded for development of an online learning course which will subsequently be available for purchase directly or via subscription. By attending, you agree that your likeness may appear in the online course, including audio and video.