x

Topics

Earn a Certificate

TDWI Chicago, May 8-13, has been postponed.

We will now be delivering this conference program at TDWI San Diego, August 7–12, 2022. The conference site will be live soon! Sign up for email updates below. Register here.

If you have any questions, please contact [email protected].

By using tdwi.org website you agree to our use of cookies as described in our cookie policy. Learn More

Course Description

TH6 Hands-On: Data Manipulation and Cleaning in Python

May 12, 2022

9:00 am - 5:00 pm

Duration: Full Day Course

Level: Beginner to Intermediate

Prerequisite: Basic Python programming. You will need a laptop computer with specific software installed prior to the session. When you register for the class, you will receive detailed instructions for software download and installation.

Deanne Larson, Ph.D.

DM, CBIP

President

Larson & Associates

Data manipulation and cleaning in machine learning is estimated to take more than 50% of the time allotted for a machine learning project. Python is a powerful tool that enables robust manipulation, and cleansing is also leveraged highly in Python. This course will cover topics important in handling structured and unstructured data, as well as scraping data in Python, including using key packages such as Pandas, NumPy, and Matplotlib. Participants will complete exercises to solidify understanding and build skills with the intent of finishing the course with a toolkit that can be leveraged in building Python manipulation and cleaning skills.

You Will Learn

  • Programming review
  • Data acquisition via APIs
  • Overview of scraping and wrangling
  • Cleansing for machine learning
  • Leveraging Pandas
  • Leveraging NumPy and Matplotlib
  • Dealing with strings
  • Visualizing results

Geared To

  • Data engineers, data scientists, business and data analysts, project managers
  • Roles that need insight into best practices and techniques related to data understanding and preparation

Laptop Setup

Students will use their own computers to complete lab exercises. Installation of Python and select open source libraries will be required in advance of the course.

Installation instructions will be emailed to registrants prior to the event. You must prepare your computer BEFORE the event.

There is no time allotted in class for computer preparation..

* Enrollment is limited to 35 attendees.

The clock is ticking.

Register Now

Register Online

Rest easy—online registrations for this conference are secure. Our secured server environment keeps your information private.