You must use the best data if your team wants to build the most valuable machine learning models. This idea is summarized in a famous quote, “data trumps algorithm.” In this hands-on course, your team will learn some of the most useful data-wrangling techniques for producing the best data to use in your machine learning models.
Via a series of labs, your team will get hands-on experience wrangling data using R via libraries like dplyr and lubridate. This course is designed for a broad audience, and no prior knowledge of R programming is required.
This course aims for your team to return to work and immediately employ these techniques to wrangle data, enhance data analyses, and craft the most valuable machine learning models.
Want to know the best part?
While the course uses R because it is free, all the concepts/techniques you will learn are applicable to any machine learning technology you might use.
Your Team Will Learn
- How to wrangle data in R the tidyverse way
- Working with character data
- Wrangling date and time data
- Pivoting/aggregating tables of data
- Joining tables of data
- Strategies for dealing with missing data
- Additional resources to extend your learning
- Business/data analysts
- Database developers
- BI/report developers
- Anyone interested in building machine learning models
No prior skills in programming are required.
Attendees will need a laptop computer with specific software installed before the session. In advance of the class, attendees will receive detailed software download and installation instructions.