Getting and Cleaning Data

1 week to complete at 10 hours a week
Flexible Schedule

Jeff Leek, PhD , Roger D. Peng, PhD , Brian Caffo, PhD

What You’ll Learn

Understand common data storage systems

Apply data cleaning basics to make data "tidy"

Use R for text and date manipulation

Obtain usable data from the web, APIs, and databases

Skills You’ll Gain

Data Access Data Import/Export R Programming Web Scraping Application Programming Interface (API) MySQL Data Quality Data Cleansing Data Management Data Collection File Management Exploratory Data Analysis SQL Data Wrangling Data Integration Data Manipulation Data Transformation

Shareable Certificate

Earn a shareable certificate to add to your LinkedIn profile.

Develop Your Specialized Knowledge

Learn new concepts from industry experts

Gain a foundational understanding of a subject or tool

Develop job-relevant skills with hands-on projects

Earn a shareable career certificate

There are 4 modules in this course

In this first week of the course, we look at finding data and reading different file types.

Welcome to Week 2 of Getting and Cleaning Data! The primary goal is to introduce you to the most common data storage systems and the appropriate tools to extract data from web or from databases like MySQL.

Welcome to Week 3 of Getting and Cleaning Data! This week the lectures will focus on organizing, merging and managing the data you have collected using the lectures from Weeks 1 and 2.

Welcome to Week 4 of Getting and Cleaning Data! This week we finish up with lectures on text and date manipulation in R. In this final week we will also focus on peer grading of Course Projects.