Transform, visualize, summarize, and analyze data in R, with packages from the Tidyverse, using RStudio
Carry out analyses in a reproducible and shareable manner with Quarto
Learn to effectively communicate results through an optional written project version controlled with Git and hosted on GitHub
Earn a shareable certificate to add to your LinkedIn profile.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
Hello World! In the first module, you will learn about what data science is and how data science techniques are used to make meaning from data and inform data-driven decisions. There is also discussion around the importance of reproducibility in science and the techniques used to achieve this. Next, you will learn the technology languages of R, RStudio, Quarto, and GitHub, as well as their role in data science and reproducibility.
In our second module, we'll advance our understanding of R to set the stage for creating data visualizations using tidyverse’s data visualization package: ggplot2. We'll learn all about different data types and the appropriate data visualization techniques that can be used to plot these data. The majority of this module is to help best understand ggplot2 syntax and how it relates to the Grammar of Graphics. By the end of this module, you will have started building up the foundation of your statistical tool-kit needed to create basic data visualizations in R.
In this module, we will take a step back and learn about tools for transforming data that might not yet be ready for visualization as well as for summarizing data with tidyverse’s data wrangling package: dplyr. In addition to describing distributions of single variables, you will also learn to explore relationships between two or more variables. Finally, you will continue to hone your data visualization skills with plots for various data types.