Data Science Methodology
Instructor: Alex Aklson , Polong Lin
Beginner Level • 4 hours to complete • Flexible Schedule
What You'll Learn
- Describe what a data science methodology is and why data scientists need a methodology.
- Apply the six stages in the Cross-Industry Process for Data Mining (CRISP-DM) methodology to analyze a case study.
- Evaluate which analytic model is appropriate among predictive, descriptive, and classification models used to analyze a case study.
- Determine appropriate data sources for your data science analysis methodology.
Skills You'll Gain
Data Modeling
Jupyter
Predictive Modeling
Data Science
Data Quality
Analytical Skills
Decision Tree Learning
Business Analysis
Stakeholder Engagement
Data Mining
Data Cleansing
Peer Review
User Feedback
Data Storytelling
Shareable Certificate
Earn a shareable certificate to add to your LinkedIn profile
Outcomes
-
Learn new concepts from industry experts
-
Gain a foundational understanding of a subject or tool
-
Develop job-relevant skills with hands-on projects
-
Earn a shareable career certificate
There are 4 modules in this course
In this module, you will discover what makes data science interesting, learn what a data science methodology is, and why data scientists need a data science methodology. Next, you’ll gain more in-depth knowledge of the first two data science methodology stages: Business Understanding and Analytic Approach. You’ll discover how to identify considerations and steps needed to define the data requirements for decision tree classification during the Data Requirements stage. Next, learn about the processes and techniques data scientists use to assess data content, quality, and initial insights and how data scientists manage data gaps. Round out this week with practical hands-on experience learning how to approach the Business Understanding and the Analytic Approach stage tasks and the Data Requirements and Collection stage tasks for any data science problem.
In this module, you will learn what data scientists do when their tasks and goals are to understand, prepare, and clean the data. You’ll examine the purposes, characteristics, and goals of the data modeling process. You’ll also explore how to prepare a data set by handling missing, invalid, or misleading data.
Then check out the hands-on labs where you can gain experience completing tasks relevant to the Data Understanding, Data Preparation, and Modeling and Evaluation stages. You’ll be able to apply the skills you learn to future data science problems.
When you complete this module, you’ll be able to describe the deployment and feedback stages of the data science methodology. You’ll learn how to assess a data model’s performance, impact, and readiness. You’ll be able to identify the stakeholders who usually contribute to model refinement. You’ll also be able to explain why deployment and feedback should be an iterative process.
To complete your hands-on lab experience, you’ll devise a business problem to solve using data related to email, hospitals, or credit cards. You’ll demonstrate your understanding of data science methodology by applying it to a given problem. You’ll construct responses that address each phase of the CRISP-DM based on a chosen business problem. After submitting your work, you’ll evaluate your peers’ final projects and provide constructive ideas and suggestions that fellow learners can apply right away.
Before completing your final project, learn how CRISP-DM data science methodology compares to John Rollins’ foundational data science methodology. Then, apply what you learned to complete a peer-graded assignment using CRISP-DM data science methodology to solve a business problem you define. You'll first take on both the client and data scientist role and describe how you would apply CRISP-DM data science methodology to solve the business problem.
Then, take on the role of a data scientist and apply your knowledge of CRISP-DM data methodology stages to describe how you would solve the business problem.
After you submit your assignment, you'll grade the assignment of one peer who is enrolled in this session. Let's get started!