Data Science: Probability

Learn probability theory -- essential for a data scientist -- using a case study on the financial crisis of 2007-2008.

In this online course taught by Harvard Professor Rafael Irizarry, learn probability theory -- essential for a data scientist -- using a case study on the financial crisis of 2007-2008.

Featuring faculty from:
Self-Paced
Length
8 weeks
1-2 hours a week
Certificate Price
$149
Program Dates
Self-Paced
Length
8 weeks
1-2 hours a week
Certificate Price
$149
Program Dates
Start Data Science: Probability Today

What You'll Learn

In this course,part of our Professional Certificate Program in Data Science, you will learn valuable concepts in probability theory. The motivation for this course is the circumstances surrounding the financial crisis of 2007-2008. Part of what caused this financial crisis was that the risk of some securities sold by financial institutions was underestimated. To begin to understand this very complicated event, we need to understand the basics of probability.

We will introduce important concepts such as random variables, independence, Monte Carlo simulations, expected values, standard errors, and the Central Limit Theorem. These statistical concepts are fundamental to conducting statistical tests on data and understanding whether the data you are analyzing is likely occurring due to an experimental method or to chance.

Probability theory is the mathematical foundation of statistical inference which is indispensable for analyzing data affected by chance, and thus essential for data scientists.

The course will be delivered via edX and connect learners around the world. By the end of the course, participants will learn:

  • Important concepts in probability theory including random variables and independence
  • How to perform a Monte Carlo simulation
  • The meaning of expected values and standard errors and how to compute them in R
  • The importance of the Central Limit Theorem

Your Instructors

Image
Rafael Irizarry

Rafael Irizarry

Professor of Biostatistics at Harvard University
Read full bio.

Ways to take this course

When you enroll in this course, you will have the option of pursuing a Verified Certificate or Auditing the Course.

A Verified Certificate costs $149 and provides unlimited access to full course materials, activities, tests, and forums. At the end of the course, learners who earn a passing grade can receive a certificate. 

Alternatively, learners can Audit the course for free and have access to select course material, activities, tests, and forums. Please note that this track does not offer a certificate for learners who earn a passing grade.

Read More

Introduction to Linear Models and Matrix Algebra

Perform matrix operations

Learn to use R programming to apply linear models to analyze data in life sciences.

Read More

Data Science: Inference and Modeling

Key concepts through a motivating case study

Learn inference and modeling: two of the most widely used statistical tools in data analysis.

Read More

Data Science: Capstone

To become an expert you need practice and experience.

Show what you’ve learned from the Professional Certificate Program in Data Science.