What you'll learn

Organizing high-throughput data
Multiple comparison problem
Family-wise error rates
False discovery rate
Error rate control procedures
Bonferroni correction

Course description

In this course, you’ll learn statistical topics including the multiple testing problem, error rates, error rate controlling procedures, false discovery rates, q-values, and exploratory data analysis. We then introduce statistical modeling and how it is applied to high-throughput data. In particular, we discuss parametric distributions, including the binomial, exponential, and gamma, and describe maximum likelihood estimation. We provide several examples of how these concepts are applied to next-generation sequencing and microarray data. Finally, we discuss hierarchical models and empirical Bayes, along with examples of how they are used in practice. R programming examples throughout connect the concepts to their implementation.

This class was supported in part by NIH grant R25GM114818.

4 weeks long

Available now

Statistical Inference and Modeling for High-throughput Experiments

Associated Schools

Harvard T.H. Chan School of Public Health

Instructors

Rafael Irizarry

Michael Love

You may also like

High-Dimensional Data Analysis

Introduction to Bioconductor

Principles, Statistical and Computational Tools for Reproducible Data Science
