This course is offered by the Statistics department at UC Berkeley and is designed to follow the UC Berkeley course "Foundations of Data Science" (Data8) or STAT 20.

The course teaches a broad range of statistical methods used to solve data problems. Topics include group comparisons, standard parametric statistical models, multivariate data visualization, multiple linear regression and classification, classification and regression trees, and random forests. Students are introduced to the widely used R statistical language and gain hands-on experience implementing a range of statistical methods on numerous real-world datasets.

Students interested in taking this course should consult the FAQ link above, which discusses prerequisites and how this course compares to other courses offered in Statistics and Data Science.

The course does not follow a standard textbook; instead, it follows the online manuscript available here.

Also see this news release about the course.