Spring 2017
In Spring 2017, this course was offered under the number ‘STAT 28’.
Lectures
MWF 11-12:00pm, Barrows 20
Labs
Thursday 2-4P, 342 Evans Thursday 4-6pm, 342 Evans
Instructors | Office | Office Hours | |
---|---|---|---|
Adityanand Guntuboyina | 423 Evans Hall | agun@berkeley.edu | W 2-4 |
Elizabeth Purdom | 433 Evans Hall | epurdom@stat.berkeley.edu | W 2-4 |
GSI | Office Hours | |
---|---|---|
Boying Gong | jorothy_gong@berkeley.edu | M 4-6, Evans 342 and F 9-11am Evans 444 |
There is no book for this class. Instead, we have provided extensive lecture notes. If you would like some additional optional reading, you can try the following books.
- The Statistical Sleuth: A Course in Methods of Data Analysis by Ramsey and Schafer
- Introductory Statistics with R by Peter Dalgaard
Neither of these books covers all of the topics we will cover, nor do they have the same perspective and focus as this class – they do not have extensive use of bootstrapping and resampling methods. But for those students wanting some additional structure or R assistance these books may be helpful and should be at the right level for this class
Syllabus
Week | Description | Chapter | Lab Link | Assignment Due |
---|---|---|---|---|
01 | Boxplots, discrete distributions, intro to continuous distributions | 01 | Lab 1 | |
02 | Continuous distributions, density curves, density estimation | 01 | Lab 2 | |
03 | Permutation test, t-test and assumptions | 02 | Lab 3 | Hw1 Due (F) |
04 | More on assumptions, type I error, multiple testing, Bonferroni corrections | 02 | Lab 4 | |
05 | Confidence intervals, Bonferroni corrections, review simple regression | 02, 03 | Lab 5 | Hw2 Due (W) |
06 | Polynomial regression, loess curves | 03 | Lab 6 | |
07 | Finish loess curves, smooth density plots, pairs plots, alluvial plots, mosaic plots | 03, 04 | Lab 7 | Hw3 Due (M) |
08 | Heatmaps, hierarchical clustering, PCA | 04 | Midterm Review | Hw4 Due (W) |
09 | Midterm (M) Finish PCA, start multiple regression |
04, 05 | Lab 8 | Project 1 Due (F) |
10 | Multiple linear regression, fitting and interpretation, fitted values, residuals, Multiple R-squared, Residual degrees of freedom and residual standard error | 05 | Lab 9 | |
Spring Break | ||||
11 | Multiple regression with categorical explanatory variables and interactions, Inference in multiple regression: F-tests via the anova function | 05 | Lab 10 | |
12 | Inference in multiple regression: t-tests, standard errors, confidence intervals and prediction intervals. Variable selection in linear regression. Regression diagnostics | 05 | Lab 11 | Hw5 Due (M) |
13 | The Classification problem and logistic regression, interpretation in terms of odds, binary predictions via confusion matrices, precision and recall, deviance, variable selection via AIC | 06 | Lab 12 | Hw6 Due (W) |
14 | Regression trees, classification trees and Random Forests | 07 | Lab 13 | Project 2 Due (F) |
15 | Reading and recitation week: no class | Project 3 Due (F) |