Each week, classes will be a mix of lecture, class discussion, and
short activities.
- You should bring your laptop with you to every class
period. We will be using R and RStudio extensively in this
course. In particular, students are encouraged to write homework
assignments in R Markdown and compile to pdf for the submission.
- Assigned readings for each week are posted. Whether you learn better
by reading prior to lecture, or hearing a lecture prior to reading is up
to you.
- Weeks will typically alternate between homework assignments and
in-class quizzes, with homework due dates and quizzes falling on
Thursdays.
- Homework assignments should be submitted in D2L by 11pm on the due
date.
- Each quiz will cover the content from the previous week’s homework,
e.g., Quiz 1 will cover content from Homework 1.
Week 1 (Jan 19–21)
Course Overview
Week 2 (Jan 24–28)
Content
- Probability distributions for categorical data: binomial,
multinomial, negative binomial
- Asymptotic inference for one and two proportions
- Exact (binomial) inference for one proportion
- Maximum likelihood estimation
- Types of sampling and studies
- Probability structure for \(I\times
J\) contingency tables
Assigned readings
- Chapter 1 Sections 1.1–1.3, 1.6
- Chapter 2 Section 2.1
Week 3 (Jan 31–Feb 4)
Content
- Asymptotic inference for \(2\times
2\) tables: difference in proportions, relative risk, odds
ratio
Assigned readings
- Chapter 2 Sections 2.2–2.3
Homework/Quiz
- Quiz 1 in class Thur Feb 3
Week 4 (Feb 7–11)
Content
- Randomization tests for \(2\times
2\) tables
- Fisher’s Exact Test for \(2\times
2\) tables
- Chi-squared tests of independence for \(I\times J\) contingency tables
- Association in three-way tables
Assigned readings
- Chapter 2 Sections 2.4, 2.6–2.7
Homework/Quiz
- Homework 2 (Rmd) due Thur Feb 10 by 11pm —
types of sampling and studies, asymptotic inference for \(2\times 2\) tables, randomization tests,
Fisher’s Exact Test
Week 5 (Feb 14–18)
Content
- Components of a generalized linear model (GLM)
- GLMs for binary data
Assigned readings
- Chapter 3 Sections 3.1–3.2
- Chapter 4 Sections 4.1–4.6
- Chapter 5 Section 5.3
Homework/Quiz
- Quiz 2 in class Thur Feb 17
Week 6 (Feb 21–25)
Content
- GLMs for binary data (continued)
Homework/Quiz
- Homework 3 (Rmd) due Thur Feb 24 by 11pm —
chi-squared tests of independence, three-way tables, components of a
GLM
Week 7 (Feb 28–Mar 4)
Content
- Logistic regression with categorical predictors
Homework/Quiz
- Quiz 3 in class Thur Mar 3
Week 8 (Mar 7–11)
Content
- GLMs for count data
- Model fitting, selection and diagnostics for GLMs
Assigned readings
- Chapter 3 Sections 3.3–3.5
- Chapter 5 Sections 5.1–5.3
Thursday
- Homicides and gun registraction Poisson regression example (Rmd) (html) (updated 3/22 and
3/24)
- In-class notes
Homework/Quiz
- Homework 4 (Rmd) due Thur Mar 10 by 11pm in
Gradescope — GLMs for binary data (logistic regression)
Week 9 (Mar 21–25)
Content
- GLMs for count data (continued)
- Model fitting, selection and diagnostics for GLMs
Homework/Quiz
- Quiz 4 in class Thur Mar 24
Week 10 (Mar 28–Apr 1)
Content
- Model fitting, selection and diagnostics for GLMs (cont)
Assigned readings
- Chapter 6 Sections 6.1–6.2
Homework/Quiz
- Homework 5 (Rmd) due Thur Mar 31 by 11pm —
GLMS for Poisson regression, model fitting, selection and diagnostics
for GLMs
Week 11 (Apr 4–8)
Content
- Multicategory logit models (cont)
Assigned readings
- Chapter 8 Sections 8.1, 8.3, 8.5
Homework/Quiz
- Quiz 5 in class Thur Apr 7
Project Deadline
- Data analysis proposal due by 11pm Friday Apr 8 in Gradescope.
Week 12 (Apr 11–15)
Content
- Modeling correlated data
- Models for matched pairs
- Marginal models (GEEs)
- Generalized linear mixed models (GLMMs)
Assigned readings
- Chapter 8 Sections 8.1–8.2 (skip 8.3–8.6)
- Chapter 9 Sections 9.1–9.2 (skip 9.3–9.5)
- Chapter 10 Sections 10.1–10.2 (skip 10.3–10.5)
Homework/Quiz
- Homework 6 (Rmd) due Mon Apr 18 by 11pm —
baseline and cumulative logit models for multinomial data
Week 13 (Apr 18–22)
Content
- Modeling correlated data (continued)
Thursday
- Correlated data slides and epilepsy example (continued)
- In-class notes
Homework/Quiz
- Quiz 6 in class Thur Apr 21
Project Deadline
- Draft report due by 11pm Friday Apr 22 in Gradescope.
Week 14 (Apr 25–29)
Content
- Modeling correlated data (continued)
- GLM leftovers:
- Residual diagnostics
- Dealing with missing data
Assigned readings
- Continued from last week
- Some content not in textbook
Thursday
Correlated binary data example:
Homework/Quiz
- Homework 7 (Rmd) due Thur Apr 28 by 11pm —
population-averaged models (i.e., marginal models, GEEs),
subject-specific models (i.e., generalized linear mixed effects
models)
Project Deadline
- Peer assessments due by 11pm Friday Apr 29 in Gradescope.
Week 15 (May 2–6)
Content
- Classification and clustering
Assigned readings
- Review Chapter 4 Section 4.6
- Chapter 11 Sections 11.1–11.3
Homework/Quiz
- Quiz 7 in class Thur May 5 (make-up quiz only)
Project Deadline
- Final report due by 11pm Friday May 6 in Gradescope.
Final Exam Week
Take-home final exam
Released Friday, May 6. Due by 11pm Wednesday, May 11 in
Gradescope.
Project presentations
Thursday, May 12 8:00–9:50am