Skip to main content

Advanced Data Analysis Workshop

Summer Institute
Academic Year
2022 - 2023
Instruction Method
Synchronous Online
Start Date
Monday, June 27, 2022
End Date
Friday, July 1, 2022
Class Time(s)
M, Tu, W, Th, F, 1:30 - 5:00pm
Auditors Allowed
Available to Undergraduate
Grading Restriction
Letter Grade or Pass/Fail
Course Instructor(s)
Contact Name
Frequency Schedule
Every Year

Data Analysis Workshop I and II (140.613 and 140.614)

Covers methods for the organization, management, exploration, and statistical inference from data derived from multivariable regression models, including linear, logistic, Poisson and Cox regression models. Students apply these concepts to two or three public health data sets in a computer laboratory setting using STATA statistical software. Topics covered include generalized linear models, product-limit (Kaplan-Meier) estimation, Cox proportional hazards model.
Learning Objectives
Upon successfully completing this course, students will be able to:
  1. Conduct a simple linear, logistic or survival regression and correctly interpret the regression coefficients and their confidence interval
  2. Conduct a multiple linear, logistic or survival regression and correctly interpret the coefficients and their confidence intervals
  3. Examine residuals and adjusted variable plots for inconsistencies between the regression model and patterns in the data and for outliers and high leverage observations
  4. Fit and compare different models to explore the association between outcome and predictor variables in an observational study
Methods of Assessment
This course is evaluated as follows:
  • 40% Quizzes
  • 60% Final Exam
Special Comments

This is a hybrid course with both a synchronous online section (140.620.49) and an in-person section (140.620.11). Please choose the modality you need (either online or in-person) when registering in SIS.