140.629.41
Data Science for Public Health II
Course Status
Discontinued
Course Status
Discontinued
Location
Internet
Term
4th Term
Department
Biostatistics
Credit(s)
4
Academic Year
2024 - 2025
Instruction Method
Synchronous Online
Tu, Th, 8:30 - 9:50am
Auditors Allowed
Yes, with instructor consent
Available to Undergraduate
Yes
Grading Restriction
Letter Grade or Pass/Fail
Course Instructor(s)
Contact Name
Frequency Schedule
Every Year
Resources
Prerequisite
140.628, prior programming experience, precalculus
mathematics
Presents the basics of data science using the python programming language. Teaches basic unix, version control, graphing and plotting techniques, creating interactive graphics, web app development, reproducible research tools and practices, resampling based statistics and artificial intelligence via deep learning, focusing on practical
implementation specifically tied to computational tools and core
fundamentals necessary for practical implementation. Culminates with a web app development project chosen by student (who will come out of this course sequence well-equipped to tackle many of the data science problems that they will see in their
research).
Learning Objectives
Upon successfully completing this course, students will be able to:
- Demonstrate proficiency in data-oriented python programming
- Practice basic data cleaning in pythnon
- Implement and demonstrate proficiency in tidyverse commands
- Implement plotting and interactive graphics tools on novel data sets
- Implement artificial intelligence programs on novel data sets
- Create a web application
- Implement resampling-based statistics
- Synthesize concepts of machine learning overfitting
Methods of Assessment
This course is evaluated as follows:
- 66% Homeworks/coding projects
- 33% Final Capstone Project
Please note: This is the virtual/online section of a course that is also offered onsite. Students will need to commit to the modality for which they register.