Prof: Michael Schatz (mschatz @ cs.jhu.edu)
TA: Samantha Zarate (slzarate @ jhu.edu)
Class Hours: Monday + Wednesday @ 3:00 - 3:50p in Hodson 211
Schatz Office Hours: By appointment
Zarate Office Hours: Wednesdays 4:00 - 5:00p via Zoom (link on Piazza) and by appointment
The goal of this course is to prepare undergraduates to understand and perform state-of-the-art biomedical research. This will be accomplished through three main components: (1) classroom-style lectures on cross cutting techniques for biomedical research focusing on data visualization, statistical inference, and scientific computing; (2) research presentations from distinguished faculty on their active research projects; and (3) a major research project to be performed under the mentorship of a JHU professor. Students will present their research during an in-class symposium at the end of the semester. Grading will be based on homework exercises, a midterm exam, a written research proposal, an interim research report, an oral research presentation, and a final research report.
Within the course, we will study the leading computational and quantitative approaches for comparing and analyzing genomes starting from raw sequencing data. The course will focus on human genomics and human medical applications, but the techniques will be broadly applicable across the tree of life. The topics will include genome assembly & comparative genomics, variant identification & analysis, gene expression & regulation, personal genome analysis, and cancer genomics. There are no formal course prerequisites, although the course will require familiarity with UNIX scripting and/or programming to complete the assignments and course project.
- Online introduction to Unix/Linux. Students are strongly recommended to complete one of the following online tutorials (or both) before class begins.
- Access to a Linux machine, and/or install Docker (unfortunately, even Mac will not work correctly for some programs)
- Syllabus and Policies
- Piazza Discussion Board
- GradeScope Entry Code: D5GDXP
- Spring 2021 Graduate Course Materials
- Applied Comparative Genomics by Aaron Quinlan
- Algorithms for DNA Sequencing by Ben Langmead
- Computational Biology by Rob Patro
- HarvardX Biomedical Data Science
- PLOS Computational Biology Translational Bioinformatics
- Biostars Handbook
- Molecular Biology of the Gene (Watson et al)
- Molecular Biology of the Cell (Alberts)
- Biological Sequence Analysis (Durbin et al)
- Modern Statistics for Modern Biology (Holmes & Huber)