Course
Course Catalog URL:
Identifier:
COM S 5510
Offered during Fall Semester of odd years.
- Credits: 3 credit hours
- Instructor's or course coordinator's name: Xiaoqiu Huang
- Textbook, title, author, and year: None
- Other supplemental materials: None
Course Information
- Brief description of the content of the course: Introduction to a big data research area in bioinformatics. Focus on applying computational techniques to huge genomic sequence data. These techniques include finding optimal sequence alignments, generating genome assemblies, finding genes in genomic sequences, mapping short sequences onto a genome assembly, finding single-nucleotide and structural variations, building phylogenetic trees from genome sequences, and performing genome-wide association studies.
- Prerequisites or co-requisites: COM S 311
Course Outcomes
- Be able to understand and use bioinformatics tools for analysis of next-generation data.
- Be able to conduct a research project in genomics through analysis of next-generation data.
Topics
- Computing local alignments with SIM
- Computing a global alignment with GAP3
- Genome assembly with PCAP.Solexa
- Genome assembly using Velvet
- Viewing an assembly in Consed
- Transcriptome assembly with Trinity
- Gene finding with Augustus
- Gene finding with AAT
- Read mapping with Bowtie2
- Sequence mapping with BWA
- Manipulating alignment/map formats with Samtools and Picard
- Calling SNVs and CNVs with SpeedSeq
- Genome alignment using TBA
- Phylogenetic analysis using SNPs
- Viewing read alignments to a reference sequence using IGV
- Performing a genome-wide association study with PLINK and Haploview