RNA测序数据分析
合集下载
相关主题
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
1. Known gene differential expression
Genes, DESeq2 , edgeR Transcripts Exons, DEXSeq
2. Novel transcripts
RNA-seq work flow
1. Experimental design – keep it simple; replicate. 2. Wet-lab preparation – covariates & opportunities for ‘batch’ effects 3. Sequencing – paired-end valuable for transcript-level inference 4. Alignment – typically to whole genome; requires aligner capable of gapped alignments 5. Summary – reads overlapping each gene or region of interest 6. Analysis – linear model (e.g., t-test) fit to each region of interest; ‘top table’ of differentially expressed genes 7. Comprehension – annotation of differentially expressed regions, gene set enrichment, comparison to other studies, integration with other data types
RNA-seq summary: counts per region of interest
Input: BAM files of aligned reads, typically one per sample How to count?
What is an ‘overlap’ ? What (Bioconductor ) software to use?
Output: region × sample matrix of read counts Not RPKM or other ‘normalized’ measure
RNA-seq summary: how to count?
Counting modes
Counting in Bioconductor GenomicAlignments
Counting reads for RNA-seq
Martin T. Morgan mtmorgan@fhcrc.org Fred Hutchinson Cancer Research Center Seattle, WA, USA
25 August 2014
Varieties of RNA-seq
summarizeOverlaps() –
standard adn customized counting modes RsubreadLeabharlann BaidufeatureCounts() – fast; Linux and Mac only