
昆虫RNASeq数据的分析流程
Insect RNASeq data analysis pipeline
刘金定1,2张赞1黄水清2李飞1
点击:2137次 下载:74次
DOI:
作者单位:1.南京农业大学植物保护学院南京210095;2.南京农业大学信息科学技术学院南京210095
中文关键词:高通量RNA测序,段序列比对,转录组拼接,基因功能注释,基因表达定量,基因差异表达
英文关键词:highthroughput RNA sequencing, short sequence alignment, transcriptome reconstruction, gene function annotation, gene expression quantification, gene differential expression
中文摘要:
随着高通量RNA测序(RNASeq)技术的发展和测序成本迅速下降,RNASeq技术已经成为生物学研究的重要工具,为生物学家全面地了解和研究转录组提供了机遇。高通量测序具有读长短、存在一定比例的测序错误、数据量大等特点,因此RNASeq数据分析与基因组分析和传统的EST数据分析有所不同。本文通过介绍不同的测序平台、原始数据产生和低质量数据过滤的计算流程,对短序列比对、转录组拼接、功能注释、以及差异表达分析进行了研究和分析,最后对RNASeq在昆虫学研究中的应用进行了综述,并对RNASeq技术进行了总结和展望。
英文摘要:With the rapid development of highthroughput RNA sequencing (RNASeq) technology and the rapidly decreasing cost of this method, RNASeq is becoming an important tool for biological research, especially investigating gene function at the transcriptome level. RNASeq typically reads sequences rapidly with a certain percentage of sequence errors and bias producing a huge amount of data. RNASeq data analysis faces lots of challenges. Here, we describe different RNA sequencing platforms, raw data generation processes and data filtering and introduce short sequence alignment, transcriptome assembly, functional annotation and gene expression analysis. Finally, we briefly review the application of RNASeq in insects. The prospects of RNASeq techniques and their application are also discussed.