دورية أكاديمية

Comparative studies of differential gene calling using RNA-Seq data.

التفاصيل البيبلوغرافية
العنوان: Comparative studies of differential gene calling using RNA-Seq data.
المؤلفون: Ximeng Zheng, Moriyama, Etsuko N.
المصدر: BMC Bioinformatics; 2013, Vol. 14 Issue Suppl 13, p1-10, 10p, 5 Charts, 4 Graphs
مصطلحات موضوعية: NUCLEOTIDE sequence, GENE expression profiling, COMPARATIVE studies, MICROARRAY technology, NONPARAMETRIC statistics
مستخلص: Background: With its massive amount of data, gene-expression profiling by RNA-Seq has many advantanges compared with microarray experiments. RNA-Seq analysis, however, is fundamentally different from microarray data analysis. Techniques developed for analyzing microarray data thus cannot be directly applicable for the digital gene expression data. Several statistical methods have been developed for identifying differentially expressed genes specifically from RNA-Seq data over the past few years. Results: In this study, we examined the performance of differential gene-calling methods using RNA-Seq data in practical situations. We focused on two representative methods: one parametric method, DESeq, and one nonparametric method, NOISeq. We examined their performance using both simulated and real datasets. Our simulation followed the RNA-Seq process and produced more realistic short read data. Both DESeq and NOISeq identified over-expressed genes more correctly than under-expressed genes. While DESeq was more likely to call longer genes as differentially expressed than shorter ones, NOISeq did not have such bias. When the underlying variation increased, both methods showed higher rates of false positives. When replicates were not available in the experiments, both methods showed lower rates of true positives and higher rates of false positives. Conclusions: The level of variation clearly affected the performance of both methods, showing the importance of understanding the variation in the data as well as having replications in RNA-Seq experiments. We showed that it is possible to obtain improved differential gene-calling results by combining the results obtained by the two methods. We suggested strategies to use these two methods individually or combined according to the characteristics of the data. [ABSTRACT FROM AUTHOR]
Copyright of BMC Bioinformatics is the property of BioMed Central and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:14712105
DOI:10.1186/1471-2105-14-S13-S7