دورية أكاديمية

Normalization, testing, and false discovery rate estimation for RNA-sequencing data.

التفاصيل البيبلوغرافية
العنوان: Normalization, testing, and false discovery rate estimation for RNA-sequencing data.
المؤلفون: Li J, Witten DM, Johnstone IM, Tibshirani R, Li, Jun, Witten, Daniela M, Johnstone, Iain M, Tibshirani, Robert
المصدر: Biostatistics; Jul2012, Vol. 13 Issue 3, p523-538, 16p
مصطلحات موضوعية: POLYMERASE chain reaction, RNA, STATISTICS, DATA analysis, REVERSE transcriptase polymerase chain reaction, STATISTICAL models, SEQUENCE analysis
مستخلص: We discuss the identification of genes that are associated with an outcome in RNA sequencing and other sequence-based comparative genomic experiments. RNA-sequencing data take the form of counts, so models based on the Gaussian distribution are unsuitable. Moreover, normalization is challenging because different sequencing experiments may generate quite different total numbers of reads. To overcome these difficulties, we use a log-linear model with a new approach to normalization. We derive a novel procedure to estimate the false discovery rate (FDR). Our method can be applied to data with quantitative, two-class, or multiple-class outcomes, and the computation is fast even for large data sets. We study the accuracy of our approaches for significance calculation and FDR estimation, and we demonstrate that our method has potential advantages over existing methods that are based on a Poisson or negative binomial model. In summary, this work provides a pipeline for the significance analysis of sequencing data. [ABSTRACT FROM AUTHOR]
Copyright of Biostatistics is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:14654644
DOI:10.1093/biostatistics/kxr031