دورية أكاديمية

Zipper plot: visualizing transcriptional activity of genomic regions

التفاصيل البيبلوغرافية
العنوان: Zipper plot: visualizing transcriptional activity of genomic regions
المؤلفون: Francisco Avila Cobos, Jasper Anckaert, Pieter-Jan Volders, Celine Everaert, Dries Rombaut, Jo Vandesompele, Katleen De Preter, Pieter Mestdagh
المصدر: BMC Bioinformatics, Vol 18, Iss 1, Pp 1-9 (2017)
بيانات النشر: BMC, 2017.
سنة النشر: 2017
المجموعة: LCC:Computer applications to medicine. Medical informatics
LCC:Biology (General)
مصطلحات موضوعية: Transcription Start Site, Genomic Feature, Transcript Model, lncRNA Transcript, Close Peak, Computer applications to medicine. Medical informatics, R858-859.7, Biology (General), QH301-705.5
الوصف: Abstract Background Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs. Results To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot. Conclusion Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5′-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 1471-2105
Relation: http://link.springer.com/article/10.1186/s12859-017-1651-7; https://doaj.org/toc/1471-2105
DOI: 10.1186/s12859-017-1651-7
URL الوصول: https://doaj.org/article/63965fae480e470b887bdcccb0daefa9
رقم الأكسشن: edsdoj.63965fae480e470b887bdcccb0daefa9
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:14712105
DOI:10.1186/s12859-017-1651-7