Protein Lipograms

التفاصيل البيبلوغرافية
العنوان: Protein Lipograms
المؤلفون: Laurie, Jason, Chattopadhyay, Amit K, Flower, Darren R
المصدر: Journal of Theoretical Biology, vol 430, pg 109, 2017
سنة النشر: 2017
المجموعة: Physics (Other)
Quantitative Biology
مصطلحات موضوعية: Quantitative Biology - Quantitative Methods, Physics - Biological Physics
الوصف: Linguistic analysis of protein sequences is an underexploited technique. Here, we capitalize on the concept of the lipogram to characterize sequences at the proteome levels. A lipogram is a literary composition which omits one or more letters. A protein lipogram likewise omits one or more types of amino acid. In this article, we establish a usable terminology for the decomposition of a sequence collection in terms of the lipogram. Next, we characterize Uniref50 using a lipogram decomposition. At the global level, protein lipograms exhibit power-law properties. A clear correlation with metabolic cost is seen. Finally, we use the lipogram construction to differentiate proteomes between the four branches of the tree-of-life: archaea, bacteria, eukaryotes and viruses. We conclude from this pilot study that the lipogram demonstrates considerable potential as an additional tool for sequence analysis and proteome classification.
Comment: 8 pages, 2 columns, 5 figures
نوع الوثيقة: Working Paper
DOI: 10.1016/j.jtbi.2017.07.009
URL الوصول: http://arxiv.org/abs/1707.08984
رقم الأكسشن: edsarx.1707.08984
قاعدة البيانات: arXiv
الوصف
DOI:10.1016/j.jtbi.2017.07.009