De novo glycan structural identification from mass spectra using tree merging strategy

التفاصيل البيبلوغرافية
العنوان: De novo glycan structural identification from mass spectra using tree merging strategy
المؤلفون: Fusong Ju, Dongbo Bu, Jinyu Zhou, Yan Li, Chuncui Huang, Jingwei Zhang, Hui Wang, Yaojun Wang, Shiwei Sun
المصدر: Computational Biology and Chemistry. 80:217-224
بيانات النشر: Elsevier BV, 2019.
سنة النشر: 2019
مصطلحات موضوعية: 0301 basic medicine, Glycan, Computer science, Biochemistry, 03 medical and health sciences, 0302 clinical medicine, Polysaccharides, Structural Biology, Glycoproteins, Binary tree, biology, Organic Chemistry, Adalimumab, Monosaccharide composition, Dynamic programming, Computational Mathematics, 030104 developmental biology, Tree structure, Carbohydrate Sequence, Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization, 030220 oncology & carcinogenesis, Test set, Mass spectrum, biology.protein, Algorithm, Algorithms
الوصف: Motivation Glycans are large molecules with specific tree structures. Glycans play important roles in a great variety of biological processes. These roles are primarily determined by the fine details of their structures, making glycan structural identification highly desirable. Mass spectrometry (MS) has become the major technology for elucidation of glycan structures. Most de novo approaches to glycan structural identification from mass spectra fall into three categories: enumerating followed by filtering approaches, heuristic and dynamic programming-based approaches. The former suffers from its low efficiency while the latter two suffer from the possibility of missing the actual glycan structures. Thus, how to reliably and efficiently identify glycan structures from mass spectra still remains challenging. Results In this study we propose an efficient and reliable approach to glycan structure identification using tree merging strategy. Briefly, for each MS peak, our approach first calculated monosaccharide composition of its corresponding fragment ion, and then built a constraint that forces these monosaccharides to be directly connected in the underlying glycan tree structure. According to these connecting constraints, we next merged constituting monosaccharides of the glycan into a complete structure step by step. During this process, the intermediate structures were represented as subtrees, which were merged iteratively until a complete tree structure was generated. Finally the generated complete structures were ranked according to their compatibility to the input mass spectra. Unlike the traditional enumerating followed by filtering strategy, our approach performed deisomorphism to remove isomorphic subtrees, and ruled out invalid structures that violates the connection constraints at each tree merging step, thus significantly increasing efficiency. In addition, all complete structures satisfying the connection constraints were enumerated without any missing structure. Over a test set of 10 N-glycan standards, our approach accomplished structural identification in minutes and gave the manually-validated structure first three highest score. We further successfully applied our approach to profiling and subsequent structure assignment of glycans released from glycoprotein mAb, which was in perfect agreement with previous studies and CE analysis.
تدمد: 1476-9271
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::50560c4392a9201579ffd586fc497b5a
https://doi.org/10.1016/j.compbiolchem.2019.03.015
حقوق: CLOSED
رقم الأكسشن: edsair.doi.dedup.....50560c4392a9201579ffd586fc497b5a
قاعدة البيانات: OpenAIRE