دورية أكاديمية

The optimal metric for viral genome space

التفاصيل البيبلوغرافية
العنوان: The optimal metric for viral genome space
المؤلفون: Hongyu Yu, Stephen S.-T. Yau
المصدر: Computational and Structural Biotechnology Journal, Vol 23, Iss , Pp 2083-2096 (2024)
بيانات النشر: Elsevier, 2024.
سنة النشر: 2024
المجموعة: LCC:Biotechnology
مصطلحات موضوعية: Alignment-free methods, Feature integration, Natural vector, Optimal metric, Viral genomes, Classification, Biotechnology, TP248.13-248.65
الوصف: Understanding the structural similarity between genomes is pivotal in classification and phylogenetic analysis. As the number of known genomes rockets, alignment-free methods have gained considerable attention. Among these methods, the natural vector method stands out as it represents sequences as vectors using statistical moments, enabling effective clustering based on families in biological taxonomy. However, determining an optimal metric that combines different elements in natural vectors remains challenging due to the absence of a rigorous theoretical framework for weighting different k-mers and orders. In this study, we address this challenge by transforming the determination of optimal weights into an optimization problem and resolving it through gradient-based techniques. Our experimental results underscore the substantial improvement in classification accuracy achieved by employing these optimal weights, reaching an impressive 92.73% on the testing set, surpassing other alignment-free methods. On one hand, our method offers an outstanding metric for virus classification, and on the other hand, it provides valuable insights into feature integration within alignment-free methods.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2001-0370
Relation: http://www.sciencedirect.com/science/article/pii/S200103702400151X; https://doaj.org/toc/2001-0370
DOI: 10.1016/j.csbj.2024.05.005
URL الوصول: https://doaj.org/article/07cdc645e42749c680175460cf766e69
رقم الأكسشن: edsdoj.07cdc645e42749c680175460cf766e69
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20010370
DOI:10.1016/j.csbj.2024.05.005