دورية أكاديمية

Optimized model architectures for deep learning on genomic data.

التفاصيل البيبلوغرافية
العنوان: Optimized model architectures for deep learning on genomic data.
المؤلفون: Gündüz, Hüseyin Anil, Mreches, René, Moosbauer, Julia, Robertson, Gary, To, Xiao-Yin, Franzosa, Eric A., Huttenhower, Curtis, Rezaei, Mina, McHardy, Alice C., Bischl, Bernd, Münch, Philipp C., Binder, Martin
المصدر: Communications Biology; 5/23/2024, Vol. 7 Issue 1, p1-10, 10p
مصطلحات موضوعية: DEEP learning, ARCHITECTURAL design, COMPUTER vision, COMPUTATIONAL biology, VISUAL fields
مستخلص: The success of deep learning in various applications depends on task-specific architecture design choices, including the types, hyperparameters, and number of layers. In computational biology, there is no consensus on the optimal architecture design, and decisions are often made using insights from more well-established fields such as computer vision. These may not consider the domain-specific characteristics of genome sequences, potentially limiting performance. Here, we present GenomeNet-Architect, a neural architecture design framework that automatically optimizes deep learning models for genome sequence data. It optimizes the overall layout of the architecture, with a search space specifically designed for genomics. Additionally, it optimizes hyperparameters of individual layers and the model training procedure. On a viral classification task, GenomeNet-Architect reduced the read-level misclassification rate by 19%, with 67% faster inference and 83% fewer parameters, and achieved similar contig-level accuracy with ~100 times fewer parameters compared to the best-performing deep learning baselines. Introducing GenomeNet-Architect, a neural architecture design framework that automatically optimises the overall layout of the architecture, the hyperparameters, and the training procedure of deep learning models for genome sequence data. [ABSTRACT FROM AUTHOR]
Copyright of Communications Biology is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:23993642
DOI:10.1038/s42003-024-06161-1