Academic Journal

CodonBERT large language model for mRNA vaccines.

Bibliographic Details
Title: CodonBERT large language model for mRNA vaccines.
Authors: Li S; Sanofi., Moayedpour S; Sanofi., Li R; Sanofi., Bailey M; Sanofi., Riahi S; Sanofi., Kogler-Anele L; Sanofi., Miladi M; Sanofi., Miner J; Sanofi., Pertuy F; Sanofi., Zheng D; Sanofi., Wang J; Sanofi., Balsubramani A; Sanofi., Tran K; Sanofi., Zacharia M; Sanofi., Wu M; Sanofi., Gu X; Sanofi., Clinton R; Sanofi., Asquith C; Sanofi., Skaleski J; Sanofi., Boeglin L; Sanofi., Chivukula S; Sanofi., Dias A; Sanofi., Strugnell T; Sanofi., Montoya FU; Sanofi., Agarwal V; Sanofi., Bar-Joseph Z; Sanofi zivbj@cs.cmu.edu., Jager S; Sanofi.
Source: Genome research [Genome Res] 2024 Jul 01. Date of Electronic Publication: 2024 Jul 01.
Publication Model: Ahead of Print
Publication Type: Journal Article
Language: English
Journal Info: Publisher: Cold Spring Harbor Laboratory Press Country of Publication: United States NLM ID: 9518021 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1549-5469 (Electronic) Linking ISSN: 10889051 NLM ISO Abbreviation: Genome Res Subsets: MEDLINE
Imprint Name(s): Original Publication: Cold Spring Harbor, N.Y. : Cold Spring Harbor Laboratory Press, c1995-
Abstract: mRNA-based vaccines and therapeutics are gaining popularity and usage across a wide range of conditions. One of the critical issues when designing such mRNAs is sequence optimization. Even small proteins or peptides can be encoded by an enormously large number of mRNAs. The actual mRNA sequence can have a large impact on several properties including expression, stability, immunogenicity, and more. To enable the selection of an optimal sequence, we developed CodonBERT, a large language model (LLM) for mRNAs. Unlike prior models, CodonBERT uses codons as inputs which enables it to learn better representations. CodonBERT was trained using more than 10 million mRNA sequences from a diverse set of organisms. The resulting model captures important biological concepts. CodonBERT can also be extended to perform prediction tasks for various mRNA properties. CodonBERT outperforms previous mRNA prediction methods including on a new flu vaccine dataset.
(Published by Cold Spring Harbor Laboratory Press.)
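
Illustrative note: the abstract makes two points that a short sketch can make concrete, namely that a model can take codons (nucleotide triplets) rather than single nucleotides as input units, and that even a short peptide has a very large number of synonymous mRNA encodings. The Python below is a minimal sketch under stated assumptions, not the CodonBERT implementation: the function names `tokenize_codons`, `translate`, and `count_synonymous` are hypothetical, and the only biological input is the standard genetic code (NCBI translation table 1).

```python
from collections import Counter
from math import prod

# Standard genetic code (NCBI translation table 1), codons ordered U, C, A, G.
# "*" marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Number of codons that encode each amino acid (its degeneracy).
DEGENERACY = Counter(CODON_TABLE.values())


def tokenize_codons(mrna: str) -> list[str]:
    """Split a coding sequence into codon tokens (hypothetical helper).

    Assumes the sequence starts in frame; DNA 'T' is read as RNA 'U'.
    """
    seq = mrna.upper().replace("T", "U")
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(codons: list[str]) -> str:
    """Map codon tokens to one-letter amino acids ('*' for stop)."""
    return "".join(CODON_TABLE[c] for c in codons)


def count_synonymous(protein: str) -> int:
    """Count the distinct coding sequences that encode the given protein."""
    return prod(DEGENERACY[aa] for aa in protein)


if __name__ == "__main__":
    tokens = tokenize_codons("AUGGCUUAA")
    print(tokens, translate(tokens))
    # The count grows multiplicatively with peptide length.
    print(count_synonymous("MLSRLSLR"))
```

Because the count is a product over residues, a peptide rich in six-fold degenerate amino acids (leucine, serine, arginine) reaches millions of candidate sequences within about ten residues, which is the search space that sequence optimization has to navigate.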
Entry Date(s): Date Created: 20240701 Latest Revision: 20240701
Update Code: 20240702
DOI: 10.1101/gr.278870.123
PMID: 38951026
Database: MEDLINE
Description
ISSN:1549-5469
DOI:10.1101/gr.278870.123