Academic Journal
CodonBERT large language model for mRNA vaccines.
Title: | CodonBERT large language model for mRNA vaccines. |
---|---|
Authors: | Li S; Sanofi., Moayedpour S; Sanofi., Li R; Sanofi., Bailey M; Sanofi., Riahi S; Sanofi., Kogler-Anele L; Sanofi., Miladi M; Sanofi., Miner J; Sanofi., Pertuy F; Sanofi., Zheng D; Sanofi., Wang J; Sanofi., Balsubramani A; Sanofi., Tran K; Sanofi., Zacharia M; Sanofi., Wu M; Sanofi., Gu X; Sanofi., Clinton R; Sanofi., Asquith C; Sanofi., Skaleski J; Sanofi., Boeglin L; Sanofi., Chivukula S; Sanofi., Dias A; Sanofi., Strugnell T; Sanofi., Montoya FU; Sanofi., Agarwal V; Sanofi., Bar-Joseph Z; Sanofi zivbj@cs.cmu.edu., Jager S; Sanofi. |
Source: | Genome research [Genome Res] 2024 Jul 01. Date of Electronic Publication: 2024 Jul 01. |
Publication Model: | Ahead of Print |
Publication Type: | Journal Article |
Language: | English |
Journal Info: | Publisher: Cold Spring Harbor Laboratory Press Country of Publication: United States NLM ID: 9518021 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1549-5469 (Electronic) Linking ISSN: 1088-9051 NLM ISO Abbreviation: Genome Res Subsets: MEDLINE |
Imprint Name(s): | Original Publication: Cold Spring Harbor, N.Y. : Cold Spring Harbor Laboratory Press, c1995- |
Abstract: | mRNA-based vaccines and therapeutics are gaining popularity and usage across a wide range of conditions. One of the critical issues when designing such mRNAs is sequence optimization. Even small proteins or peptides can be encoded by an enormously large number of mRNAs. The actual mRNA sequence can have a large impact on several properties including expression, stability, immunogenicity, and more. To enable the selection of an optimal sequence, we developed CodonBERT, a large language model (LLM) for mRNAs. Unlike prior models, CodonBERT uses codons as inputs, which enables it to learn better representations. CodonBERT was trained using more than 10 million mRNA sequences from a diverse set of organisms. The resulting model captures important biological concepts. CodonBERT can also be extended to perform prediction tasks for various mRNA properties. CodonBERT outperforms previous mRNA prediction methods including on a new flu vaccine dataset. (Published by Cold Spring Harbor Laboratory Press.) |
Entry Date(s): | Date Created: 20240701 Latest Revision: 20240701 |
Update Code: | 20240702 |
DOI: | 10.1101/gr.278870.123 |
PMID: | 38951026 |
Database: | MEDLINE |
ISSN: | 1549-5469 |
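
The abstract makes two points that a small example may help illustrate: a short peptide can be encoded by a very large number of mRNAs, and CodonBERT takes codons rather than single nucleotides as input. The Python sketch below is illustrative only and is not CodonBERT's actual tokenizer or code. The names `split_into_codons`, `count_synonymous_mrnas`, and `CODON_DEGENERACY` are hypothetical, and the counts assume the standard genetic code with the stop codon excluded.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code
# (assumption: standard code; the 61 sense codons are covered, stops excluded).
CODON_DEGENERACY = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def split_into_codons(mrna):
    """Split a coding sequence into 3-nucleotide codon tokens (hypothetical helper)."""
    seq = mrna.upper().replace("T", "U")  # accept DNA-style input as well
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def count_synonymous_mrnas(protein):
    """Count the coding sequences that translate to the given protein."""
    return prod(CODON_DEGENERACY[aa] for aa in protein.upper())


if __name__ == "__main__":
    print(split_into_codons("AUGGCCUUGUAA"))
    print(count_synonymous_mrnas("MAL"))
```

Because the count is a product over residues, it grows exponentially with peptide length, which is why exhaustively enumerating candidate sequences is impractical for all but the shortest peptides.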