Academic Journal

Species abundance information improves sequence taxonomy classification accuracy.

Bibliographic Details
Title: Species abundance information improves sequence taxonomy classification accuracy.
Authors: Kaehler BD; Research School of Biology, Australian National University, Canberra, Australia. b.kaehler@adfa.edu.au.; School of Science, University of New South Wales, Canberra, Australia. b.kaehler@adfa.edu.au., Bokulich NA; Center for Applied Microbiome Science, The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA. nicholas.bokulich@nau.edu.; Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA. nicholas.bokulich@nau.edu., McDonald D; Department of Pediatrics, University of California San Diego, La Jolla, CA, USA., Knight R; Department of Pediatrics, University of California San Diego, La Jolla, CA, USA.; Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA.; Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA., Caporaso JG; Center for Applied Microbiome Science, The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA. gregcaporaso@gmail.com.; Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA. gregcaporaso@gmail.com., Huttley GA; Research School of Biology, Australian National University, Canberra, Australia. gavin.huttley@anu.edu.au.
Source: Nature communications [Nat Commun] 2019 Oct 11; Vol. 10 (1), pp. 4643. Date of Electronic Publication: 2019 Oct 11.
Publication Type: Journal Article; Research Support, Non-U.S. Gov't; Research Support, U.S. Gov't, Non-P.H.S.
Language: English
Journal Info: Publisher: Nature Pub. Group Country of Publication: England NLM ID: 101528555 Publication Model: Electronic Cited Medium: Internet ISSN: 2041-1723 (Electronic) Linking ISSN: 20411723 NLM ISO Abbreviation: Nat Commun Subsets: MEDLINE
Imprint Name(s): Original Publication: [London] : Nature Pub. Group
MeSH Terms: Phylogeny*, Microbiota/*genetics, Bacteria/genetics ; Classification/methods ; Computational Biology ; Metagenomics/methods ; Population Density ; Software
Abstract: Popular naive Bayes taxonomic classifiers for amplicon sequences assume that all species in the reference database are equally likely to be observed. We demonstrate that classification accuracy degrades linearly with the degree to which that assumption is violated, and in practice it is always violated. By incorporating environment-specific taxonomic abundance information, we demonstrate a significant increase in the species-level classification accuracy across common sample types. At the species level, overall average error rates decline from 25% to 14%, which is favourably comparable to the error rates that existing classifiers achieve at the genus level (16%). Our findings indicate that for most practical purposes, the assumption that reference species are equally likely to be observed is untenable. q2-clawback provides a straightforward alternative for samples from common environments.
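Illustrative note: the effect described in the abstract can be seen in a toy k-mer naive Bayes classifier, where the only change is the class prior. The sketch below is not the q2-clawback implementation; the function names (`kmers`, `train`, `classify`), the two reference sequences, and the abundance weights are all hypothetical and chosen only to show how an abundance-informed prior can separate two species whose amplified region is identical.

```python
import math
from collections import Counter


def kmers(seq, k=3):
    """Return the overlapping k-mers of a sequence."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def train(reference, k=3, alpha=1.0):
    """Estimate per-taxon k-mer log-likelihoods with additive smoothing."""
    vocab = sorted({km for seq in reference.values() for km in kmers(seq, k)})
    model = {}
    for taxon, seq in reference.items():
        counts = Counter(kmers(seq, k))
        # One extra smoothing slot stands in for any k-mer outside the vocabulary.
        total = sum(counts.values()) + alpha * (len(vocab) + 1)
        model[taxon] = {km: math.log((counts[km] + alpha) / total) for km in vocab}
        model[taxon][None] = math.log(alpha / total)
    return model


def classify(query, model, priors, k=3):
    """Score each taxon as log prior + summed k-mer log-likelihoods."""
    scores = {}
    for taxon, loglik in model.items():
        score = math.log(priors[taxon])
        for km in kmers(query, k):
            score += loglik.get(km, loglik[None])
        scores[taxon] = score
    return max(scores, key=scores.get), scores


# Hypothetical reference: two species that share the same amplicon sequence,
# so the likelihood term cannot tell them apart.
reference = {"species_a": "ACGTACGTAC", "species_b": "ACGTACGTAC"}
model = train(reference)

# Assumed environment-specific abundances (illustrative values only).
weighted = {"species_a": 0.1, "species_b": 0.9}

label, scores = classify("ACGTACGT", model, weighted)
print(label)  # species_b: the likelihoods tie, so the prior decides
```

With uniform priors the two scores are equal and the choice between the species is arbitrary; with the assumed weights the more abundant species wins by exactly the log ratio of the priors.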
Grant Information: P30 MH062512 United States MH NIMH NIH HHS
Entry Date(s): Date Created: 20191013 Date Completed: 20200204 Latest Revision: 20210110
Update Code: 20231215
PubMed Central ID: PMC6789115
DOI: 10.1038/s41467-019-12669-6
PMID: 31604942
Database: MEDLINE