دورية أكاديمية

Introducing zoid: A mixture model and R package for modeling proportional data with zeros and ones in ecology.

التفاصيل البيبلوغرافية
العنوان: Introducing zoid: A mixture model and R package for modeling proportional data with zeros and ones in ecology.
المؤلفون: Jensen AJ; School of Marine and Environmental Affairs, University of Washington, Seattle, Washington, USA., Kelly RP; School of Marine and Environmental Affairs, University of Washington, Seattle, Washington, USA., Anderson EC; Fisheries Ecology Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic & Atmospheric Administration, Santa Cruz, California, USA., Satterthwaite WH; Fisheries Ecology Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic & Atmospheric Administration, Santa Cruz, California, USA., Shelton AO; Conservation Biology Division, Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, Washington, USA., Ward EJ; Conservation Biology Division, Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, Washington, USA.
المصدر: Ecology [Ecology] 2022 Nov; Vol. 103 (11), pp. e3804. Date of Electronic Publication: 2022 Aug 17.
نوع المنشور: Journal Article; Research Support, Non-U.S. Gov't
اللغة: English
بيانات الدورية: Publisher: Ecological Society of America Country of Publication: United States NLM ID: 0043541 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1939-9170 (Electronic) Linking ISSN: 00129658 NLM ISO Abbreviation: Ecology Subsets: MEDLINE
أسماء مطبوعة: Publication: Washington, DC : Ecological Society of America
Original Publication: Brooklyn, NY : Brooklyn Botanical Garden
مواضيع طبية MeSH: Models, Statistical* , Software*, Animals ; Fisheries ; Research Design ; Salmon
مستخلص: Many ecological data sets are proportional, representing mixtures of constituent elements such as species, populations, or strains. Analyses of proportional data are challenged by categories with zero observations (zeros), all observations (ones), and overdispersion. In lieu of ad hoc data adjustments, we describe and evaluate a zero-and-one inflated Dirichlet regression model, with its corresponding R package (zoid), capable of handling observed data x $$ x $$ consisting of three possible categories: zeros, proportions, or ones. Instead of fitting the model to observations of single biological units (e.g., individual organisms) within a sample, we sum proportional contributions across units and estimate mixture proportions using one aggregated observation per sample. Optional estimation of overdispersion and covariate influences expand model applications. We evaluate model performance, as implemented in Stan, using simulations and two ecological case studies. We show that zoid successfully estimates mixture proportions using simulated data with varying sample sizes and is robust to overdispersion and covariate structure. In empirical case studies, we estimate the composition of a mixed-stock Chinook salmon (Oncorhynchus tshawytscha) fishery and analyze the stomach contents of Atlantic cod (Gadus morhua). Our implementation of the model as an R package facilitates its application to varied ecological data sets composed of proportional observations.
(© 2022 The Ecological Society of America.)
References: Berger, J. O., J. M. Bernardo, and D. Sun. 2015. “Overall Objective Priors.” Bayesian Analysis 10: 189-221.
Bolker, B. M., T. Okuyama, K. A. Bjorndal, and A. B. Bolten. 2007. “Incorporating Multiple Mixed Stocks in Mixed Stock Analysis: ‘Many-to-Many’ Analyses.” Molecular Ecology 16: 685-95.
Cutler, D. R., T. C. Edwards, Jr., K. H. Beard, A. Cutler, K. T. Hess, J. Gibson, and J. J. Lawler. 2007. “Random Forests for Classification in Ecology.” Ecology 88: 2783-92.
Douma, J. C., and J. T. Weedon. 2019. “Analyzing Continuous Proportions in Ecology and Evolution: A Practical Introduction to Beta and Dirichlet Regression.” Methods in Ecology and Evolution 10: 1412-30.
Freshwater, C., S. C. Anderson, T. D. Beacham, W. Luedke, C. Wor, and J. King. 2021. “An Integrated Model of Seasonal Changes in Stock Composition and Abundance with an Application to Chinook Salmon.” PeerJ 9: e11163.
Gabry, J., D. Simpson, A. Vehtari, M. Betancourt, and A. Gelman. 2019. “Visualization in Bayesian Workflow.” Journal of the Royal Statistical Society, Series A 182: 389-402.
Gelman, A., J. B. Carlin, H. S. Stern, D. B. Dunson, A. Vehtari, and D. B. Rubin. 2014. Bayesian Data Analysis. Third edition. New York: Chapman and Hall/CRC.
Koljonen, M.-L., J. J. Pella, and M. Masuda. 2005. “Classical Individual Assignments Versus Mixture Modeling to Estimate Stock Prportions in Atlantic Salmon (Salmo salar) Catches from DNA Microsatellite Data.” Canadian Journal of Fisheries and Aquatic Sciences 62: 2143-58.
Liu, F. 2021. “Zoib: Bayesian Inference for Beta Regression and Zero-or-One Inflated Beta Regression.” R package version 1.5.5. https://CRAN.R-project.org/package=zoib.
Loosvelt, L., J. Peters, H. Skriver, H. Lievens, F. M. B. Van Coillie, B. De Baets, and N. E. C. Verhoest. 2012. “Random Forests as a Tool for Estimating Uncertainty at Pixel-Level in SAR Image Classification.” International Journal of Applied Earth Observation and Geoinformation 19: 173-84.
Magnussen, E. 2011. “Food and Feeding Habits of Cod (Gadus morhua) on the Faroe Bank.” ICES Journal of Marine Science 68: 1909-17.
Moran, P., J. Dazey, L. LaVoy, and S. Young. 2018. “Genetic Mixture Analysis Supports Recalibration of the Fishery Regulation Assessment Model.” Fisheries 43: 83-97.
Ospina, R., and S. L. Ferrari. 2012. “A General Class of Zero-or-One Inflated Beta Regression Models.” Computational Statistics and Data Analysis 56: 1609-23.
Paanenen, T., J. Piironen, P.-C. Burkner, and A. Vehtari. 2021. “Implicitly Adaptive Importance Sampling.” Statistics and Computing 31: 16.
R Core Team. 2019. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.
Satterthwaite, W. H., J. Ciancio, E. Crandall, M. L. Palmer-Zwahlen, A. M. Grover, M. R. O'Farrell, E. C. Anderson, M. S. Mohr, and J. C. Garza. 2015. “Stock Composition and Ocean Spatial Distribution from California Recreational Chinook Salmon Fisheries Using Genetic Stock Identification.” Fisheries Research 170: 166-78.
Smithson, M., and J. Verkuilen. 2006. “A Better Lemon Squeezer? Maximum-Likelihood Regression with Beta-Distributed Dependent Variables.” Psychological Methods 11: 54-71.
Stan Development Team. 2020. “RStan: The R Interface to Stan.” R package version 2.21.2. https://mc-stan.org.
Stan Development Team. 2021. “Stan Modeling Language Users Guide and Reference Manual.” Version 2.27. https://mc-stan.org.
Tang, Z.-Z., and G. Chen. 2019. “Zero-Inflated Generalized Dirichlet Multinomial Regression Model for Microbiome Compositional Data Analysis.” Biostatistics 20: 698-713.
Thorson, J. T., M. L. Arimitsu, T. Levi, and G. H. Roffler. 2022. “Diet Analysis Using Generalized Linear Models Derived from Foraging Processes Using r Package Mvtweedie.” Ecology 103: e3637.
Thorson, J. T., K. Johnson, and R. D. Methot. 2017. “Model-Based Estimates of Effective Sample Size in Stock Assessment Models Using the Dirichlet-Multinomial Distribution.” Fisheries Research 192: 84-93.
Tsagris, M., and C. Steward. 2018. “A Dirichlet Regression Model for Compositional Data with Zeroes.” Lobachevskii Journal of Mathematics 39: 398-412.
Utter, F., and N. Ryman. 1993. “Genetic Markers and Mixed Stock Fisheries.” Fisheries 18: 11-21.
Vehtari, A., J. Gabry, M. Magnusson, Y. Yao, P. Burkner, T. Paanenen, and A. Gelman. 2020. “loo: Efficient Leave-One-Out Cross-Validation and WAIC for Bayesian Models.” R package version 2.4.1. https://mc-stan.org/loo.
Vehtari, A., A. Gelman, D. Simpson, B. Carpenter, and P.-C. Bürkner. 2021. “Rank-Normalization, Folding, and Localization: An Improved R^ for Assessing Convergence of MCMC (with Discussion).” Bayesian Analysis 16: 667-718.
Ward, E., and O. Shelton. 2022. “Nwfsc-cb/zoid: CRAN Update.” Zenodo. https://doi.org/10.5281/zenodo.6028506.
Warton, D. I., and F. K. C. Hui. 2011. “The Arcsine Is Asinine: The Analysis of Proportions in Ecology.” Ecology 92: 3-10.
Wright, W. J., K. M. Irvine, J. M. Warren, and J. K. Barnett. 2017. “Statistical Design and Analysis for Plant Cover Studies with Multiple Sources of Observation Errors.” Methods in Ecology and Evolution 8: 1832-41.
فهرسة مساهمة: Keywords: diet composition; mixed-stock fishery; mixture model; overdispersion; proportional data; zero-and-one inflated Dirichlet regression
تواريخ الأحداث: Date Created: 20220709 Date Completed: 20221107 Latest Revision: 20221215
رمز التحديث: 20221216
DOI: 10.1002/ecy.3804
PMID: 35804486
قاعدة البيانات: MEDLINE
الوصف
تدمد:1939-9170
DOI:10.1002/ecy.3804