Variance prior forms for high-dimensional Bayesian variable selection

التفاصيل البيبلوغرافية
العنوان: Variance prior forms for high-dimensional Bayesian variable selection
المؤلفون: Veronika Rockova, Edward I. George, Gemma E. Moran
المصدر: Bayesian Anal. 14, no. 4 (2019), 1091-1119
سنة النشر: 2018
مصطلحات موضوعية: FOS: Computer and information sciences, Statistics and Probability, Shrinkage estimator, Spike-and-Slab Lasso, Bayesian probability, 01 natural sciences, Conjugate prior, Methodology (stat.ME), 010104 statistics & probability, Lasso (statistics), 0502 economics and business, Prior probability, Statistics, Linear regression, Statistics::Methodology, 0101 mathematics, penalized likelihood, Statistics - Methodology, 050205 econometrics, Mathematics, Bayesian variable selection, Jeffreys’ priors, Applied Mathematics, 05 social sciences, Linear model, Bayesian shrinkage, Variance (accounting), Statistics::Computation
الوصف: Consider the problem of high dimensional variable selection for the Gaussian linear model when the unknown error variance is also of interest. In this paper, we show that the use of conjugate shrinkage priors for Bayesian variable selection can have detrimental consequences for such variance estimation. Such priors are often motivated by the invariance argument of Jeffreys (1961). Revisiting this work, however, we highlight a caveat that Jeffreys himself noticed; namely that biased estimators can result from inducing dependence between parameters a priori. In a similar way, we show that conjugate priors for linear regression, which induce prior dependence, can lead to such underestimation in the Bayesian high-dimensional regression setting. Following Jeffreys, we recommend as a remedy to treat regression coefficients and the error variance as independent a priori. Using such an independence prior framework, we extend the Spike-and-Slab Lasso of Ročková and George (2018) to the unknown variance case. This extended procedure outperforms both the fixed variance approach and alternative penalized likelihood methods on simulated data. On the protein activity dataset of Clyde and Parmigiani (1998), the Spike-and-Slab Lasso with unknown variance achieves lower cross-validation error than alternative penalized likelihood methods, demonstrating the gains in predictive accuracy afforded by simultaneous error variance estimation. The unknown variance implementation of the Spike-and-Slab Lasso is provided in the publicly available R package SSLASSO (Ročková and Moran, 2017).
وصف الملف: application/pdf
اللغة: English
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dcd1362d18f13b9aed666616d74f41b7
http://arxiv.org/abs/1801.03019
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....dcd1362d18f13b9aed666616d74f41b7
قاعدة البيانات: OpenAIRE