Genetic effect estimates in case‐control studies when a continuous variable is omitted from the model

التفاصيل البيبلوغرافية
العنوان: Genetic effect estimates in case‐control studies when a continuous variable is omitted from the model
المؤلفون: Chiung Yu Huang, Iryna Lobach, Ying Sheng, Lydia B. Zablotska, Siarhei Lobach
المصدر: Genetic Epidemiology, vol 44, iss 3
بيانات النشر: Wiley, 2020.
سنة النشر: 2020
مصطلحات موضوعية: omitted continuous variable, bias, Epidemiology, case-control study, Logistic regression, 01 natural sciences, Continuous variable, 010104 statistics & probability, 03 medical and health sciences, Neuroimaging, Statistics, Genetics, 2.1 Biological and endogenous factors, odds ratio, Aetiology, 0101 mathematics, Genetics (clinical), Mathematics, 030304 developmental biology, 0303 health sciences, Human Genome, 030305 genetics & heredity, Case-control study, Omitted-variable bias, Conditional probability distribution, Odds ratio, Alzheimer's disease, Distribution (mathematics), Public Health and Health Services, False positive rate, Psychology
الوصف: Large-scale genome-wide analyses scans provide massive volumes of genetic variants on large number of cases and controls that can be used to estimate the genetic effects. Yet, the sets of non-genetic variables available in publicly available databases are often brief. It is known that omitting a continuous variable from a logistic regression model can result in biased estimates of odds ratios (OR) (e.g., Gail et al (1984), Neuhaus et al (1993), Hauck et al (1991), Zeger et al (1988)). We are interested to assess what information is needed to recover the bias in the OR estimate of genotype due to omitting a continuous variable in settings when the actual values of the omitted variable are not available. We derive two estimating procedures that can recover the degree of bias based on a conditional density of the omitted variable or knowing the distribution of the omitted variable. Importantly, our derivations show that omitting a continuous variable can result in either under- or over-estimation of the genetic effects. We performed extensive simulation studies to examine bias, variability, false positive rate, and power in the model that omits a continuous variable. We show the application to two genome-wide studies of Alzheimer’s disease.Data Availability StatementThe data that support the findings of this study are openly available in the Database of Genotypes and Phenotypes at [https://www.ncbi.nlm.nih.gov/projects/gap/cgibin/study.cgi?study_id=phs000372.v1.p1], reference number [phs000372.v1.p1] and at the Alzheimer’s Disease Neuroimaging Initiative http://adni.loni.usc.edu/.
تدمد: 1098-2272
0741-0395
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c6bed84868f35b17981746c796701c72
https://doi.org/10.1002/gepi.22278
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....c6bed84868f35b17981746c796701c72
قاعدة البيانات: OpenAIRE