A maximin optimal approach for sampling designs in two-phase studies

Bibliographic details
Title: A maximin optimal approach for sampling designs in two-phase studies
Authors: Wang, Ruoyu; Wang, Qihua; Miao, Wang
Publication year: 2023
Collection: Statistics
Subject terms: Statistics - Methodology
Description: Data collection costs can vary widely across variables in data science tasks. Two-phase designs are often employed to save data collection costs. In two-phase studies, inexpensive variables are collected for all subjects in the first phase, and expensive variables are measured for a subset of subjects in the second phase based on a predetermined sampling rule. The estimation efficiency under two-phase designs relies heavily on the sampling rule. Existing literature primarily focuses on designing sampling rules for estimating a scalar parameter in some parametric models or specific estimating problems. However, real-world scenarios are usually model-unknown and involve two-phase designs for model-free estimation of a scalar or multi-dimensional parameter. This paper proposes a maximin criterion to design an optimal sampling rule based on semiparametric efficiency bounds. The proposed method is model-free and applicable to general estimating problems. The resulting sampling rule can minimize the semiparametric efficiency bound when the parameter is scalar and improve the bound for every component when the parameter is multi-dimensional. Simulation studies demonstrate that the proposed designs reduce the variance of the resulting estimator in various settings. The implementation of the proposed design is illustrated in a real data analysis.
Document type: Working Paper
Access URL: http://arxiv.org/abs/2312.10596
Accession number: edsarx.2312.10596
Database: arXiv