دورية أكاديمية

Comparison of different scoring methods based on latent variable models of the PHQ-9: an individual participant data meta-analysis.

التفاصيل البيبلوغرافية
العنوان: Comparison of different scoring methods based on latent variable models of the PHQ-9: an individual participant data meta-analysis.
المؤلفون: Fischer, Felix, Levis, Brooke, Falk, Carl, Sun, Ying, Ioannidis, John P. A., Cuijpers, Pim, Shrier, Ian, Benedetti, Andrea, Thombs, Brett D., the Depression Screening Data (DEPRESSD) PHQ Collaboration, He, Chen, Krishnan, Ankur, Wu, Yin, Negeri, Zelalem, Bhandari, Parash Mani, Neupane, Dipika, Rice, Danielle B., Riehm, Kira E., Saadat, Nazanin, Azar, Marleine
المصدر: Psychological Medicine; Nov2022, Vol. 52 Issue 15, p3472-3483, 12p
مصطلحات موضوعية: MEDICAL screening, MENTAL depression, QUESTIONNAIRES, DESCRIPTIVE statistics, STATISTICAL models, DATA analysis software, SECONDARY analysis
مستخلص: Background: Previous research on the depression scale of the Patient Health Questionnaire (PHQ-9) has found that different latent factor models have maximized empirical measures of goodness-of-fit. The clinical relevance of these differences is unclear. We aimed to investigate whether depression screening accuracy may be improved by employing latent factor model-based scoring rather than sum scores. Methods: We used an individual participant data meta-analysis (IPDMA) database compiled to assess the screening accuracy of the PHQ-9. We included studies that used the Structured Clinical Interview for DSM (SCID) as a reference standard and split those into calibration and validation datasets. In the calibration dataset, we estimated unidimensional, two-dimensional (separating cognitive/affective and somatic symptoms of depression), and bi-factor models, and the respective cut-offs to maximize combined sensitivity and specificity. In the validation dataset, we assessed the differences in (combined) sensitivity and specificity between the latent variable approaches and the optimal sum score (⩾10), using bootstrapping to estimate 95% confidence intervals for the differences. Results: The calibration dataset included 24 studies (4378 participants, 652 major depression cases); the validation dataset 17 studies (4252 participants, 568 cases). In the validation dataset, optimal cut-offs of the unidimensional, two-dimensional, and bi-factor models had higher sensitivity (by 0.036, 0.050, 0.049 points, respectively) but lower specificity (0.017, 0.026, 0.019, respectively) compared to the sum score cut-off of ⩾10. Conclusions: In a comprehensive dataset of diagnostic studies, scoring using complex latent variable models do not improve screening accuracy of the PHQ-9 meaningfully as compared to the simple sum score approach. [ABSTRACT FROM AUTHOR]
Copyright of Psychological Medicine is the property of Cambridge University Press and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:00332917
DOI:10.1017/S0033291721000131