دورية أكاديمية

Prediction of Gastrointestinal Tract Cancers Using Longitudinal Electronic Health Record Data

التفاصيل البيبلوغرافية
العنوان: Prediction of Gastrointestinal Tract Cancers Using Longitudinal Electronic Health Record Data
المؤلفون: Andrew J. Read, Wenjing Zhou, Sameer D. Saini, Ji Zhu, Akbar K. Waljee
المصدر: Cancers, Vol 15, Iss 5, p 1399 (2023)
بيانات النشر: MDPI AG, 2023.
سنة النشر: 2023
المجموعة: LCC:Neoplasms. Tumors. Oncology. Including cancer and carcinogens
مصطلحات موضوعية: gastrointestinal cancers, prediction model, machine learning, Neoplasms. Tumors. Oncology. Including cancer and carcinogens, RC254-282
الوصف: Background: Luminal gastrointestinal (GI) tract cancers, including esophageal, gastric, small bowel, colorectal, and anal cancers, are often diagnosed at late stages. These tumors can cause gradual GI bleeding, which may be unrecognized but detectable by subtle laboratory changes. Our aim was to develop models to predict luminal GI tract cancers using laboratory studies and patient characteristics using logistic regression and random forest machine learning methods. Methods: The study was a single-center, retrospective cohort at an academic medical center, with enrollment between 2004–2013 and with follow-up until 2018, who had at least two complete blood counts (CBCs). The primary outcome was the diagnosis of GI tract cancer. Prediction models were developed using multivariable single timepoint logistic regression, longitudinal logistic regression, and random forest machine learning. Results: The cohort included 148,158 individuals, with 1025 GI tract cancers. For 3-year prediction of GI tract cancers, the longitudinal random forest model performed the best, with an area under the receiver operator curve (AuROC) of 0.750 (95% CI 0.729–0.771) and Brier score of 0.116, compared to the longitudinal logistic regression model, with an AuROC of 0.735 (95% CI 0.713–0.757) and Brier score of 0.205. Conclusions: Prediction models incorporating longitudinal features of the CBC outperformed the single timepoint logistic regression models at 3-years, with a trend toward improved accuracy of prediction using a random forest machine learning model compared to a longitudinal logistic regression model.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2072-6694
84311800
Relation: https://www.mdpi.com/2072-6694/15/5/1399; https://doaj.org/toc/2072-6694
DOI: 10.3390/cancers15051399
URL الوصول: https://doaj.org/article/dc0fc8bf6ef8431180022da172e623a4
رقم الأكسشن: edsdoj.0fc8bf6ef8431180022da172e623a4
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20726694
84311800
DOI:10.3390/cancers15051399