A fast, resource efficient, and reliable rule-based system for COVID-19 symptom identification

التفاصيل البيبلوغرافية
العنوان: A fast, resource efficient, and reliable rule-based system for COVID-19 symptom identification
المؤلفون: Hongfang Liu, Sijia Liu, Nicholas E. Ingraham, Serguei V. S. Pakhomov, Himanshu S. Sahoo, Christopher J. Tignanelli, Rui Zhang, Genevieve B. Melton, Monica I. Lupei, John Sartori, Benjamin C. Knoll, Michael A. Puskarich, Raymond L. Finzel, Greg M. Silverman
المصدر: JAMIA Open
بيانات النشر: Oxford University Press, 2021.
سنة النشر: 2021
مصطلحات موضوعية: AcademicSubjects/SCI01060, Computer science, Health Informatics, Machine learning, computer.software_genre, Research and Applications, Clinical decision support system, Annotation, Resource (project management), signs, information extraction, natural language processing, clinical decision support systems, and symptoms, business.industry, Unstructured data, Rule-based system, artificial intelligence, follow-up studies, Identification (information), Information extraction, Scalability, Artificial intelligence, AcademicSubjects/SCI01530, business, AcademicSubjects/MED00010, computer
الوصف: Objective With COVID-19, there was a need for a rapidly scalable annotation system that facilitated real-time integration with clinical decision support systems (CDS). Current annotation systems suffer from a high-resource utilization and poor scalability limiting real-world integration with CDS. A potential solution to mitigate these issues is to use the rule-based gazetteer developed at our institution. Materials and Methods Performance, resource utilization, and runtime of the rule-based gazetteer were compared with five annotation systems: BioMedICUS, cTAKES, MetaMap, CLAMP, and MedTagger. Results This rule-based gazetteer was the fastest, had a low resource footprint, and similar performance for weighted microaverage and macroaverage measures of precision, recall, and f1-score compared to other annotation systems. Discussion Opportunities to increase its performance include fine-tuning lexical rules for symptom identification. Additionally, it could run on multiple compute nodes for faster runtime. Conclusion This rule-based gazetteer overcame key technical limitations facilitating real-time symptomatology identification for COVID-19 and integration of unstructured data elements into our CDS. It is ideal for large-scale deployment across a wide variety of healthcare settings for surveillance of acute COVID-19 symptoms for integration into prognostic modeling. Such a system is currently being leveraged for monitoring of postacute sequelae of COVID-19 (PASC) progression in COVID-19 survivors. This study conducted the first in-depth analysis and developed a rule-based gazetteer for COVID-19 symptom extraction with the following key features: low processor and memory utilization, faster runtime, and similar weighted microaverage and macroaverage measures for precision, recall, and f1-score compared to industry-standard annotation systems.
اللغة: English
تدمد: 2574-2531
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ce77adf94890506a48af9f174a538090
http://europepmc.org/articles/PMC8374371
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....ce77adf94890506a48af9f174a538090
قاعدة البيانات: OpenAIRE