Exploiting missing clinical data in Bayesian network modeling for predicting medical problems

Bibliographic details
Title: Exploiting missing clinical data in Bayesian network modeling for predicting medical problems
Authors: Jau-Huei Lin, Peter J. Haug
Source: Journal of Biomedical Informatics. 41:1-14
Publication info: Elsevier BV, 2008.
Publication year: 2008
Subject terms: Decision support system; Medical Records Systems, Computerized; Computer science; Bayesian probability; Problem list; Information Storage and Retrieval; Health Informatics; Machine learning; Risk Assessment; Decision Support Techniques; Pattern Recognition, Automated; Naive Bayes classifier; Bayes' theorem; Artificial intelligence; Risk Factors; Diagnosis, Computer-Assisted; Data element; Bayesian network; Decision Support Systems, Clinical; Missing data; Computer Science Applications; Database Management Systems; Data mining
Description: When machine learning algorithms are applied to data collected during the course of clinical care, it is generally accepted that the data have not been collected consistently. The absence of expected data elements is common, and the mechanism through which a data element is missing often involves the clinical relevance of that data element in a specific patient. Therefore, the absence of data may have information value of its own. In the process of designing an application intended to support a medical problem list, we have studied whether the "missingness" of clinical data can provide useful information in building prediction models. In this study, we experimented with four methods of treating missing values in a clinical data set; two of them explicitly model the absence or "missingness" of data. Each of the resulting data sets was used to build four different kinds of Bayesian classifiers: a naive Bayes structure, a human-composed network structure, and two networks based on structural learning algorithms. We compared performance between the groups with and without explicit models of missingness using the area under the ROC curve. The results showed that in most cases the classifiers trained using the explicit missing value treatments performed better. This result suggests that information may exist in "missingness" itself. Thus, when designing a decision support system, we suggest explicitly representing the presence or absence of data in the underlying logic.
ISSN: 1532-0464
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b387671a2248b9a3bf41c78e5f4772f9
https://doi.org/10.1016/j.jbi.2007.06.001
Rights: OPEN
Accession number: edsair.doi.dedup.....b387671a2248b9a3bf41c78e5f4772f9
Database: OpenAIRE
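
Illustrative sketch (not part of the catalog record, and not the authors' implementation): the description contrasts treatments that ignore missing values with treatments that model "missingness" explicitly, and compares them by area under the ROC curve. The Python sketch below shows one way that comparison could look for a naive Bayes classifier only. Everything in it is an assumption made for illustration: the synthetic cohort, the single `lab` feature, and the names `make_rows`, `encode`, `train`, `predict`, `auc`, and `run_comparison` are hypothetical, and the two treatments shown (skipping a missing value versus adding a "missing" state) are not claimed to match any of the four methods used in the study.

```python
import math
import random
from collections import defaultdict

MISSING = None  # hypothetical marker for a data element that was never recorded


def make_rows(n, seed):
    """Toy cohort (an assumption): a lab test is ordered more often when the problem is present."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        sick = rng.random() < 0.3
        ordered = rng.random() < (0.8 if sick else 0.2)
        if ordered:
            lab = "high" if rng.random() < (0.6 if sick else 0.4) else "normal"
        else:
            lab = MISSING
        rows.append(({"lab": lab}, int(sick)))
    return rows


def encode(value, explicit_missing):
    """Return the state to use, or None when the feature should be skipped."""
    if value is MISSING:
        return "missing" if explicit_missing else None
    return value


def train(rows, explicit_missing):
    """Fit a categorical naive Bayes model by counting."""
    model = {
        "explicit": explicit_missing,
        "classes": defaultdict(int),  # label -> count
        "counts": defaultdict(int),   # (feature, state, label) -> count
        "totals": defaultdict(int),   # (feature, label) -> number of values used
        "states": defaultdict(set),   # feature -> states seen in training
    }
    for feats, label in rows:
        model["classes"][label] += 1
        for name, value in feats.items():
            state = encode(value, explicit_missing)
            if state is None:
                continue
            model["counts"][(name, state, label)] += 1
            model["totals"][(name, label)] += 1
            model["states"][name].add(state)
    return model


def predict(model, feats):
    """Posterior probability of label 1, using Laplace smoothing."""
    n = sum(model["classes"].values())
    log_post = {}
    for label, count in model["classes"].items():
        lp = math.log(count / n)
        for name, value in feats.items():
            state = encode(value, model["explicit"])
            if state is None:
                continue  # a skipped feature contributes no evidence
            k = len(model["states"][name] | {state})
            num = model["counts"][(name, state, label)] + 1
            lp += math.log(num / (model["totals"][(name, label)] + k))
        log_post[label] = lp
    top = max(log_post.values())
    weights = {label: math.exp(lp - top) for label, lp in log_post.items()}
    return weights.get(1, 0.0) / sum(weights.values())


def auc(scores, labels):
    """Area under the ROC curve as the fraction of correctly ordered pairs (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def run_comparison(n_train=2000, n_test=1000):
    train_rows = make_rows(n_train, seed=1)
    test_rows = make_rows(n_test, seed=2)
    labels = [y for _, y in test_rows]
    results = []
    for explicit in (False, True):
        model = train(train_rows, explicit)
        scores = [predict(model, feats) for feats, _ in test_rows]
        results.append(auc(scores, labels))
    return tuple(results)  # (AUC with missing values skipped, AUC with a "missing" state)


if __name__ == "__main__":
    ignored, explicit = run_comparison()
    print(f"AUC, missing values skipped:  {ignored:.3f}")
    print(f"AUC, missingness as a state: {explicit:.3f}")
```

Because the toy cohort is built so that the test is ordered more often for patients who have the problem, the classifier that sees a "missing" state should rank patients better here. That outcome is a property of the synthetic setup, not evidence about real clinical data.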