A transformation-based method for auditing the IS-A hierarchy of biomedical terminologies in the Unified Medical Language System

التفاصيل البيبلوغرافية
العنوان: A transformation-based method for auditing the IS-A hierarchy of biomedical terminologies in the Unified Medical Language System
المؤلفون: Fengbo Zheng, W Jim Zheng, Jay Shi, Yuntao Yang, Licong Cui
المصدر: Journal of the American Medical Informatics Association : JAMIA
بيانات النشر: Oxford University Press (OUP), 2020.
سنة النشر: 2020
مصطلحات موضوعية: AcademicSubjects/SCI01060, 020205 medical informatics, Relation (database), Computer science, Interoperability, Health Informatics, quality assurance, 02 engineering and technology, Research and Applications, SNOMED CT, computer.software_genre, Domain (software engineering), Terminology, 03 medical and health sciences, 0302 clinical medicine, Terminology as Topic, Noun, 0202 electrical engineering, electronic engineering, information engineering, 030212 general & internal medicine, biomedical terminologies, AcademicSubjects/MED00580, Language, Hierarchy (mathematics), business.industry, Unified Medical Language System, Systematized Nomenclature of Medicine, Quality Improvement, Gene Ontology, Artificial intelligence, AcademicSubjects/SCI01530, business, computer, Natural language processing
الوصف: Objective The Unified Medical Language System (UMLS) integrates various source terminologies to support interoperability between biomedical information systems. In this article, we introduce a novel transformation-based auditing method that leverages the UMLS knowledge to systematically identify missing hierarchical IS-A relations in the source terminologies. Materials and Methods Given a concept name in the UMLS, we first identify its base and secondary noun chunks. For each identified noun chunk, we generate replacement candidates that are more general than the noun chunk. Then, we replace the noun chunks with their replacement candidates to generate new potential concept names that may serve as supertypes of the original concept. If a newly generated name is an existing concept name in the same source terminology with the original concept, then a potentially missing IS-A relation between the original and the new concept is identified. Results Applying our transformation-based method to English-language concept names in the UMLS (2019AB release), a total of 39 359 potentially missing IS-A relations were detected in 13 source terminologies. Domain experts evaluated a random sample of 200 potentially missing IS-A relations identified in the SNOMED CT (U.S. edition) and 100 in Gene Ontology. A total of 173 of 200 and 63 of 100 potentially missing IS-A relations were confirmed by domain experts, indicating that our method achieved a precision of 86.5% and 63% for the SNOMED CT and Gene Ontology, respectively. Conclusions Our results showed that our transformation-based method is effective in identifying missing IS-A relations in the UMLS source terminologies.
تدمد: 1527-974X
1067-5027
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::25f24e2a9fbcbfe3e8a801a71c858188
https://doi.org/10.1093/jamia/ocaa123
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....25f24e2a9fbcbfe3e8a801a71c858188
قاعدة البيانات: OpenAIRE