Zero-Shot Pill-Prescription Matching With Graph Convolutional Network and Contrastive Learning

Bibliographic Details
Title:	Zero-Shot Pill-Prescription Matching With Graph Convolutional Network and Contrastive Learning
Authors:	Trung Thanh Nguyen, Phi Le Nguyen, Yasutomo Kawanishi, Takahiro Komamizu, Ichiro Ide
Source:	IEEE Access, Vol 12, Pp 55889-55904 (2024)
Publication Information:	IEEE, 2024.
Publication Year:	2024
Collection:	LCC:Electrical engineering. Electronics. Nuclear engineering
Subject Terms:	Contrastive learning, graph convolutional network, object detection, pill-prescription matching, text-image matching, zero-shot learning, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Description:	Patient safety is paramount in healthcare, and reducing medication errors is essential to improving it. A promising solution is to develop automated systems that help patients verify their pill intake and catch mistakes. This paper investigates a pill-prescription matching task that seeks to associate pills in a multi-pill photo with their corresponding names in the prescription. We specifically aim to overcome the limitations of existing pill detection methods when faced with unseen pills, a situation characteristic of zero-shot learning. We propose a novel method named Zero-PIMA (Zero-shot Pill-Prescription Matching), designed to match pill images with prescription names effectively, even for pills not included in the training dataset. Zero-PIMA is an end-to-end model that includes an object localization module to locate pills and extract their image features, and a graph convolutional network to capture the spatial relationship of the pills’ text in the prescription. We then leverage the contrastive learning paradigm to increase the distance between mismatched pill image and pill name pairs while minimizing the distance between matched pairs. In addition, to deal with the zero-shot pill detection problem, we leverage pills’ metadata retrieved from the DrugBank database to fine-tune a pre-trained text encoder, thereby incorporating visual information about pills (e.g., shape, color) into their names, making them more informative and ultimately enhancing pill image-name matching accuracy. Extensive experiments on our collected real-world VAIPEPP dataset of multi-pill photos and prescriptions show that the proposed method outperforms other methods for both seen and unseen pills in terms of mean average precision. These results indicate that the proposed method could reduce medication errors and improve patient safety.
Document Type:	article
File Description:	electronic resource
Language:	English
ISSN:	2169-3536
Relation:	https://ieeexplore.ieee.org/document/10504270/; https://doaj.org/toc/2169-3536
DOI:	10.1109/ACCESS.2024.3390153
Access URL:	https://doaj.org/article/4f61dc8b4b0147f69fc440115c0d35e3
Accession Number:	edsdoj.4f61dc8b4b0147f69fc440115c0d35e3
Database:	Directory of Open Access Journals
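
Illustrative note: the description says contrastive learning pulls matched pill image and pill name pairs together and pushes mismatched pairs apart. The record does not state the loss the authors use, so the sketch below swaps in a generic margin-based pairwise contrastive loss as an assumption. The function names, the `margin` parameter, and the 2-D embeddings are all hypothetical and chosen only for illustration; this is not the Zero-PIMA implementation.

```python
import math


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def contrastive_loss(image_emb, name_emb, is_match, margin=1.0):
    """Generic margin-based contrastive loss for one (image, name) pair.

    Assumption: this is a textbook pairwise formulation, not the loss
    from the paper. Matched pairs are penalized by their squared
    distance; mismatched pairs are penalized only while they sit closer
    together than `margin`.
    """
    d = euclidean_distance(image_emb, name_emb)
    if is_match:
        return d ** 2
    return max(0.0, margin - d) ** 2


# Hypothetical 2-D embeddings, invented purely for illustration.
pill_image = [0.0, 0.0]
near_name = [0.3, 0.4]  # about 0.5 away from the image embedding
far_name = [0.6, 0.8]   # about 1.0 away from the image embedding

# A matched pair is penalized for any remaining distance.
matched_loss = contrastive_loss(pill_image, near_name, is_match=True)
# A mismatched pair inside the margin is penalized, pushing it apart.
mismatched_close = contrastive_loss(pill_image, near_name, is_match=False)
# A mismatched pair at or beyond the margin contributes (almost) nothing.
mismatched_far = contrastive_loss(pill_image, far_name, is_match=False)
```

Minimizing the first term shrinks the distance between matched pairs, while the second term stops contributing once a mismatched pair is at least `margin` apart, which is the behaviour the description attributes to the contrastive objective.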
