Academic Journal

Pragmatic De-Identification of Cross-Domain Unstructured Documents: A Utility-Preserving Approach with Relation Extraction Filtering.

Bibliographic Details
Title: Pragmatic De-Identification of Cross-Domain Unstructured Documents: A Utility-Preserving Approach with Relation Extraction Filtering.
Authors: Nedoshivina L; Halimi A; Bettencourt-Silva J; Braghin S (all: IBM Research Europe, Dublin, Ireland)
Source: AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science [AMIA Jt Summits Transl Sci Proc] 2024 May 31; Vol. 2024, pp. 85-94. Date of Electronic Publication: 2024 May 31 (Print Publication: 2024).
Publication Type: Journal Article
Language: English
Journal Info: Publisher: AMIA; Country of Publication: United States; NLM ID: 101539486; Publication Model: eCollection; Cited Medium: Internet; ISSN: 2153-4063 (Electronic); NLM ISO Abbreviation: AMIA Jt Summits Transl Sci Proc; Subsets: PubMed not MEDLINE
Imprint Name(s): Original Publication: Bethesda, MD : AMIA, [2011]-
Abstract: The volume of information, and in particular personal information, generated each day is increasing at a staggering rate. The ability to leverage such information depends greatly on being able to satisfy the many compliance and privacy regulations that are appearing all over the world. We present READI (Relation Extraction-based Approach for De-Identification), a utility-preserving framework for de-identifying unstructured documents. READI leverages Named Entity Recognition and Relation Extraction technology to improve the quality of entity detection, thus improving the overall quality of the data de-identification process. In this proof-of-concept study, we evaluate the proposed approach on two different datasets and compare it with existing state-of-the-art approaches. We show that READI notably reduces the number of false positives and improves the utility of the de-identified text.
(©2024 AMIA - All rights reserved.)
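Illustrative note: the abstract does not spell out how relation extraction filters detected entities, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that a detected entity is masked only when it is a person or is linked to a person by an extracted relation, which is one way false positives could be reduced. The names `Entity`, `Relation`, `filter_entities`, and `redact`, the labels `PERSON` and `LOCATION`, and the `born_in` relation are all hypothetical, and the entity and relation lists are hand-written stand-ins for real NER and RE model output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    start: int   # character offset where the span begins
    end: int     # character offset just past the span
    label: str   # hypothetical entity type, e.g. "PERSON"


@dataclass(frozen=True)
class Relation:
    head: Entity
    tail: Entity
    kind: str    # hypothetical relation type, e.g. "born_in"


def filter_entities(entities, relations, anchor_label="PERSON"):
    """Keep anchor entities plus any entity tied to an anchor by a relation.

    Assumption: entities with no relation to a person are treated as
    non-identifying and are left in the text to preserve utility.
    """
    keep = {e for e in entities if e.label == anchor_label}
    for r in relations:
        if r.head.label == anchor_label:
            keep.add(r.tail)
        if r.tail.label == anchor_label:
            keep.add(r.head)
    return sorted(keep, key=lambda e: e.start)


def redact(text, entities):
    """Replace each span with a [LABEL] placeholder, working right to left
    so that earlier offsets stay valid."""
    for e in sorted(entities, key=lambda e: e.start, reverse=True):
        text = text[:e.start] + f"[{e.label}]" + text[e.end:]
    return text


text = "John Smith was born in Dublin. Dublin has many hospitals."

# Hand-written stand-ins for NER output (offsets computed from the text).
first = text.index("Dublin")
second = text.index("Dublin", first + 1)
person = Entity(0, len("John Smith"), "PERSON")
city_personal = Entity(first, first + len("Dublin"), "LOCATION")
city_generic = Entity(second, second + len("Dublin"), "LOCATION")
entities = [person, city_personal, city_generic]

# Hand-written stand-in for RE output: only the first "Dublin" is tied to the person.
relations = [Relation(person, city_personal, "born_in")]

print(redact(text, filter_entities(entities, relations)))
```

In this toy input, plain NER-based masking would replace both mentions of "Dublin"; the relation filter masks only the mention tied to the person and leaves the general statement about the city readable.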
Entry Dates: Date Created: 20240603; Latest Revision: 20240604
Update Code: 20240604
PubMed Central ID: PMC11141830
PMID: 38827069
Database: MEDLINE