Distributed learning optimisation of Cox models can leak patient data: Risks and solutions

التفاصيل البيبلوغرافية
العنوان: Distributed learning optimisation of Cox models can leak patient data: Risks and solutions
المؤلفون: Brink, Carsten, Hansen, Christian Rønn, Field, Matthew, Price, Gareth, Thwaites, David, Sarup, Nis, Bernchou, Uffe, Holloway, Lois
سنة النشر: 2022
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Statistics - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Machine Learning
الوصف: Medical data are often highly sensitive, and frequently there are missing data. Due to the data's sensitive nature, there is an interest in creating modelling methods where the data are kept in each local centre to preserve their privacy, but yet the model can be trained on and learn from data across multiple centres. Such an approach might be distributed machine learning (federated learning, collaborative learning) in which a model is iteratively calculated based on aggregated local model information from each centre. However, even though no specific data are leaving the centre, there is a potential risk that the exchanged information is sufficient to reconstruct all or part of the patient data, which would hamper the safety-protecting rationale idea of distributed learning. This paper demonstrates that the optimisation of a Cox survival model can lead to patient data leakage. Following this, we suggest a way to optimise and validate a Cox model that avoids these problems in a secure way. The feasibility of the suggested method is demonstrated in a provided Matlab code that also includes methods for handling missing data.
Comment: 51 pages, 8 figures
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2204.05856
رقم الأكسشن: edsarx.2204.05856
قاعدة البيانات: arXiv