Scalable Backdoor Detection in Neural Networks

التفاصيل البيبلوغرافية
العنوان:	Scalable Backdoor Detection in Neural Networks
المؤلفون:	Harikumar, Haripriya, Le, Vuong, Rana, Santu, Bhattacharya, Sourangshu, Gupta, Sunil, Venkatesh, Svetha
سنة النشر:	2020
المجموعة:	Computer Science
مصطلحات موضوعية:	Computer Science - Computer Vision and Pattern Recognition
الوصف:	Recently, it has been shown that deep learning models are vulnerable to Trojan attacks, where an attacker can install a backdoor during training time to make the resultant model misidentify samples contaminated with a small trigger patch. Current backdoor detection methods fail to achieve good detection performance and are computationally expensive. In this paper, we propose a novel trigger reverse-engineering based approach whose computational complexity does not scale with the number of labels, and is based on a measure that is both interpretable and universal across different network and patch types. In experiments, we observe that our method achieves a perfect score in separating Trojaned models from pure models, which is an improvement over the current state-of-the art method.
نوع الوثيقة:	Working Paper
URL الوصول:	http://arxiv.org/abs/2006.05646
رقم الأكسشن:	edsarx.2006.05646
قاعدة البيانات:	arXiv

الوصف
الوصف غير متاح.