WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking

التفاصيل البيبلوغرافية
العنوان: WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking
المؤلفون: Trias, Carl De Sousa, Mitrea, Mihai, Fiandrotti, Attilio, Cagnazzo, Marco, Chaudhuri, Sumanta, Tartaglione, Enzo
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Multimedia
الوصف: Nowadays, deep neural networks are used for solving complex tasks in several critical applications and protecting both their integrity and intellectual property rights (IPR) has become of utmost importance. To this end, we advance WaterMAS, a substitutive, white-box neural network watermarking method that improves the trade-off among robustness, imperceptibility, and computational complexity, while making provisions for increased data payload and security. WasterMAS insertion keeps unchanged the watermarked weights while sharpening their underlying gradient space. The robustness is thus ensured by limiting the attack's strength: even small alterations of the watermarked weights would impact the model's performance. The imperceptibility is ensured by inserting the watermark during the training process. The relationship among the WaterMAS data payload, imperceptibility, and robustness properties is discussed. The secret key is represented by the positions of the weights conveying the watermark, randomly chosen through multiple layers of the model. The security is evaluated by investigating the case in which an attacker would intercept the key. The experimental validations consider 5 models and 2 tasks (VGG16, ResNet18, MobileNetV3, SwinT for CIFAR10 image classification, and DeepLabV3 for Cityscapes image segmentation) as well as 4 types of attacks (Gaussian noise addition, pruning, fine-tuning, and quantization). The code will be released open-source upon acceptance of the article.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2409.03902
رقم الأكسشن: edsarx.2409.03902
قاعدة البيانات: arXiv