An Accuracy-Maximization Approach for Claims Classifiers in Document Content Analytics for Cybersecurity

التفاصيل البيبلوغرافية
العنوان: An Accuracy-Maximization Approach for Claims Classifiers in Document Content Analytics for Cybersecurity
المؤلفون: Kalyan Perumalla, Hamid Sharif-Kashani, Michael Hempel, Juan Lopez Jr., Kimia Ameri
المصدر: Journal of Cybersecurity and Privacy; Volume 2; Issue 2; Pages: 418-443
بيانات النشر: Multidisciplinary Digital Publishing Institute, 2022.
سنة النشر: 2022
مصطلحات موضوعية: natural language processing, BERT, transfer learning, convolution neural network, classification, cybersecurity, CYVET, accuracy maximization
الوصف: This paper presents our research approach and findings towards maximizing the accuracy of our classifier of feature claims for cybersecurity literature analytics, and introduces the resulting model ClaimsBERT. Its architecture, after extensive evaluations of different approaches, introduces a feature map concatenated with a Bidirectional Encoder Representation from Transformers (BERT) model. We discuss deployment of this new concept and the research insights that resulted in the selection of Convolution Neural Networks for its feature mapping aspects. We also present our results showing ClaimsBERT to outperform all other evaluated approaches. This new claims classifier represents an essential processing stage within our vetting framework aiming to improve the cybersecurity of industrial control systems (ICS). Furthermore, in order to maximize the accuracy of our new ClaimsBERT classifier, we propose an approach for optimal architecture selection and determination of optimized hyperparameters, in particular the best learning rate, number of convolutions, filter sizes, activation function, the number of dense layers, as well as the number of neurons and the drop-out rate for each layer. Fine-tuning these hyperparameters within our model led to an increase in classification accuracy from 76% obtained with BertForSequenceClassification’s original model to a 97% accuracy obtained with ClaimsBERT.
وصف الملف: application/pdf
اللغة: English
تدمد: 2624-800X
DOI: 10.3390/jcp2020022
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::afeb4f18b1f3bb186e38efaffab36b32
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....afeb4f18b1f3bb186e38efaffab36b32
قاعدة البيانات: OpenAIRE
الوصف
تدمد:2624800X
DOI:10.3390/jcp2020022