دورية أكاديمية

Malware Classification Using Probability Scoring and Machine Learning

التفاصيل البيبلوغرافية
العنوان: Malware Classification Using Probability Scoring and Machine Learning
المؤلفون: Di Xue, Jingmei Li, Tu Lv, Weifei Wu, Jiaxiang Wang
المصدر: IEEE Access, Vol 7, Pp 91641-91656 (2019)
بيانات النشر: IEEE, 2019.
سنة النشر: 2019
المجموعة: LCC:Electrical engineering. Electronics. Nuclear engineering
مصطلحات موضوعية: Grayscale image, native API call, malware, machine learning, probability scoring, static and dynamic analysis, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
الوصف: Malware classification plays an important role in tracing the attack sources of computer security. However, existing static analysis methods are fast in classification, but they are inefficient in some malware using packing and obfuscation techniques; the dynamic analysis methods have better universality for packing and obfuscation, but they will cause excessive classification cost. To overcome these shortcomings, in this paper, we propose a classification system Malscore based on the probability scoring and machine learning, which sets the probability threshold to concatenate static analysis (called Phase 1) and dynamic analysis (called Phase 2). The convolutional neural networks with spatial pyramid pooling were used to analyze the grayscale images (static features) in Phase 1, and the variable n-grams and machine learning were used to analyze the native API call sequences (dynamic features) in Phase 2. Malscore combined static analysis with dynamic analysis not only accelerated the static analysis process by taking advantage of the CNN in image recognition but also appeared to be more resilient to obfuscation by the dynamic analysis. Different from other static and dynamic analysis techniques, when malware is detected, due to the fact that malware will most likely be labeled only by static analysis, we could reduce the overheads by dynamically analyzing a few malware that has less obvious features or greater confusion in static analysis. We performed experiments on 174607 malware samples from 63 malware families. The result showed that Malscore achieved 98.82% accuracy for malware classification. Furthermore, Malscore was compared with the method of using static and dynamic analysis. The preprocessing and test time represented a reduction of 59.58% and 61.70%, respectively.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2169-3536
Relation: https://ieeexplore.ieee.org/document/8758215/; https://doaj.org/toc/2169-3536
DOI: 10.1109/ACCESS.2019.2927552
URL الوصول: https://doaj.org/article/49f51d4857204a9d91767ba03be0e558
رقم الأكسشن: edsdoj.49f51d4857204a9d91767ba03be0e558
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21693536
DOI:10.1109/ACCESS.2019.2927552