تقرير
Logical Segmentation of Source Code
العنوان: | Logical Segmentation of Source Code |
---|---|
المؤلفون: | Dormuth, Jacob, Gelman, Ben, Moore, Jessica, Slater, David |
سنة النشر: | 2019 |
المجموعة: | Computer Science Statistics |
مصطلحات موضوعية: | Computer Science - Software Engineering, Computer Science - Machine Learning, Statistics - Machine Learning |
الوصف: | Many software analysis methods have come to rely on machine learning approaches. Code segmentation - the process of decomposing source code into meaningful blocks - can augment these methods by featurizing code, reducing noise, and limiting the problem space. Traditionally, code segmentation has been done using syntactic cues; current approaches do not intentionally capture logical content. We develop a novel deep learning approach to generate logical code segments regardless of the language or syntactic correctness of the code. Due to the lack of logically segmented source code, we introduce a unique data set construction technique to approximate ground truth for logically segmented code. Logical code segmentation can improve tasks such as automatically commenting code, detecting software vulnerabilities, repairing bugs, labeling code functionality, and synthesizing new code. Comment: SEKE2019 Conference Full Paper |
نوع الوثيقة: | Working Paper |
DOI: | 10.18293/SEKE2019-026 |
URL الوصول: | http://arxiv.org/abs/1907.08615 |
رقم الأكسشن: | edsarx.1907.08615 |
قاعدة البيانات: | arXiv |
DOI: | 10.18293/SEKE2019-026 |
---|