دورية أكاديمية

Online Streaming Feature Selection via Conditional Independence

التفاصيل البيبلوغرافية
العنوان: Online Streaming Feature Selection via Conditional Independence
المؤلفون: Dianlong You, Xindong Wu, Limin Shen, Yi He, Xu Yuan, Zhen Chen, Song Deng, Chuan Ma
المصدر: Applied Sciences, Vol 8, Iss 12, p 2548 (2018)
بيانات النشر: MDPI AG, 2018.
سنة النشر: 2018
المجموعة: LCC:Technology
LCC:Engineering (General). Civil engineering (General)
LCC:Biology (General)
LCC:Physics
LCC:Chemistry
مصطلحات موضوعية: streaming feature, feature selection, conditional independence, markov blanket, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
الوصف: Online feature selection is a challenging topic in data mining. It aims to reduce the dimensionality of streaming features by removing irrelevant and redundant features in real time. Existing works, such as Alpha-investing and Online Streaming Feature Selection (OSFS), have been proposed to serve this purpose, but they have drawbacks, including low prediction accuracy and high running time if the streaming features exhibit characteristics such as low redundancy and high relevance. In this paper, we propose a novel algorithm about online streaming feature selection, named ConInd that uses a three-layer filtering strategy to process streaming features with the aim of overcoming such drawbacks. Through three-layer filtering, i.e., null-conditional independence, single-conditional independence, and multi-conditional independence, we can obtain an approximate Markov blanket with high accuracy and low running time. To validate the efficiency, we implemented the proposed algorithm and tested its performance on a prevalent dataset, i.e., NIPS 2003 and Causality Workbench. Through extensive experimental results, we demonstrated that ConInd offers significant performance improvements in prediction accuracy and running time compared to Alpha-investing and OSFS. ConInd offers 5.62% higher average prediction accuracy than Alpha-investing, with a 53.56% lower average running time compared to that for OSFS when the dataset is lowly redundant and highly relevant. In addition, the ratio of the average number of features for ConInd is 242% less than that for Alpha-investing.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2076-3417
Relation: https://www.mdpi.com/2076-3417/8/12/2548; https://doaj.org/toc/2076-3417
DOI: 10.3390/app8122548
URL الوصول: https://doaj.org/article/02512780d6e9471d9a374f80a874e8f1
رقم الأكسشن: edsdoj.02512780d6e9471d9a374f80a874e8f1
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20763417
DOI:10.3390/app8122548