دورية أكاديمية

A generalized l2,p-norm regression based feature selection algorithm

التفاصيل البيبلوغرافية
العنوان: A generalized l2,p-norm regression based feature selection algorithm
المؤلفون: X. Zhi, J. Liu, S. Wu, C. Niu
المصدر: Taylor & Francis Journals, Journal of Applied Statistics. 50(3):703-723
سنة النشر: 2023
الوصف: Feature selection is an important data dimension reduction method, and it has been used widely in applications involving high-dimensional data such as genetic data analysis and image processing. In order to achieve robust feature selection, the latest works apply the $ l_{2,1} $ l2,1 or $ l_{2,p} $ l2,p-norm of matrix to the loss function and regularization terms in regression, and have achieved encouraging results. However, these existing works rigidly set the matrix norms used in the loss function and the regularization terms to the same $ l_{2,1} $ l2,1 or $ l_{2,p} $ l2,p-norm, which limit their applications. In addition, the algorithms for solutions they present either have high computational complexity and are not suitable for large data sets, or cannot provide satisfying performance due to the approximate calculation. To address these problems, we present a generalized $ \textit{l}_{2,p} $ l2,p-norm regression based feature selection ( $ l_{2,p} $ l2,p-RFS) method based on a new optimization criterion. The criterion extends the optimization criterion of ( $ l_{2,p} $ l2,p-RFS) when the loss function and the regularization terms in regression use different matrix norms. We cast the new optimization criterion in a regression framework without regularization. In this framework, the new optimization criterion can be solved using an iterative re-weighted least squares (IRLS) procedure in which the least squares problem can be solved efficiently by using the least square QR decomposition (LSQR) algorithm. We have conducted extensive experiments to evaluate the proposed algorithm on various well-known data sets of both gene expression and image data sets, and compare it with other related feature selection methods.
نوع الوثيقة: redif-article
اللغة: English
DOI: 10.1080/02664763.2021.197
الإتاحة: https://ideas.repec.org/a/taf/japsta/v50y2023i3p703-723.html
رقم الأكسشن: edsrep.a.taf.japsta.v50y2023i3p703.723
قاعدة البيانات: RePEc
الوصف
DOI:10.1080/02664763.2021.197