Constrained nonnegative matrix factorization-based semi-supervised multilabel learning

Bibliographic Details
Title: Constrained nonnegative matrix factorization-based semi-supervised multilabel learning
Authors: Aihong Qin, Guandong Xu, Bin Fu, Dingguo Yu
Source: International Journal of Machine Learning and Cybernetics. 10:1093-1100
Publication Info: Springer Science and Business Media LLC, 2018.
Publication Year: 2018
Subject Terms: Similarity (geometry), Computer science, business.industry, Process (computing), Computational intelligence, Pattern recognition, 02 engineering and technology, Measure (mathematics), Non-negative matrix factorization, Matrix decomposition, Set (abstract data type), ComputingMethodologies_PATTERNRECOGNITION, Artificial Intelligence, 020204 information systems, Pattern recognition (psychology), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Computer Vision and Pattern Recognition, Artificial intelligence, business, Software
Description: In many multilabel learning applications, fully labelled instances are scarce, while partially labelled and unlabelled data are more common because manual labelling is expensive. However, most existing models assume that sufficient fully labelled training data is available. To deal with partially labelled and unlabelled data effectively, we present a novel semi-supervised multilabel learning approach based on constrained non-negative matrix factorization. The approach assumes that if two instances are highly similar in terms of their features, they will also be similar in their associated label sets. Specifically, we first define three matrices that measure the similarity of each pair of instances in two different ways. Then, the optimal assignment of labels to the unlabelled instances is determined by minimizing the difference between these two sets of similarities via a non-negative matrix factorization process. We also present a threshold learning algorithm to determine the classification threshold for each label in the proposed approach. Extensive experiments are conducted on various datasets, and the results demonstrate that our method shows significantly better performance than other state-of-the-art approaches. It is especially suitable for situations where the labelled training set is small or where a subset of the training data is only partially labelled.
ISSN: 1868-808X
1868-8071
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::27a254222e16880ea4f5475ad0f04aa6
https://doi.org/10.1007/s13042-018-0787-8
Rights: CLOSED
Accession Number: edsair.doi...........27a254222e16880ea4f5475ad0f04aa6
Database: OpenAIRE
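
Illustrative sketch: The description above outlines the core idea of matching feature-based similarity with label-based similarity through a non-negative factorization. The Python sketch below is one simplified way to illustrate that idea and is not the authors' formulation: it assumes cosine similarity on non-negative features, models label similarity as F F^T, uses a damped multiplicative update for symmetric NMF, and clamps only fully labelled rows. The function names and parameters (cosine_similarity, propagate_labels, n_iter, eps, seed) are hypothetical. The per-label threshold learning step is not reproduced here.

```python
import numpy as np


def cosine_similarity(X):
    """Pairwise cosine similarity between rows of X (assumes X >= 0)."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / np.maximum(norms, 1e-12)  # guard against all-zero rows
    return Xn @ Xn.T


def propagate_labels(X, Y, labelled, n_iter=200, eps=1e-9, seed=0):
    """Fit non-negative label scores F so that F @ F.T approximates the
    feature similarity matrix, keeping the rows of labelled instances fixed.

    X        : (n, d) non-negative feature matrix
    Y        : (n, k) binary label matrix (rows of unlabelled instances ignored)
    labelled : indices of fully labelled instances
    """
    rng = np.random.default_rng(seed)
    W = cosine_similarity(X)
    n, k = Y.shape
    F = rng.uniform(0.1, 1.0, size=(n, k))  # strictly positive start
    F[labelled] = Y[labelled]
    for _ in range(n_iter):
        # Damped multiplicative update for min ||W - F F^T||_F^2 with F >= 0.
        numer = W @ F
        denom = F @ (F.T @ F) + eps
        F = F * (0.5 + 0.5 * numer / denom)
        # Constraint (assumption of this sketch): known labels never change.
        F[labelled] = Y[labelled]
    return F


# Toy usage: four instances, two labels, one labelled instance per group.
X = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
Y = np.array([[1.0, 0.0], [0.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
scores = propagate_labels(X, Y, labelled=[0, 2])
print(np.round(scores, 3))
```

The rows of scores for unlabelled instances are real-valued; turning them into label decisions would require a threshold per label, which is the role the description assigns to the threshold learning algorithm.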