دورية أكاديمية

Deep Network Architectures as Feature Extractors for Multi-Label Classification of Remote Sensing Images

التفاصيل البيبلوغرافية
العنوان: Deep Network Architectures as Feature Extractors for Multi-Label Classification of Remote Sensing Images
المؤلفون: Marjan Stoimchev, Dragi Kocev, Sašo Džeroski
المصدر: Remote Sensing, Vol 15, Iss 2, p 538 (2023)
بيانات النشر: MDPI AG, 2023.
سنة النشر: 2023
المجموعة: LCC:Science
مصطلحات موضوعية: remote sensing, convolutional neural networks, tree ensemble methods, multi-label classification, Science
الوصف: Data in the form of images are now generated at an unprecedented rate. A case in point is remote sensing images (RSI), now available in large-scale RSI archives, which have attracted a considerable amount of research on image classification within the remote sensing community. The basic task of single-target multi-class image classification considers the case where each image is assigned exactly one label from a predefined finite set of class labels. Recently, however, image annotations have become increasingly complex, with images labeled with several labels (instead of just one). In other words, the goal is to assign multiple semantic categories to an image, based on its high-level context. The corresponding machine learning tasks is called multi-label classification (MLC). The classification of RSI is currently predominantly addressed by deep neural network (DNN) approaches, especially convolutional neural networks (CNNs), which can be utilized as feature extractors as well as end-to-end methods. After only considering single-target classification for a long period, DNNs have recently emerged that address the task of MLC. On the other hand, trees and tree ensembles for MLC have a long tradition and are the best-performing class of MLC methods, but need predefined feature representations to operate on. In this work, we explore different strategies for model training based on the transfer learning paradigm, where we utilize different families of (pre-trained) CNN architectures, such as VGG, EfficientNet, and ResNet. The architectures are trained in an end-to-end manner and used in two different modes of operation, namely, as standalone models that directly perform the MLC task, and as feature extractors. In the latter case, the learned representations are used with tree ensemble methods for MLC, such as random forests and extremely randomized trees. We conduct an extensive experimental analysis of methods over several publicly available RSI datasets and evaluate their effectiveness in terms of standard MLC measures. Of these, ranking-based evaluation measures are most relevant, especially ranking loss. The results show that, for addressing the RSI-MLC task, it is favorable to use lightweight network architectures, such as EfficientNet-B2, which is the best performing end-to-end approach, as well as a feature extractor. Furthermore, in the datasets with a limited number of images, using traditional tree ensembles for MLC can yield better performance compared to end-to-end deep approaches.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2072-4292
Relation: https://www.mdpi.com/2072-4292/15/2/538; https://doaj.org/toc/2072-4292
DOI: 10.3390/rs15020538
URL الوصول: https://doaj.org/article/6374a14d07514e8281727904c1e2ffcf
رقم الأكسشن: edsdoj.6374a14d07514e8281727904c1e2ffcf
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20724292
DOI:10.3390/rs15020538