Academic Journal
Human Communication-Inspired Semantic–View Collaborative Network for Multispectral Remote Sensing Image Retrieval
Title: | Human Communication-Inspired Semantic–View Collaborative Network for Multispectral Remote Sensing Image Retrieval |
---|---|
Authors: | Nan Wu, Wei Jin, Randi Fu |
Source: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Vol 17, Pp 11230-11245 (2024) |
Publication Information: | IEEE, 2024. |
Publication Year: | 2024 |
Collection: | LCC: Ocean engineering; LCC: Geophysics. Cosmic physics |
Subject Terms: | Content-based image retrieval, graph neural networks (GNNs), multiview learning (MVL), multispectral image (MSI), Ocean engineering, TC1501-1800, Geophysics. Cosmic physics, QC801-809 |
Description: | Multispectral images (MSIs) have widespread applications, and efficiently managing these extensive MSI collections via remote sensing image retrieval (RSIR) is key to boosting their practical value. While current deep learning-based methods offer strong image representation learning capabilities, adapting to complex and dynamic relationships between objects and spectral information in MSIs remains challenging. This difficulty arises from the distinct attributes of different spectral bands and from a lack of attention to interactions among spectral combinations in MSIs, which limits retrieval performance. To address this, we propose a dynamic learning system inspired by human communication, named the semantic–view collaborative network (SVCNet), which actively promotes the interaction between spectral and semantic information. By linking multiview learning (MVL) with graph neural networks (GNNs) to simulate the three stages of human communication (understanding, communication, and collective consensus and reflection), SVCNet enhances RSIR with flexibility in representation extraction. Specifically, each spectral combination is processed to extract independent representations as view-specific knowledge. In the communication phase, we devise the graph attention-based multiround communication module (GACM), which uses a GNN to perform graph-structured modeling and adaptive updating of views and semantics. Moreover, we achieve improved MSI representations by implementing novel objective functions that align learned semantics with category information, dynamically differentiate semantic similarities and disparities in MSIs, and flexibly weight samples for enhanced adaptability in a multilabel RSIR environment. SVCNet surpasses current state-of-the-art methods on three MSI datasets for single-label and multilabel retrieval tasks. It effectively handles class imbalance and distinguishes challenging samples, highlighting its extensive applicability. |
Document Type: | article |
File Description: | electronic resource |
Language: | English |
ISSN: | 1939-1404 2151-1535 |
Relation: | https://ieeexplore.ieee.org/document/10547370/; https://doaj.org/toc/1939-1404; https://doaj.org/toc/2151-1535 |
DOI: | 10.1109/JSTARS.2024.3409417 |
Access URL: | https://doaj.org/article/9030675edfe6407e9dc4f635b85e7177 |
Accession Number: | edsdoj.9030675edfe6407e9dc4f635b85e7177 |
Database: | Directory of Open Access Journals |