دورية أكاديمية

Spectral Feature Selection from the Hyperspectral Dataset to Identify Pistachio Leaves Infected by Psylla

التفاصيل البيبلوغرافية
العنوان: Spectral Feature Selection from the Hyperspectral Dataset to Identify Pistachio Leaves Infected by Psylla
المؤلفون: A Moghimi, A Sazgarnia, M. H Aghkhani
المصدر: Journal of Agricultural Machinery, Vol 12, Iss 2, Pp 159-167 (2022)
بيانات النشر: Ferdowsi University of Mashhad, 2022.
سنة النشر: 2022
المجموعة: LCC:Agriculture (General)
LCC:Engineering (General). Civil engineering (General)
مصطلحات موضوعية: classification, feature selection, hyperspectral data, multispectral data, pistachio, psylla, random forest, spectroscopy, Agriculture (General), S1-972, Engineering (General). Civil engineering (General), TA1-2040
الوصف: IntroductionPistachio production has been adversely affected by Psylla, which is a devastating insect. The primary goal of this study was to select sensitive spectral bands to distinguish pistachio leaves infected by Psylla from healthy leaves. Diagnosis of psylla disease before the onset of visual cues is crucial for making decisions about topical garden management. Since it is not possible to diagnose psylla disease even after the onset of symptoms with the help of color images by drones, hyperspectral and multispectral sensors are needed. The main purpose of this study was to extract spectral bands suitable for distinguishing healthy leaves from psylla leaves. For this purpose, in this paper, a new method for selecting sensitive spectral properties from hyperspectral data with the high spectral resolution is presented. The intelligent selection of sensitive bands is a convenient way to build multispectral sensors for a specific application (in this article, the diagnosis of psylla leaves). Knowledge of disease-sensitive wavelengths can also help researchers analyze multispectral and hyperspectral aerial images captured by satellites or drones.Materials and MethodsA total number of 160 healthy and diseased leaves were scanned in 64 spectral bands between 400-1100 nm with 10 nm spectral resolution. A random forest algorithm was used to identify the importance of features in classifying the dataset into diseased and healthy leaves. After computing the importance of the features, a clustering algorithm was developed to cluster the most important features into six clusters such that the center of clusters was 50 nm apart. To transfer the hyperspectral dataset into a multispectral dataset, the reflectance was averaged in spectral bands within ±15 nm of each cluster center and achieved six broad multispectral bands. Afterwards a support vector machine algorithm was utilized to classify the diseased and healthy leaves using both hyperspectral and multispectral datasets.Results and DiscussionThe center of clusters were 468 nm, 598 nm, 710 nm, 791 nm, 858 nm, and 1023 nm, which were calculated by taking the average of all the members assigned to the individual clusters. These are the most informative spectral bands to distinguish the pistachio leaves infected by Psylla from the healthy leaves. The F1-score was 90.91 when the hyperspectral dataset (all bands) was used, while the F1-score was 88.69 for the multispectral dataset. The subtle difference between the F1-scores indicates that the proposed pipeline in this study was able to select appropriately the sensitive bands while retaining all relevant information.ConclusionThe importance of spectral bands in the visible and near-infrared region (between 400 and 1100 nm) was obtained to identify pistachio tree leaves infected with psylla disease. Based on the importance of spectral properties and using a clustering algorithm, six wavelengths were obtained as the best wavelengths for classifying healthy and diseased pistachio leaves. Then, by averaging the wavelengths at a distance of 15 nm from these six centers, the hyperspectral data (64 bands) became multispectral (6 bands). Since the correlation between the wavelengths in the near-infrared region was very high (more than 95%), out of the three selected wavelengths in the near-infrared region (710, 791, and 1023), only the 710-nm wavelength, which was closer to the visible region, was selected. The results of classification of infected and diseased leaves using hyperspectral and multispectral data showed that the degree of classification accuracy decreases by about 2% and if only 4 bands are used, the degree of accuracy decreases by about 3%.The results of this study revealed that the proposed framework could be used for selecting the most informative spectral bands and accordingly develop custom-designed multispectral sensors for disease detection in pistachio. In addition, we could reduce the dimensionality of the hyperspectral datasets and avoid the issues related to the curse of dimensionalitylity.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
Persian
تدمد: 2228-6829
2423-3943
Relation: https://jame.um.ac.ir/article_35124_e4f56bf4c0a994ada7c9b65bf558314d.pdf; https://doaj.org/toc/2228-6829; https://doaj.org/toc/2423-3943
DOI: 10.22067/jam.v12i2.82089
URL الوصول: https://doaj.org/article/99c78a37620542558d7b30162cff1a6c
رقم الأكسشن: edsdoj.99c78a37620542558d7b30162cff1a6c
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:22286829
24233943
DOI:10.22067/jam.v12i2.82089