ResNet incorporating the fusion data of RGB & hyperspectral images improves classification accuracy of vegetable soybean freshness

Bu, Yuanpeng; Hu, Jinxuan; Chen, Cheng; Bai, Songhang; Chen, Zuohui; Hu, Tianyu; Zhang, Guwen; Liu, Na; Cai, Chang; Li, Yuhao; Xuan, Qi; Wang, Ye; Su, Zhongjing; Xiang, Yun; Gong, Yaming

doi:10.1038/s41598-024-51668-6

Download PDF

Article
Open access
Published: 31 January 2024

ResNet incorporating the fusion data of RGB & hyperspectral images improves classification accuracy of vegetable soybean freshness

Yuanpeng Bu¹,
Jinxuan Hu²,
Cheng Chen³,
Songhang Bai²,
Zuohui Chen²,
Tianyu Hu²,
Guwen Zhang¹,
Na Liu¹,
Chang Cai²,
Yuhao Li²,
Qi Xuan²,
Ye Wang⁴,
Zhongjing Su²,
Yun Xiang² &
…
Yaming Gong¹

Scientific Reports volume 14, Article number: 2568 (2024) Cite this article

1122 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The freshness of vegetable soybean (VS) is an important indicator for quality evaluation. Currently, deep learning-based image recognition technology provides a fast, efficient, and low-cost method for analyzing the freshness of food. The RGB (red, green, and blue) image recognition technology is widely used in the study of food appearance evaluation. In addition, the hyperspectral image has outstanding performance in predicting the nutrient content of samples. However, there are few reports on the research of classification models based on the fusion data of these two sources of images. We collected RGB and hyperspectral images at four different storage times of VS. The ENVI software was adopted to extract the hyperspectral information, and the RGB images were reconstructed based on the downsampling technology. Then, the one-dimensional hyperspectral data was transformed into a two-dimensional space, which allows it to be overlaid and concatenated with the RGB image data in the channel direction, thereby generating fused data. Compared with four commonly used machine learning models, the deep learning model ResNet18 has higher classification accuracy and computational efficiency. Based on the above results, a novel classification model named ResNet-R &H, which is based on the residual networks (ResNet) structure and incorporates the fusion data of RGB and hyperspectral images, was proposed. The ResNet-R &H can achieve a testing accuracy of 97.6%, which demonstrates a significant enhancement of 4.0% and 7.2% compared to the distinct utilization of hyperspectral data and RGB data, respectively. Overall, this research is significant in providing a unique, efficient, and more accurate classification approach in evaluating the freshness of vegetable soybean. The method proposed in this study can provide a theoretical reference for classifying the freshness of fruits and vegetables to improve classification accuracy and reduce human error and variability.

Sugariness prediction of Syzygium samarangense using convolutional learning of hyperspectral images

Article Open access 17 February 2022

Fully connected-convolutional (FC-CNN) neural network based on hyperspectral images for rapid identification of P. ginseng growth years

Article Open access 26 March 2024

Sugarcane nitrogen nutrition estimation with digital images and machine learning methods

Article Open access 11 September 2023

Introduction

Vegetable soybean (VS) is typically harvested immaturely when the seeds have filled 90% of the seed cavity and the pod color has not yet turned yellow^1,2. The premature harvest makes VS rich in free amino acids and carbohydrates, a fresh green color, and a soft and sticky texture; all of which are important organoleptic quality properties of VS³. However, the texture and physicochemical properties of immature VS still undergo complex changes after harvest due to continuing metabolic processes. These changes lead to a rapid decrease in freshness, such as yellowing, increased spots, and decreased sweetness and flavor, which can significantly affect the nutritional value, taste, and appearance quality of VS⁴. It is reported that the sucrose contents can be drastically decreased over 60%, from 8.7%–10.4% to 3.0%–3.1%, within one day stored in ${25}^{\circ }\hbox {C}$⁵.

The freshness of VS plays a crucial role in attracting consumers⁶. Various factors, such as genotype, pod mature degree at harvest, storage time and conditions, can affect the physicochemical properties, thereby influencing the freshness of VS^2,4,7. Researchers have examined that the physicochemical characteristics, including total soluble sugar, moisture, total free amino acid, starch, protein, oil, pod green intensity, and seed hardness, are closely connected to the taste and appearance quality of VS^8,9,10,11,12. The color of the pod is the first aspect of the appearance quality that consumers pay attention to. Additionally, in sensory quality evaluation, the total soluble sugar is reported significantly positively correlated with the taste-quality score (r = 0.864, p < 0.01)¹³. The tender texture or low hardness is one of the important characteristics of VS for better taste and easier processing. VS with low moisture content have higher hardness and are scored lower in overall sensory evaluation¹⁴. The content of free amino acids significantly affected the umami taste and flavor of vegetable soybeans¹⁰. In the agricultural industry standard of the People’s Republic of China, “Vegetable Soybean Varieties Quality” (NY/T3705-2020), the aforementioned eight physicochemical parameters serve as the basis for quality assessment.

According to our knowledge, research is scarce on how to identify and evaluate the freshness of VS. Traditionally, freshness is quantified through sensor evaluation and physical and chemical index testing. However, sensory evaluation is subjective and lacks accuracy and universality. It is performed by individual experts and the results all depend on their sight, touch, taste, and smell. The measurement and evaluation of freshness-related indicators through physical and chemical experiments are the most common method. However, these processes are very time-consuming and laborious. As one of the most efficient techniques for plant nutrients analysis¹⁵, proton nuclear magnetic resonance spectroscopy has been used to determine sugars, organic acids, and amino acids changes of VS seeds². Although this method does not require complex chemical reactions, the contribution of each indicator to freshness is challenging to define accurately. Therefore, it is important and urgent to develop an objective, efficient, and comprehensive evaluation technology for VS freshness research.

Non-destructive analysis of food freshness and quality using optical sensing technology has become a current research hotspot^16,17,18. In its early phase, this technique was primarily utilized for detecting fungi which is responsible for rotting citrus fruits¹⁹, identifying mechanical damage in mangoes, categorizing agricultural produce, and assessing the ripeness of tomatoes^16,20. Thereinto, hyperspectral imaging provides both spatial and spectral information, making it ideal for monitoring the ripening process of agricultural products, such as ethylene biosynthesis, chlorophyll degradation, nutrient conversion and respiratory action²¹. Machine learning classification algorithms based on RGB images have been widely used in the quality grading of agricultural products^22,23, as well as ripeness monitoring of bell peppers²⁴ and gooseberries²⁵. The deep residual networks (ResNet) can greatly enhance the efficiency of neural networks in the image classification tasks²⁶. Convolutional Neural Network (CNN) is a kind of Feedforward Neural Network with a deep structure including convolution calculation. It is one of the representative algorithms of deep learning. At present, CNN technology has been used for fruit and vegetable classification and fruit ripened detection based on RGB images^27,28. Recently, a deep learning system for multi-category classification is proposed based on an improved YOLOv4 model²⁹. The system first identifies the type of objects in the RGB image, and then categorizes them as fresh or rotten. However, in their work, only two categories are distinguished with limited accuracy.

The RGB image recognition³⁰ technology is widely used in the study of food appearance evaluation. In addition, the hyperspectral image has outstanding performance in predicting the nutrient content of samples. However, research on classification models based on the fusion data of RGB and hyperspectral images are scarce. Therefore, in this work, we classified the freshness of VS by different stored times and determine their physicochemical properties. Then the RGB and hyperspectral images were collected in chronological order. ResNet is a widely used deep neural network architecture and is excellent in tasks such as image classification and target detection. It also has good generalization capabilities across different datasets and tasks. ResNet makes it easier to train deep networks while mitigating vanishing gradients. Hyperspectral images and RGB images cannot be directly fused because their dimensions are 1D single-channel and 2D dual-channel, respectively. In that case, the ResNet-based model is helpful in capturing their complex relationships. Moreover, the residual connection of ResNet can fully consider the correlation between multi-channel data. Therefore, ResNet model is capable of integrating these two types of data efficiently. Based on the fusion of RGB and hyperspectral image data, we develop a vegetable soybean freshness classification model named ResNet-R &H (RGB & Hyperspectral imagery).

Materials and methods

In this section, we present the sample management method, including physical and chemical characterization analysis, RGB and hyperspectral image acquisition, calibration, and processing. For vegetable soybean freshness classification, we develop ResNet-R &H model, which is based on the fusion of RGB image and hyperspectral data.

Experimental materials

The genotype “Zhenong 6”, one of China’s most representative and popular VS cultivars³¹, is used. The experimental materials were planted in the experimental field of the Zhejiang Academy of Agricultural Sciences on April 5, 2022. One thousand well-developed and disease-free three seed pods were harvested at the R6 stage, when the seeds were filled 90% of the seed cavity, and the pods and seeds were bright green in color. The pods are stored in a controlled environment greenhouse, maintaining a constant temperature of ${24}^\circ \hbox {C}$ , an ambient humidity of 60%, a CO₂ concentration of 400 ppm, a 12/12-hour light/dark photoperiod, and a photosynthetic flux density of 600 μ mol m⁻²s⁻¹. The harvested pods are divided into two groups: one consisting of 100 pods, and the other consisting of 900 pods. On the 1st, 3rd, 5th, and 7th days, the 100 pods from the first group are numbered and subjected to collect RGB and hyperspectral images. Simultaneously, 200 pods are randomly selected from the second group, and their seeds are extracted to perform quantitative measurements of hardness, soluble sugar, free amino acid, starch, moisture, protein, and oil content. The collection of plant materials complied with relevant institutional, national, and international guidelines and legislation.

PCI extraction

Physical characterization

The TA.XT Plus texture analyzer (Stable Micro Systems Ltd., UK) was adopted to test the seed hardness of VS according to the method described by our previous report¹⁴. Put it briefly, a cylinder stainless probe with a diameter of ${2} \hbox {mm}$ was equipped for puncture testing. The puncture test speed is ${1} \hbox {mm s}^{-1}$ and the test time (t) is 2 s. The averaged mechanical work (calculated as $W=\int _0^tf(t)dt$) from six parallel tests was used as the seed hardness index.

Chemical compositions

To measure the moisture content, twenty green seeds were heated in a constant temperature oven at 75$^\circ \hbox {C}$ until the weight stops dropping. The moisture content of the fresh VS seeds was then determined gravimetrically³². Freeze-dried VS seeds were ground into powder for determination of amino acid content, soluble sugar, protein, crude protein, and oil. The analysis of free amino acids was performed using a Hitachi 8900 amino acid analyzer (Hitachi High-Technologies, Tokyo, Japan) referring to the literature³. The soluble sugar content was determined by anthrone colorimetry using glucose as the standard ³³. The Kjeltec TM2300 autosampler system (Foss Analytical, Hillerd, Denmark) was adopted to measure protein content. The crude protein was estimated using a conversion factor of 6.25¹¹. Oil content is estimated using a Soxtec 2050 Soxhlet extraction system (Foss Analytical, Hillerd, Denmark)³⁴. Each sample was analyzed three times to ensure accuracy.

Image data processing

Image acquisition

RGB images of VS pods were captured using a Canon EOS 200D II camera in a RGB image acquisition system. The camera parameters were as follows: lens f=18–55mm, focal length 0.25 m, shutter speed 1/4000 to 1 sec, aperture f/4−5.6, lens mounting height 46mm. The digital images were converted into RGB (red–green–blue) and input to the CIELAB system using the Conversion Munsell (program version 4.01), thus deriving the parameters of the Lab color model. To assess the green intensity of VS, the hue parameter [H = arctang (b/a)] was calculated ^35,36,37.

A hyperspectral image acquisition system³⁸ was adopted to collect hyperspectral images of VS. The system consisted of a Pika XC hyperspectral camera, an imaging spectrograph, a high-performance Schneider Xenoplan 1.4/17 lens unit, four 15-W 12-V tungsten halogen lamps, and a Spectral Image data acquisition software. In this study, reflectance data was obtained within a spectral range spanning 386 to 1004 nm. The spectral resolution employed for this data collection was set at 1.3 nm. To initiate the spectral analysis process, the spectrometer undergoes a preheating phase aimed at achieving a consistent light irradiation. Subsequent to this step, a black and white correction procedure was executed. The initial stage of this correction process involves utilizing a whiteboard as the background. Following the acquisition of the background image of the whiteboard, the whiteboard was removed and then replaced by the samples, thereby allowing for the capture of hyperspectral images.

RGB image processing

To reduce the noise of varying ambient light on RGB images, a one-time color difference correction using a 24-color Macbeth card were performed .Then we extract the positions corresponding to the hyperspectral region of interest (ROI). Those positions in the RGB image are cropped and downsampled to ensure consistency.

Hyperspectral image processing

The ROI was manually selected on the pods with the assistance of ENVI5.3 (ITT, Visual Information Solutions, America)^38,39,40, as shown in Fig. 1. Then the average spectrum was calculated. Multiple scattering correction (MSC) was adopted to process the hyperspectral data. It can effectively eliminate the spectral differences⁴¹. MSC is a commonly used algorithm for hyperspectral data preprocessing. It can effectively eliminate spectral differences due to different scattering levels, and enhance the correlation between spectra and data. MSC corrects the baseline translation and offsets phenomena of spectral data. The detailed steps are as follows.

1.
It first derives the average value of all spectral data as the “ideal spectrum”.
2.
It performs a linear regression between the spectra of each sample and the ideal spectrum, and solve the least-squares problem to obtain the baseline shift and offset of each sample.
3.
It calibrates the spectra of each sample by subtracting the baseline shift and dividing by the offset to derive the corrected spectra.

Data fusion model

Combining RGB images with hyperspectral ones in the same model can significantly improve the estimation performance. The RGB images were two-dimensional while the processed hyperspectral data were one-dimensional. The size of the hyperspectral data was 1*462. In accordance with the deep learning model’s prerequisite, it was imperative that the dimensions of both the hyperspectral data and the image data align with each other. Therefore, the hyperspectral data was reconstruct to two dimensions (22$\times$21). For the RGB image data, the entire areas of the soybean pods were extracted and aligned with the hyperspectral data, i.e., the shape of the image data after downsampling was 22$\times$21$\times$3. Then we merged the single-channel hyperspectral data with the three-channel RGB image. The processed hyperspectral data was concatenate with the RGB image data in the color channel. Therefore, our model had 4 input channels.

The fused data were fed into ResNet-R &H. The fused data was four channels and its size was 22*21*4. The mathematical process for the ResNet-R &H model is as follows.

$$W_{{in}} *H_{{in}} *D_{{in}} = W_{r} *H_{r} *D_{r} + W_{h} *H_{h} *D_{h} ,$$

(1)

where $W_{in}$,$H_{in}$,$D_{in}$ are the width, height and number of channels of the fused data, respectively. $W_r$ is the width; $H_r$ is the height; and $D_r$ is the number of channels of the RGB image. $W_h$ is the width; $H_h$ is the height; and $D_h$ is the number of channels after reshape of the hyperspectral data.

The core idea of ResNet is the introduction of residual block (residual block), whose mathematical expression is as follows.

$$\begin{aligned} x_{l+1} = x_{l} + F(x_{l},w_{l}), \end{aligned}$$

(2)

where $x_{l}$ is the input of a layer, $x_{l+1}$ is the input of the next layer, and the function F(x, $w_{i}$) is the residual mapping to be learned, which consists of convolution, normalization, and relu activation functions. For a deeper layer L, its relation to layer l can be expressed as follows:

$$\begin{aligned} x_{L} = x_{l} + \sum _{i=l}^{L-1}F(x_{i},W_{i}). \end{aligned}$$

(3)

The L layer can be represented by the combination of any l-layer network shallower than it and the sum of the residual parts between them. the inputs of the l+1 layer are the outputs of the l layer as follows:

$$\begin{aligned} x_{l+1} = f(y_{l}), \end{aligned}$$

(4)

where f(x) is the activation function such as ReLU.The residual block is divided into two parts, which are the direct mapping part and the residual mapping part. The generic representation of the residual network is as follows:

$$\begin{aligned} y_{l} = h(x_{l}) + F(x_{l},w_{l}), \end{aligned}$$

(5)

where h(x) is the direct mapping part, which represents the direct mapping to the input (e.g., a constant transformation, i.e.,h($x_l$)=$x_l$); and f(x,w) is the residual part, which is the result of the input processed through the direct mapping part and the residual mapping part. The final output of the model in our experiments contains four classes.

The ResNet-R &H contains a conv1 stage, Layer1, Layer2, Layer3, Layer4 modules, a pooling layer, and a linear layer.The conv1 stage comprises of a convolutional layer, along with a batch normalization element. This layer operates with 4 input channels and generates 64 output channels. It employs a kernel size of 3, a stride of 1 for convolution, and is accompanied by a padding of 1. The batch normalization layer is used to accelerate the network’s convergence. Layer1 contains two normal residual modules, Layer2 to Layer4 consists of a downsampled convolutional module and a normal residual module.

The convolutional module contains three Conv2d convolutional layers and has two paths. The first path goes through the first convolutional layer, the Relu activation function, and the second convolutional layer sequentially. The second path contains a shortcut convolutional layer. The two paths are summed up and the result goes through the activation function Relu. To ensure shape consistency, the Conv2d convolutional kernel on the shortcut is set to 1. The residual module does not have the convolutional layer at the shortcut and directly adds up the original input. There are 8 blocks of convolutional modules. Then the model has one average pooling layer to deduce the parameters. It can improve the model’s accuracy and stability while reducing overfitting. The final output has four classes, representing four freshness categories.

Results

Physicochemical characteristics of VS

The freshness of VS decreases as the storage time increases. From an external perspective (Fig. 2a), the pod exhibits a vibrant green color and the surface is smooth and spotless when stored for one day. By the third day, the green intensity and vividness of the pod color began to decline slightly, and a yellowish hue starts to emerge from the base of the pod near the stalk. By the fifth day, this yellowing spreads throughout the entire pod, with brown spots appearing at its stem end. Upon reaching the seveth day, not only does the yellow color intensify but also do these brown spots enlarge and spread across the entire pod, rendering it unsuitable for sale and devoid of market value. The appearance characteristics of the samples showed regular changes with the decrease of freshness. The freshness is negatively correlated with the yellowing color and rust spot areas of the pods. In reality, experienced experts or consumers also measure freshness by observing these changes with their eyes. To give a clearer view of the details of the vegetable soybeans after they decay, we show the pictures of the vegetable soybeans from Day1 and Day7 in Supplement Figure S1.

Besides appearance, physicochemical characteristics of VS are also of great importance to freshness evaluation. Therefore, eight physicochemical traits (total soluble sugar, moisture, total free amino acid, starch, protein, oil, green intensity, hardness) that have been reported to be related to freshness are determined on the first, third, fifth, and seventh days after harvest (Fig. 2b and Supplement Table S1). With the extension of storage time, except for oil and hardness, the other six traits of VS show a downward trend. From the first day to the third day, the soluble sugar content dropped sharply from 12.64 to 7.22%, with a decrease rate of 42.88%. After the third day, the soluble sugar content slowly decreased. On the seventh day, the soluble sugar content is 6.30%, with a decrease rate of 12.68% compared to the third day. From the first day to the seventh day, the total soluble sugar decreased by 50.13%. The changes of total free amino acid content and green intensity are similar to those of soluble sugar content, which decrease rapidly from the first day to the third day and then decreased slowly. From the first day to the seventh day, the total free amino acid content decreased from 0.18 to 0.10%, with a decrease rate of 47.24%, which is slightly smaller than the decrease rate of total soluble sugar (50.13%). From the first day to the third day, the green intensity decreases from 1.15 to 1.00, and on the fifth and seventh days, it decreases to 0.97 and 0.94, respectively. High moisture content is one of the most obvious characteristics of VS as well as other vegetables or fruits. Water is the most abundant component of VS and make up more than 60% of the seed’s wet weight. The moisture content decreases from 67.16% on the first day to 60.70% on the seventh day, with a slow decrease from the first day to the third day and a rapid decrease from the third day to the seventh day. The starch content shows an approximately linear decrease process, from 21.43% on the first day to 15.04% on the seventh day, with a decline rate of 29.80%. The process of protein content change is quite unique. From the first day to the third day, the content slightly increases and then continues to decrease. However, the overall change is not obvious. Among the two increased characters, the hardness increased greatly, from 3.88 ×10⁻⁵ J to 6.39 ×10⁻⁵ J , with an increase rate of 64.74%, and the process of change is approximately linear. Oil content increase from 15.88 to 17.60%. In this process, the change from the third day to the fifth day is relatively obvious, while the other time periods have small changes.

Evaluation metrics

Different measures, including accuracy, precision, recall, and others, were employed for evaluating the efficacy of the model. The confusion matrix was also commonly employed. The main evaluation metrics are as follows:

True positive (TP): In cases wherein the actual value of the sample is positive, and the projected outcome of the model corresponds positively as well.
False positive (FP): In cases wherein the actual value of the sample is negative, yet the projected outcome of the model is positive.
True negative (TN): In cases wherein the actual value of the sample is negative, and the projected outcome of the model corresponds negatively as well.
False negative (FN): In cases wherein the actual value of the sample is positive, yet the projected outcome of the model is negative.

Accordingly, we employ the following metrics.

Accuracy: It is the percentage of positive and negative cases correctly predicted.
$$\begin{aligned} Accuracy = \frac{(TP + TN)}{TP + TN + FP + FN} \end{aligned}$$
(6)
Recall: it is the percentage of actual positive cases that are correctly predicted to be positive.
$$\begin{aligned} Recall = \frac{TP}{(TP + FN)} \end{aligned}$$
(7)
Precision: it is the percentage of positive cases predicted by the model as positive.
$$\begin{aligned} Precision = \frac{TP}{TP + FP} \end{aligned}$$
(8)
F1-Score: it is the summed average of the precision and recall scores.
$$\begin{aligned} F1 = 2 * \frac{Precision * Recall}{Precision + Recall} \end{aligned}$$
(9)

The results were presented by a confusion matrix. The X axis was the prediction of the model and the Y axis was the number of true labels of the data.

In this work, the technique proposed in this study was compared with four widely used machine learning models as follows.

Decision tree In machine learning, the decision tree⁴² finds fundamental application for both classification and regression purposes. It has proven to be highly effective in solving classification problems and is widely used in practice. The decision tree operates by selecting the most relevant features in the data, dividing the training samples based on these features, and then recursively repeating this process. The selection of features in the decision tree is based on two criteria: information entropy and information gain. The features with higher information gain were selected as the basis for dividing the data.
Random forest

A random forest is a collection of decision trees that combines numerous decisions into a single outcome⁴³. This algorithm runs by reconstructing multiple decision trees during the training phase, and it blongs to ensemble learning. Ensemble learning was employed to predict a single outcome by erecting a composite of numerous models. It functions by generating various classifiers, each of which learns and produces predictions autonomously, and then consolidating the ultimate prediction.
Adaboost

Boosting, also known as augmented learning, is a crucial technique for integrated learning. It can transform weak learners, which have prediction accuracy only slightly better than random guesses, into strong learners with high prediction accuracy. The AdaBoost algorithm is equivalent to a forward staged additive modeling algorithm that minimizes the loss of new indices used for multi-class classification⁴⁴.
KNN

The K-Nearest Neighbor (KNN) classification approach quantifies the gap separating unidentified samples from their established counterparts by referencing known samples spanning various categories⁴⁵. Within this process, the algorithm singles out K known samples, those closest in proximity to the unidentified specimen. Subsequently, adhering to the guideline of minority voting, the categorization of the unidentified samples aligns with the class of the K nearest instances spanning the most extensive array of categories.

Evaluation results

The current study has partitioned the processed VS samples and their associated images into two distinct sets, the training set was 70% and the test set was 30%. Notably, the selection of samples for each set was executed randomly. Moreover, the distribution of each category remains consistent in both sets, satisfying the principle of proportionality. The algorithms were run on a system equipped with an Intel Xeon Gold 5218 CPU, NVIDIA Tesla V100 GPU, and Ubuntu 18.04 operating system. The classification model was trained using Python (version 3.6.8) and PyTorch (version 1.7.1).

The method was evaluated through three datasets: the hyperspectral dataset, the RGB image dataset, and the fused dataset. Each dataset consists of four classes of VS with varying freshness levels. The datasets contain 462 bands of hyperspectral data, which was reprocessed into a single channel of $22\times 21$ data. The colored images were also resized to $22\times 21$ with a total number of 416.

Firstly, a waveband analysis on VS of varying freshness levels was performed. The hyperspectral values of each VS were then averaged and normalized. Fig. 3a illustrates the normalized spectral values corresponding to different bands of VS at four freshness levels. Fig. 3b shows the first-order derivatives corresponding to different bands of VS at the same freshness levels. As shown in Fig. 3a, there is one peak at 562 nm and one trough at 688 nm. The results of the first-order derivative analysis in Fig. 3b demonstrate that the spectral reflection separation between the types and levels of VS freshness mainly occurs within the 494 nm to 681 nm and 695 nm to 764 nm range⁴⁶. The spectral values within this range vary significantly, indicating that the spectrum in this band is strongly correlated to VS freshness.To better indicate the location of the peaks and valleys of the wave band in Fig. 3a and the reflectance separation band region in Fig. 3b, the 500 nm–710 nm region of Fig. 3a was zoomed (Supplement Figure S2). The 420nm-700nm region of Fig. 3b (Supplement Figure S3), and the 650nm-770nm region (Supplement Figure S4) are also amplified.

Then, VS samples were classified using machine learning techniques and our deep learning method based on RGB image data and hyperspectral data separately. Table 1 presents the accuracy, average precision, average recall, and average F1 score metrics for four machine learning models and deep learning models trained with hyperspectral data, image, and fused data, respectively. The results indicate that the highest testing accuracy achieved by machine learning models is 87.2%, while the deep learning method achieves the highest accuracy of 97.6% among all the types of data. And the values of precision, recall, and F1-score are all higher than those of traditional machine learning models. This demonstrates that the model developed in this study significantly enhances the classification performance. Due to the small number of samples and high accuracy, we use Wilson’s method to derive the confidence intervals. We calculate the confidence intervals for the 125 test samples with 97.6% classification accuracy at 95% confidence level and 99% confidence level, respectively. The results are 93.18%–99.18% and 93.15%–99.50%. This demonstrates the reliability of our method and show enough statistical significance.

Table 1 Evaluation metrics of the classification effects of five classification models on VS freshness based on RGB image data, hyperspectral data, and fusion data.

Full size table

Table 2 shows the processing time of 125 samples from the test set using different methods. When the data sources are RGB images and hyperspectral images, the time required by the proposed ResNet-R &H method in this study is 13.80 ms and 10.18 ms, respectively, which are higher than the remaining four machine learning models. When the data source is fused data, its required time is 15.23ms, which is only 1.43–5.05 ms higher than that based on the single source data. And the ResNet-R &H requires more inference time than that of DecisionTree/KNN/RandomForest based on the fusion data, the time gap required to process 125 samples is only 4.38–11.29 ms, which is generally acceptable in real-world applications. What’s more, the computation time of the ResNet-R &H model is even much lower than that of AdaBoost.

Table 2 Time required for inference test set of 125 samples under different methods.

Full size table

Figure 4 shows the confusion matrix of VS freshness classification under four machine learning methods and one deep learning method. Comparing Fig. 4a–d and e–h, it is clear that these models trained solely on RGB or hyperspectral data have considerably lower classification. This suggests that relying upon a single data source suffers from incomplete representation of certain essential characteristics. Therefore, using fused data has led to a marked improvement in classification accuracy, precision, recall, and F1 score.

Band ablation analysis

Table 3 Evaluation metrics of the classification effects of five classification models on VS freshness based on hyperspectral data and fusion data.

Full size table

The characteristic bands most closely related to freshness were identified. Instead of utilizing the full-band data, feature band selection on the hyperspectral bands was performed^47,48,49.In Fig. 5, the correlation analysis was employed to investigate the relationship between various wavelengths of data and their corresponding freshness categories. The correlation between bands and freshness was analyzed using the distance correlation coefficient^50,51. It was calculated by dividing the distance covariance of two random variables with the product of their distance standard deviations. The detailed equation is shown as follows.

$$\begin{aligned} dCor(X,Y) = \frac{dCov(X,Y)}{\sqrt{dVar(X)dVar(Y)}}. \end{aligned}$$

(10)

A stronger correlation was indicated by a correlation coefficient closer to 1. The experimental results show a series of highly correlated wavelengths around 670 nm and 980 nm. The distribution of selected wavelengths closely resembles the outcomes of the first-order derivative analysis, indicating that peaks and valleys at wavelengths correspond to higher correlations.

We selected 32 bands (673 nm–695 nm and 979 nm–1003 nm) with correlation coefficients greater than 0.7 to train models. All the other bands were set to zero. The results are shown in Table 3. It is observed that the ResNet-R &H model achieves the highest accuracy in general. The test accuracy obtained by the AdaBoost method is nearly equal to the full-band accuracy (462). The RandomForest method is about 5.6% lower than the fullband model. KNN has a 20% drop in accuracy compared to the full band.

According to results of the experiment, the RandForest and AdaBoost methods have achieved 92.0% and 92.8% accuracy, respectively. Which are higher than those trained on the full-band fused data, with values of 84.8% and 87.2% (Table 1 and Table 3. The feature selection can select the feature bands that are most closely related to the freshness, thus reducing the hardware requirement. Moreover, the average time required to train the network reduced from 3.634 to 3.465 s per epoch.

Discussion

Through visual and physicochemical indicators (Fig. 2), differences can be observed between samples at different freshness levels. The classification of vegetable freshness based on sensory evaluation and physical and chemical characteristics detection is a time-consuming and laborious task, and the results are difficult to reproduce. Therefore, there is a growing interest in non-contact technologies utilizing image and spectral analysis to achieve rapid and automated classification of vegetable freshness. At present, hyperspectral technology has been more and more widely used in the study of vegetable and fruit freshness. The research results of Polder et al.⁵²show that hyperspectral images have more advantages than ordinary RGB images in identifying the maturity of tomatoes, and the recognition and classification error of single pixel decreases from 51% to 19%. However, the efficiency of freshness classification is not high enough due to ignoring the external characteristics of the sample⁵³. The accuracy of classification results may be compromised by relying solely on a single source of data. And we experimentally prove this corollary in our study(Table 1). It is imperative to consider the fusion of data from multiple sources to improve classification accuracy. For example, there exist blurring distinctions between Day 1 and Day 3 observed from photos (Fig. 2a), while significant changes occur in the levels of chemicals like soluble sugars and free amino acids (Fig. 2b), which can be rapidly and sensitively detected with hyperspectral techniques (Fig. 3). In contrast, the contents of total soluble sugar, protein, and oil showed no difference between Day 5 and Day 7 (Fig. 2b), but the samples can be preliminarily distinguished visually (Fig. 2a). And this subjective visual distinction of the sample’s appearance changes can be more accurately and efficiently performed through RGB images. Therefore, the above results further illustrate that fusion data from both RGB and hyperspectral sources can effectively improve the accuracy and precision of freshness classification when compared to relying on a single data source (Table 1). Actually, this inference is confirmed in the subsequent experimental results of this paper. From the confusion matrix in Fig. 4m-o and Supplement Figure S5, it is demonstrated that the deep learning classification method based on RGB images misclassified 4 samples belonging to Day1 as Day3 (Fig. 4m). When the data source changes to hyperspectral (Fig. 4n) or fused data (Fig. 4o), the number of misclassifications is reduced to 0. On the contrary, there is a significant difference in the appearance of the samples on Day 3 and Day 7, which can be correctly distinguished through RGB images (Fig. 4m). However, there is still one sample misclassified based on hyperspectral images (Fig. 4n). Above all, based on the fusion data can effectively reduce the occurrence of the above mentioned misclassification situation (Fig. 4o), which highlighting the strength of the proposed model.

Hyperspectral imaging techniques enable the simultaneous acquisition of images at different wavelengths. In this work, the hyperspectral data have 462 wave bands. Due to the small sampling interval of imaging spectrometers, adjacent bands have high correlations and suffer from information redundancy. An RGB image is an array of color pixels of size M×N×3 (M×N pixel points, 3 channels), and each color pixel point is a composition of red, green, and blue components. The RGB camera decomposes the spectrum into three broad bands to capture images with superior spatial resolution but limited spectral resolution. Hyperspectral imaging techniques can simultaneously acquire images of different wavelengths in the same scene. However, the existing hyperspectral imaging equipment has low spatial resolution^54,55,56. The fusion of high spatial-resolution RGB images with low-resolution hyperspectral images can improve classification accuracy⁵⁷.

This work proposes a deep learning-based method to classify the freshness of VS based on RGB and hyperspectral image data. It combines hyperspectral data and image data to effectively distinguish the VS freshness. As shown in Table 1, the classification accuracy of VS freshness is significantly improved.

In the data preprocessing stage, the input image is downsampled. The downsampling processed image reduces the pixel points of the RGB image, thus reducing the model input complexity. The reduction of input data can shorten the training and inference time. Overall, the classification accuracy of the data fusion-based approach can be significantly improved with a quite slightly increased of inference time.

A feature selection model is proposed to select the most relevant bands. The correlation coefficient calculation is used, which requires limited resources. By reducing the input bands number from 462 to 32, performance for most methods decreases, except for RandomForest and AdaBoost. Their accuracies remain at 92.0% and 92.8% with fused input, respectively. For incomplete data sets, Random Forest can handle missing values and incomplete features. When building decision trees, Random Forest uses a random subset of features for training, so that even if some features are missing, other features can be used for prediction. The AdaBoost algorithm can handle incomplete data sets because it is an iterative algorithm that gradually adjusts the model to fit the missing data. In each iteration, AdaBoost adjusts the sample weights to focus more attention on the misclassified samples, thus improving the adaptation to the missing data. Such experimental results show that the feature selection method reduces the training time with less impact on the classification accuracy. In addition, our method offers the possibility to propose training algorithms with reduced training samples.

The method proposed in this study can provide a theoretical reference for classifying the freshness of other kinds of fruits and vegetables to improve classification accuracy and reduce human error and variability. The appearance and nutrient content of different vegetables or fruits are very different. With the decrease of maturity, the change trend of physical and chemical characteristics is also very different from that of vegetable soybeans. Therefore, our proposed model may only be applicable to the freshness classification of vegetable soybeans. In the later stage, the type and number of experimental samples can be expanded to improve the discrimination accuracy and generalization of the model.

Conclusions

In this study, we propose a novel classification model called ResNet-R &H, which is based on the residual networks (ResNet) structure and incorporates the fusion data of RGB and hyperspectral images. ResNet-R &H is significant in providing a unique, efficient, and more accurate classification approach in evaluating the freshness of vegetable soybean.

Data availability

Our dataset and code will be publicly available at https://github.com/HZSUZJ/DLDF.

References

Fehr, W., Caviness, C., Burmood, D. & Pennington, J. Stage of development descriptions for soybeans, glycine max (L.) merrill 1. Crop Sci. 11, 929–931 (1971).
Article Google Scholar
Song, J., Wu, G., Li, T., Liu, C. & Li, D. Changes in the sugars, amino acids and organic acids of postharvest spermine-treated immature vegetable soybean (glycine max l. merr.) as determined by 1h nmr spectroscopy. Food Prod. Process. Nutrit. 2, 1–10 (2020).
Google Scholar
Flores, D., Giovanni, M., Kirk, L. & Liles, G. Capturing and explaining sensory differences among organically grown vegetable-soybean varieties grown in northern california. J. Food Sci. 84, 613–622 (2019).
Article CAS PubMed Google Scholar
Sugimoto, M. et al. Metabolomic profiles and sensory attributes of edamame under various storage duration and temperature conditions. J. Agric. Food Chem. 58, 8418–8425 (2010).
Article CAS PubMed Google Scholar
Ko, J. et al. Changing patterns of sugars and tocopherols at before and after harvest of vegetable soybean. Korea Soybean Digest 12, 54656 (2011).
Google Scholar
Masuda, R. Quality requirement and improvement of vegetable soybean. Veget. Soybean Res. Needs Prod. Quality Improv. 12, 92–102 (1991).
Google Scholar
Makino, Y. et al. Influence of low o2 and high co2 environment on changes in metabolite concentrations in harvested vegetable soybeans. Food Chem. 317, 126380 (2020).
Article CAS PubMed Google Scholar
Yu, D. et al. Physical and chemical properties of edamame during bean development and application of spectroscopy-based machine learning methods to predict optimal harvest time. Food Chem. 368, 130799 (2022).
Article CAS PubMed Google Scholar
Krinsky, B. et al. The development of a lexicon for frozen vegetable soybeans (edamame). J. Sens. Stud. 21, 644–653 (2006).
Article Google Scholar
Song, J., Liu, C., Li, D. & Gu, Z. Evaluation of sugar, free amino acid, and organic acid compositions of different varieties of vegetable soybean (glycine max [l.] merr.). Indust. Crops Prod. 50, 743–749 (2013).
Article CAS Google Scholar
Li, Y.-S. et al. Greater differences exist in seedprotein, oil, total soluble sugar and sucrosecontentof vegetable soybean genotypes [’glycine max’(l.) merrill]. in northeast china. Aust. J. Crop Sci. 6, 1681–1686 (2012).
CAS Google Scholar
Czaikoski, K. et al. Canning of vegetable-type soybean in acidified brine: Effect of the addition of sucrose and pasteurisation time on color and other characteristics. Indust. Crops Products 45, 472–476 (2013).
Article CAS Google Scholar
Zhang, W. et al. Genetic and regulatory mechanisms of sucrose in soybean seeds for vegetable use: A research progress. Acta Agric. Zhejiangensis 33, 8966 (2021).
Google Scholar
Bu, Y. et al. Conditional and unconditional qtl analyses of seed hardness in vegetable soybean (glycine max l. merr.). Euphytica 214, 1–21 (2018).
Article CAS ADS Google Scholar
Simmler, C., Napolitano, J. G., McAlpine, J. B., Chen, S.-N. & Pauli, G. F. Universal quantitative nmr analysis of complex natural samples. Curr. Opin. Biotechnol. 25, 51–59 (2014).
Article CAS PubMed Google Scholar
Polder, G., van der Heijden, G. W. & Young, I. T. Spectral image analysis for measuring ripeness of tomatoes. Trans. ASAE 45, 1155 (2002).
Article Google Scholar
Kim, M. S., Chen, Y. & Mehl, P. Hyperspectral reflectance and fluorescence imaging system for food quality and safety. Trans. ASAE 44, 721 (2001).
Google Scholar
Peirs, A., Lammertyn, J., Ooms, K. & Nicolaı, B. M. Prediction of the optimal picking date of different apple cultivars by means of vis/nir-spectroscopy. Postharvest. Biol. Technol. 21, 189–199 (2001).
Article Google Scholar
Gómez-Sanchis, J. et al. Detecting rottenness caused by penicillium genus fungi in citrus fruits using machine learning techniques. Expert Syst. Appl. 39, 780–785 (2012).
Article Google Scholar
El-Bendary, N., El Hariri, E., Hassanien, A. E. & Badr, A. Using machine learning techniques for evaluating tomato ripeness. Expert Syst. Appl. 42, 1892–1905 (2015).
Article Google Scholar
Prasanna, V., Prabha, T. & Tharanathan, R. Fruit ripening phenomena—An overview. Crit. Rev. Food Sci. Nutrit. 47, 1–19 (2007).
Article CAS Google Scholar
Pandey, R., Naik, S. & Marfatia, R. Image processing and machine learning for automated fruit grading system: A technical review. Int. J. Comput. Appl. 81, 29–39 (2013).
Google Scholar
Cubero, S., Aleixos, N., Moltó, E., Gómez-Sanchis, J. & Blasco, J. Advances in machine vision applications for automatic inspection and quality evaluation of fruits and vegetables. Food Bioprocess Technol. 4, 487–504 (2011).
Article Google Scholar
Elhariri, E., El-Bendary, N., Hussein, A. M., Hassanien, A. E. & Badr, A. Bell pepper ripeness classification based on support vector machine. In 2014 International Conference on Engineering and Technology (ICET), 1–6 (IEEE, 2014).
Castro, W. et al. Classification of cape gooseberry fruit according to its level of ripeness using machine learning techniques and different color spaces. IEEE Access 7, 27389–27400 (2019).
Article Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778 (2016).
Femling, F., Olsson, A. & Alonso-Fernandez, F. Fruit and vegetable identification using machine learning for retail applications. In 2018 14th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), 9–15 (IEEE, 2018).
Vaviya, H., Yadav, A., Vishwakarma, V. & Shah, N. Identification of artificially ripened fruits using machine learning. In 2nd International Conference on Advances in Science & Technology (ICAST) (2019).
Mukhiddinov, M., Muminov, A. & Cho, J. Improved classification approach for fruits and vegetables freshness based on deep learning. Sensors 22, 8192 (2022).
Article PubMed PubMed Central ADS Google Scholar
Zin, T. T., Tin, P., Toriu, T. & Hama, H. Background modeling using special type of markov chain. IEICE Electr. Express 8, 1082–1088 (2011).
Article Google Scholar
Liu, N. et al. Genome sequencing and population resequencing provide insights into the genetic basis of domestication and diversity of vegetable soybean. Hortic. Res. 9, 899 (2022).
Article MathSciNet Google Scholar
Kumar, V. et al. Evaluation of vegetable-type soybean for sucrose, taste-related amino acids, and isoflavones contents. Int. J. Food Prop. 14, 1142–1151 (2011).
Article CAS Google Scholar
Buysse, J. & Merckx, R. An improved colorimetric method to quantify sugar content of plant tissue. J. Exp. Bot. 44, 1627–1629 (1993).
Article CAS Google Scholar
Huang, L. et al. Impact of tempeh flour on the rheology of wheat flour dough and bread staling. LWT 111, 694–702 (2019).
Article CAS Google Scholar
Song, J.-Y., An, G.-H. & Kim, C.-J. Color, texture, nutrient contents, and sensory values of vegetable soybeans [glycine max (l.) merrill] as affected by blanching. Food Chem. 83, 69–74 (2003).
Article CAS Google Scholar
Lawless, H. T., Heymann, H. et al. Sensory evaluation of food: Principles and practices, vol. 2 (Springer, 2010).
Xu, Y. et al. Physicochemical, functional and microstructural characteristics of vegetable soybean (glycine max) as affected by variety and cooking process. J. Food Meas. Charact. 9, 471–478 (2015).
Article CAS Google Scholar
Xiang, Y. et al. Deep learning and hyperspectral images based tomato soluble solids content and firmness estimation. Front. Plant Sci. 13, 860656 (2022).
Article PubMed PubMed Central Google Scholar
Su, Z. et al. Application of hyperspectral imaging for maturity and soluble solids content determination of strawberry with deep learning approaches. Front. Plant Sci. 12, 736334 (2021).
Article PubMed PubMed Central Google Scholar
Fu, D., Zhou, J., Scaboo, A. M. & Niu, X. Nondestructive phenotyping fatty acid trait of single soybean seeds using reflective hyperspectral imagery. J. Food Process Eng. 44, e13759 (2021).
Article CAS Google Scholar
Xiong, Z., Sun, D.-W., Pu, H., Zhu, Z. & Luo, M. Combination of spectra and texture data of hyperspectral imaging for differentiating between free-range and broiler chicken meats. LWT Food Sci. Technol. 60, 649–655 (2015).
Article CAS Google Scholar
Myles, A. J., Feudale, R. N., Liu, Y., Woody, N. A. & Brown, S. D. An introduction to decision tree modeling. J. Chemom. J. Chemom. Soc. 18, 275–285 (2004).
CAS Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Hastie, T., Rosset, S., Zhu, J. & Zou, H. Multi-class adaboost. Stat. Interface 2, 349–360 (2009).
Article MathSciNet Google Scholar
Kramer, O. & Kramer, O. K-nearest neighbors. Dimensionality reduction with unsupervised nearest neighbors 13–23 (2013).
Chu, H. et al. Hyperspectral imaging with shallow convolutional neural networks (scnn) predicts the early herbicide stress in wheat cultivars. J. Hazard. Mater. 421, 126706 (2022).
Article CAS PubMed Google Scholar
Zhou, L. et al. Wheat kernel variety identification based on a large near-infrared spectral dataset and a novel deep learning-based feature selection method. Front. Plant Sci. 11, 575810 (2020).
Article PubMed PubMed Central Google Scholar
Di, W., Zhang, L., Zhang, D. & Pan, Q. Studies on hyperspectral face recognition in visible spectrum with feature band selection. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 40, 1354–1361 (2010).
Article Google Scholar
Guo, Z., Zhang, D., Zhang, L. & Liu, W. Feature band selection for online multispectral palmprint recognition. IEEE Trans. Inf. Foren. Secur. 7, 1094–1099 (2012).
Article Google Scholar
Székely, G. J. & Rizzo, M. L. Brownian distance covariance. Ann. Appl. Stat. 63, 1236–1265 (2009).
MathSciNet Google Scholar
Edelmann, D., Móri, T. F. & Székely, G. J. On relationships between the pearson and the distance correlation coefficients. Stat. Prob. Lett. 169, 108960 (2021).
Article MathSciNet Google Scholar
Polder, G. & van der Heijden, G. Measuring ripening of tomatoes using imaging spectrometry. In Hyperspectral imaging for food quality analysis and control, 369–402 (Elsevier, 2010).
Liu, C. et al. Establishment of non-destructive discrimination model for egg freshness based on fusion technology of hyperspectral image and spectral features. J. Food Sci. Technol. 40, 172–182 (2022).
Google Scholar
Villa, A., Chanussot, J., Benediktsson, J. A., Jutten, C. & Dambreville, R. Unsupervised methods for the classification of hyperspectral images with low spatial resolution. Patt. Recogn. 46, 1556–1568 (2013).
Article ADS Google Scholar
Villa, A., Chanussot, J., Benediktsson, J. A. & Jutten, C. Spectral unmixing for the classification of hyperspectral images at a finer spatial resolution. IEEE J. Sel. Top. Signal Process. 5, 521–533 (2010).
Article ADS Google Scholar
Zhang, H., Zhang, L. & Shen, H. A super-resolution reconstruction algorithm for hyperspectral images. Signal Process. 92, 2082–2096 (2012).
Article Google Scholar
Zhou, J., Kwan, C. & Budavari, B. Hyperspectral image super-resolution: A hybrid color mapping approach. J. Appl. Rem. Sens. 10, 035024–035024 (2016).
Article ADS Google Scholar

Download references

Acknowledgements

This work is supported by the Key research and development project of Zhejiang Province (2021C02052-2 and 2022C02016), Accurate Identification and Evaluation of Crop Germplasm Resources (Soybean) in 2023 (2023R23T60D04), Zhejiang Basic Public Welfare Research Project (LTGN23C150004, LGN21C150008, LGN21C150007), and the Zhejiang Provincial Important Science & Technology Specific Projects of Vegetable Breeding (2021C02065).

Author information

Authors and Affiliations

Institute of Vegetables, Key Laboratory of Vegetable Legumes Germplasm Enhancement and Southern China of the Ministry of Agriculture and Rural Affairs, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
Yuanpeng Bu, Guwen Zhang, Na Liu & Yaming Gong
Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou, China
Jinxuan Hu, Songhang Bai, Zuohui Chen, Tianyu Hu, Chang Cai, Yuhao Li, Qi Xuan, Zhongjing Su & Yun Xiang
Zhejiang Yuncheng Information technology Co Ltd., Hangzhou, China
Cheng Chen
Faculty of Engineering, Lishui University, Lishui, China
Ye Wang

Authors

Yuanpeng Bu
View author publications
You can also search for this author in PubMed Google Scholar
Jinxuan Hu
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Songhang Bai
View author publications
You can also search for this author in PubMed Google Scholar
Zuohui Chen
View author publications
You can also search for this author in PubMed Google Scholar
Tianyu Hu
View author publications
You can also search for this author in PubMed Google Scholar
Guwen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Na Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chang Cai
View author publications
You can also search for this author in PubMed Google Scholar
Yuhao Li
View author publications
You can also search for this author in PubMed Google Scholar
Qi Xuan
View author publications
You can also search for this author in PubMed Google Scholar
Ye Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhongjing Su
View author publications
You can also search for this author in PubMed Google Scholar
Yun Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Yaming Gong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.B., Y.G. and Y.X. performed the conceptualization, conducted the formal analysis and designed the experiments. Y.B., J.H., Z.S. and Y.X. conducted the experiments and wrote the manuscript. C.C. and Z.S. made an important contribution to the revision of this article. S.B., Z.C., T.H., N.L., C.C., Y.L., Q.X. and Y.W. participated the investigation and formal analysis. All authors contributed to the article and reviewed the manuscript.

Corresponding authors

Correspondence to Yun Xiang or Yaming Gong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bu, Y., Hu, J., Chen, C. et al. ResNet incorporating the fusion data of RGB & hyperspectral images improves classification accuracy of vegetable soybean freshness. Sci Rep 14, 2568 (2024). https://doi.org/10.1038/s41598-024-51668-6

Download citation

Received: 07 September 2023
Accepted: 08 January 2024
Published: 31 January 2024
DOI: https://doi.org/10.1038/s41598-024-51668-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Experimental materials

PCI extraction

Physical characterization

Chemical compositions

Image data processing

Image acquisition

RGB image processing

Hyperspectral image processing

Data fusion model

Results

Physicochemical characteristics of VS

Evaluation metrics

Evaluation results

Band ablation analysis

Discussion

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links