Early detection of plant virus infection using multispectral imaging and spatial–spectral machine learning

Peng, Yao; Dallas, Mary M.; Ascencio-Ibáñez, José T.; Hoyer, J. Steen; Legg, James; Hanley-Bowdoin, Linda; Grieve, Bruce; Yin, Hujun

doi:10.1038/s41598-022-06372-8

Download PDF

Article
Open access
Published: 24 February 2022

Early detection of plant virus infection using multispectral imaging and spatial–spectral machine learning

Yao Peng¹,
Mary M. Dallas²,
José T. Ascencio-Ibáñez³,
J. Steen Hoyer⁴,
James Legg⁵,
Linda Hanley-Bowdoin²,
Bruce Grieve¹ &
…
Hujun Yin¹

Scientific Reports volume 12, Article number: 3113 (2022) Cite this article

7057 Accesses
13 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Cassava brown streak disease (CBSD) is an emerging viral disease that can greatly reduce cassava productivity, while causing only mild aerial symptoms that develop late in infection. Early detection of CBSD enables better crop management and intervention. Current techniques require laboratory equipment and are labour intensive and often inaccurate. We have developed a handheld active multispectral imaging (A-MSI) device combined with machine learning for early detection of CBSD in real-time. The principal benefits of A-MSI over passive MSI and conventional camera systems are improved spectral signal-to-noise ratio and temporal repeatability. Information fusion techniques further combine spectral and spatial information to reliably identify features that distinguish healthy cassava from plants with CBSD as early as 28 days post inoculation on a susceptible and a tolerant cultivar. Application of the device has the potential to increase farmers’ access to healthy planting materials and reduce losses due to CBSD in Africa. It can also be adapted for sensing other biotic and abiotic stresses in real-world situations where plants are exposed to multiple pest, pathogen and environmental stresses.

Early detection of Solanum lycopersicum diseases from temporally-aggregated hyperspectral measurements using machine learning

Article Open access 11 May 2023

Early warning and diagnostic visualization of Sclerotinia infected tomato based on hyperspectral imaging

Article Open access 07 December 2022

Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections

Article Open access 30 November 2021

Introduction

The advent of digital technology has been making an impact on growing number of areas including agriculture. There is a pressing need for better management of limited resources and optimisation of cultivation practice, including early detection of plant diseases. While end-point PCR is often the preferred diagnostic method for detection of viral nucleic acid in field-collected samples, it is dependent on expensive instrumentation, time consuming and often cannot reliably detect virus early in infection, as seen for cassava brown streak disease (CBSD)¹. Imaging technology has been applied to the analysis of plant conditions and nutrition levels, either based on visual traits or certain spectral properties reflected by the conditions or diseases². Hyperspectral imaging (HSI) and multispectral imaging (MSI) have become increasingly available and affordable techniques that offer many advantages over conventional RGB imaging. RGB imaging has been used to recognise visual symptoms of the disease³, while plant nutritional conditions and metabolic or biotic changes due to disease may be reflected in certain spectral wavelengths beyond the RGB channels^3,4,5. These subtle signs in vast amounts of spectral and spatial imaging data can be successfully detected using advanced machine learning techniques. With the rapid advancement of imaging sensors, MSI systems have become smaller and are able to be applied in real-time and in-field⁶. This paper describes the application of a custom built active MSI (A-MSI) device and a machine learning method that leverages both spectral and spatial information of the imagery data for early detection of CBSD.

Cassava, Manihot esculenta Crantz, produces starchy tuberous roots and is one of the important staple food crops in the developing world^7,8. It is cultivated primarily by smallholder farmers. Cassava production in Africa is limited by two viral diseases, cassava mosaic disease (CMD) and CBSD. Together these diseases cause severe economic losses and threaten food security⁹. CMD has been extensively studied and sources of endogenous resistance have been identified and deployed^{10,11,12,13,14,15,16}. Unfortunately, many farmer-preferred cassava cultivars, like the CMD2-resistant cultivar TME204, are highly susceptible to CBSD. CBSD, which was first reported in the coastal areas of Tanzania¹⁷, has emerged recently as serious threat to food production^14,18. The rapid spread of CBSD throughout east and central Africa resulted in research targeting development of new cassava cultivars that are resistant to both CBSD and CMD^8,19,20,21.

CBSD is caused by two closely related RNA viruses, cassava brown steak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV), in the Ipomovirus genus of the Potyviridae family. These viruses are transmitted from plant to plant by the whitefly Bemisia tabaci²². The viruses are also propagated from one season’s crop to the next through the use of stem cuttings obtained from infected plants. CBSD typically induces only mild foliar symptoms that can be difficult to discern. The subtle symptoms can make it difficult even for experts to identify infected plants in the field, and the call is often complex and based on many visual cues. However, the tuberous roots of infected cassava plants have prominent necrotic lesions that can spread throughout the entire root structure, rendering it inedible. Because the necrotic lesions or root rot are only discovered when cassava is harvested, farmers often do not know that their crop is infected with CBSD until harvest at 9–12 months after planting.

Although considerable effort has been devoted to the search for strong sources of resistance to CBSD, the progress has been slow because of the lack of a rapid, reliable method for diagnosing CBSD during early stages of infection. Historically, diagnosis is based on subtle symptoms²³ and scoring requires visible symptoms characteristic of established infections and does not distinguish resistant from tolerant plants. The use of molecular techniques to screen for the presence or absence of viral RNA has only recently been implemented in a few African national research programmes because of the technical requirements, cost, and time constraints involved in screening large numbers of plants^8,21,24. To address these constraints, we have developed an active multispectral imaging (A-MSI) sensor system enhanced with machine learning as a screening platform for virus infection.

Multispectral imaging of plant viral infections generates massive amount of data. Machine learning techniques have become the most efficient means of extracting useful information from the wealth of the data available and for detecting the underlying relationships between certain mechanisms or functions under study and a large number of contributing parameters. Our previous studies^25,26 have used feature selections and a novelty detection classifier to distinguish healthy sugar beet plants from rust-diseased plants or stressed from control Arabidopsis plants using laboratory-based hyperspectral imaging systems. Performance was significantly higher than conventional vegetation indices such as NDVI (the normalized difference vegetation index).

In the study reported here, an A-MSI device and machine learning were combined to probe the early detection of CBSD. Such an approach alleviates the burden of using an expensive and precise MSI system. Machine learning techniques can effectively make sense of plant conditions even with a low-cost, compact and less precise MSI device. Combining spectral and spatial features of the scans, machine learning identified significant differences at a high confidence between healthy cassava plants and plants inoculated with UCBSV in four experimental trials. The approach reliably detects CBSD much earlier and in a faster and much less invasive manner than end-point PCR. The integrated handheld device with advanced machine learning should make it possible to detect disease in cassava fields early in the growing season, at a time when farmers can replant with virus-free cassava cuttings and should improve the efficiency of the work of cassava breeders in selecting for resistance to CBSD.

Table 1 Experimental design of the Cassava-TME204-UCBSV A-MSI trials.

Full size table

Results

Experimental settings

In three independent trials, we used the cassava cultivar TME204, which is susceptible to CBSD, to generate three treatment groups—control, infected and mock-inoculated (18 plants in each group, except for Trial 1—see Table 1). The infected group was inoculated with plasmid DNA corresponding to an infectious clone for the UCBSV Kenyan isolate 125^27,28,29. The mock group was inoculated with an ‘empty’ control E. coli plasmid using the same protocol and the untreated group was not subjected to inoculation. All of the plants were grown together in an insect-free plant growth chamber for the duration of each experiment. The three treatment groups were visually indistinguishable at 7, 14 and 21 days post inoculation (dpi) (Fig. 1, panel B). At 28 dpi, a few plants (15%) inoculated with UCBSV displayed very mild symptoms on a single leaf. For plants showing symptoms, their severity slowly increased over the next 8 weeks. By 87 dpi, all of the plants in the UCBSV-inoculated treatment group displayed symptoms with an average symptom score of 2.6 (out of 4), highlighting the mild nature of the symptom phenotype (Fig. 1, panel A). UCBSV infection was confirmed in 10 plants at 88 dpi by end-point RT-PCR of viral RNA (Fig. 1, panel C). We did not analyze viral RNA in the other 8 UCBSV-inoculated plants because the RcbS positive control could not be amplified from the samples. Viral RNA could also be detected at 52 dpi in some plants but with variable results. During the timeframe of the experiment, no symptoms were seen on plants in the other treatment groups and the RT-PCR results were negative for the control and mock groups. These results indicate that inoculation was highly efficient, consistent with the original report of this infectious clone²⁹, but do not guarantee that 100% of plants in the virus-inoculated group were infected. Some of the classification results presented below were therefore done using only the 10 plants with PCR-confirmed infection (“Trial 2 with PCR”).

Scanning of cassava leaves

A single leaf (usually the second visible leaf from the apex) from each plant was screened using the A-MSI device, producing leaf images across 14 wavelengths. Twelve small patches were automatically cropped out from random spatial locations in the leaf area of each image at each wavelength, avoiding the leaf clapping grids and main leaf veins. This represents a simple approach to considering spatial variation in contrast to using the whole leaf for classification. Cropped patches were of various sizes, varying from 16 × 16 to 48 × 48 pixels, which covered sufficient spatial variations. Examples are shown in Fig. 2.

Performance for cassava disease detection

From each cropped patch region of a scanned leaf across the wavelengths, a variety of spectral and spatial features were extracted and investigated, including six vegetation indices (VIs), patch-based spectra, as well as patch-based texture extracted by Markov random field (MRF) probabilistic texture model. For examples, patch-based spectra are the averaged spectral reflectance values across all wavelengths within a patch region, while patch-based MRF textural features are the MRF model coefficients. Details of various vegetation indices are given in the “Methods” section. Support vector machine (SVM) was used as the classifier, along with various information fusion schemes, including the proposed probabilistic decision fusion method (ProbDecFus) for more reliable classification. For example, spatial–spectral fusion combines spatial features (MRF) and spectral features (e.g. patch-based spectra) for classification or integrates classifiers built on spatial features and classifiers built on spectral features. Details of various schemes are described in the “Methods” section.

Table 2 Classification accuracy (%) at 28 dpi on leaf scans of three trials of Cassava-TME204-UCBSV and one trial of Cassava-Kiroba-UCBSV.

Full size table

Performance for detecting CBSD in TME204 has been investigated with various methods described in the “Methods” section for utilising these spectral and spatial features as well as classifier fusions, divided into to the following three categories, with corresponding results at 28 dpi shown in Table 2. For each of the three groups or pairs (control vs. infected, mock vs. infected, and control vs. mock), results are divided into the following three subgroups.

1.
Conventional spectral methods (1st two rows in the table, i.e. Vegetation indices and Spectral reflectance (whole leaf)). In the vegetation indices method, each of six VIs was calculated and averaged from cropped patch regions and six VIs were then concatenated for classification. The spatial reflectance (whole leaf) method refers to using the averaged spectral reflectance (spectrum) from each leaf.
2.
Spatial–spectral methods (next two rows). Spectral reflectance (patch-based) refers to using patch-based spectra for classification. Instead of using averaged spectra from entire leaves, patch-based spectra represents the simplest spatial–spectral approach. The spatial–spectral fusion (patch-based) method refers to the use of concatenated patch spectrum and MRF texture parameters for classification.
3.
Spatial–spectral and classifier fusion methods (the remaining five rows). Decision fusion (average) refers to fusion of three classifiers trained on VIs, spectral reflectance and MRF texture parameters extracted from patches. The patch-based voting scheme regards a leaf as infected when at least half of the patches extracted from the leaf are classified as infected. The scheme was used with either spectral information (spectral reflectance (patch-based voting)) or spatial–spectral information, i.e. concatenated patch spectrum and MRF parameters (spatial–spectral fusion (patch-based voting)). The proposed probabilistic decision fusion method (ProbDecFus) combines VIs, MRF texture features and spectral reflectances to generate reliable classification. When the method is used on patches, we refer to it as ProbDecFus (patch-based). When it is used with patch-based voting, the method becomes ProbDecFus (patch-based voting).

For a fair and comprehensive comparison, we separately trained the classifiers on three pairs of sample sets: control versus infected, mock versu infected, and control versus mock. A leave-one-leaf-out scheme was adopted in Trial 2 and Trial 3 of TME204 as the numbers of leaf samples in the three conditions (control, infected and mock) were 18:18:18. Trial 1 of TME204 however had unbalanced sample numbers (24:12:12). We therefore randomly selected half of the control leaves each time and then performed the leave-one-leaf-out training. The random selection process of control samples was repeated at least 5000 times and the averaged classification results were produced. For finding the best hyperparameters of the SVM in each training, we randomly chose half of the training samples as the validation set and optimised the SVM classifier using the grid search algorithm.

As shown in Table 2, at 28 dpi the proposed ProbDecFus with patch-based voting achieved a classification accuracy of 87.3% for Trial 1, 98.5% for Trial 2 and 93.6% for Trial 3 on control versus infected. The methods using patch-based spatial information (fusion or not) greatly boosted classification compared to using the whole leaf. Using additional MRF texture features further improves and stablises the performance. For the “Trial 2 w PCR” results, the classifiers were trained only from those leaves with detected UCBSV by end-point RT-PCR (Fig. 1, panel C). The similarities between this column and “Trial 2” column confirm the effectiveness of the A-MSI device and classification method in detecting CBSV at 28 dpi.

The progressive detection performances on Trial 1 are shown as an example as it has the longest time course. Figure 3 depicts the classification results of various methods over leaf samples at 7, 28, 53 and 88 dpi. These graphs illustrate again that combining spatial and spectral information gives an edge over other ways of utilising the available information and significantly outperforms the use of vegetation indices. It is worth noting that although MRF is a powerful model for describing spatial dependence, it does not deliver convincing results when used alone.

Performance on a tolerant cultivar

The CBSD tolerant cultivar, Kiroba³⁰, was also tested in a single trial to verify early detection of CBSD. Eighteen plants in each group (control, infected and mock) were used in the trial using an experimental procedure similar to that for TME204 as described before. Scans of the leaves took place at 14, 26, 53 and 91 dpi. No clearly visible symptoms were observed during the course of the trial. Classifications between control versus infected, mock versus infected and control vs. mock were performed and results of 28 dpi are shown in the “Kiroba” column of Table 2. Performances across all time courses are plotted in Fig. 4. As can be seen these results are in line with that of TME204, indicating early CBSD detection at 28 dpi in a tolerant, asymptomatic cultivar. While at 14 dpi, most classifiers, except for Vegetation Indices, whole leaf and MRF, can already separate with over 70% accuracy between the infected and control. After 28 dpi, classification performances do not seem to increase further, unlike in the case of TME204; this could be well due to the tolerant nature of Kiroba and its restrictive virus accumulation³⁰.

Discussion

Several technologies have been used for diagnosis of plant viral diseases, including, enzyme-linked immunosorbent assay (ELISA), loop mediated isothermal amplification (LAMP)³¹, and PCR. All three have been used to detect CBSV and UCBSV, with PCR being the most widely used. RNA isolation from non-model plants like cassava is often technically challenging, whereas the scanning approach described here does not require RNA isolation. To date, the high cost and need for laboratory resources have been prohibitive for mass deployment of these technologies for real-time, in-field detection of CBSD. Imaging offers an alternative to indirectly sense subtle biological changes reflected in plant leaves as a result of viral infection. Such an approach is almost instant once configured and can be portable, hence offering the potential for widespread, low-cost deployment. The results in this study also show that such changes can be detected at a very early stage, well before symptoms emerge, whereas end-point PCR depends on viral RNA accumulation to detectable levels, which occurs later in infection. Across the three trials of TME204, the available common sampling points were at 7, 28 and 53 dpi, as shown in Table 1. Trials 1 and 3 were also scanned at 14 dpi and detection of differences was also highly probable (> 80%). Further optimization of the spectral profile of the A-MSI device may enable still earlier detection of CBSD. In principle, early detection of infection in primary whitefly-infected leaves could even enable removal of infected tissue before the virus can move systemically thus eliminating infection to preserve the yield of individual plants.

Machine learning approaches for analysis of MSI data can effectively even out inaccuracies of the imaging to some extent. Even with intuitively chosen wavelengths, as compared to those strictly optimized through many rounds of cross-validation process, machine learning has proven useful and effective. This in principle is in line with its broad deviation from the traditional orthogonal approach to information processing. With an active MSI system, more crop or growth conditions could be investigated using the modulation properties of various wavelengths of electromagnetic spectrum in response to metabolic changes in the organism, as well as translucent properties of the plant leaves, extending the imaging approach from spectral and spatial to temporal and transitive. We are optimistic that refinements of this approach in future field trials may be useful for early detection of infection in a wide range of crop pathosystems.

To further demonstrate the generalizability of the developed spatial–spectral machine learning method to other HSI/MSI applications, a public benchmark HSI dataset was used. The Indian Pine dataset is a HSI image dataset on land coverage³². Each image is of 145 by 145 pixels with a spatial resolution of 20 m covering 16 different crops, provided in the ground truth reference as detailed in³². The dataset was captured by the AVIRIS sensor at the Indian Pines test site in June 1992. The data contains a subset of a full scene that covers portions of Northwestern Tippecanoe County, IN, USA. The dataset is widely used in HSI analysis for validating classification efficiency. Comparisons with the state-of-the-art methods are presented in Table 3. The classification was performed using a 5-fold cross-validation strategy, and repeated 10 times to achieve satisfactory precision. Convolutional neural networks (CNNs) were used on the extracted spectral-spatial features. The baseline model, named 2D-CNN, consisted of seven main blocks (architecture: 1 × 20-(8C3-8C3)-16R3-32R3-64R3-128R3-256R3-(512FC-16FC)). The first block used two convolutional layers, each containing 8 filters of 1 × 3 with zero padding. After subsequent five residual blocks, two fully connected layers were used with a softmax layer to produce probabilistic output over each class. Batch normalisation was inserted in each convolutional layer between the convolutional and ReLU activation. A max-pooling layer was also inserted in each residual block after the addition function. The developed spatial–spectral Net ($\hbox {SSFNet}_{\text{2D}}$) had the same architecture as the baseline except that the input was further combined with MRF texture parameters extracted from the 25 × 25 surrounding neighbourhood of the centre pixels. The networks were trained using the Adam optimiser for 200 epochs with a batch size of 4. The learning rate started at 0.001 and was decreased using the polynomial scheduler. Again, the inclusion of spatial features was beneficial, resulting in more accurate and more stable classification results.

Table 3 Class-specific accuracies (%) on Indian Pines dataset.

Full size table

Methods

Active multispectral imaging (A-MSI) system

A handheld active multispectral imaging (A-MSI) prototype, developed at the e-Agri Sensors Centre, the University of Manchester, was used to obtain the data presented in this study. The sensor system exploits a modified proprietary digital imaging detectors appropriately engineered within the active optical assembly, also to enable the vastly overlapping ‘Nth-order’ molecular vibrational harmonics, from the near-infrared and visible 2D time-series data, to be deconvoluted. Isotropic illumination is achieved, with minimised specular reflectance, via a combination of an integrating hemisphere, optical diffuser and appropriately arranged narrow-band semiconductor sources (LEDs). The latter cover 15 wavebands using 10 LEDs per waveband, at the wavelengths detailed in the Table 4. Custom drive electronics are then used to enable the multispectral frames to be compiled within a parallel processing unit (NVDIA Jetson Nano). The variant of A-MSI adopted in the study is distinct from more traditional passive multispectral imaging (MSI) systems³⁸ as closed-loop control of the illumination power at each detection band enables highly repeatable measurements to be undertaken with significantly greater signal-to-noise ratio (SNR) than that by a filtered or dispersive-element based passive MSI sensor-system. This is due, in part, to the variability in illumination angle, spectral composition and polarisation of ambient illumination. The prototype instrument used in the study is shown in Fig. 5, which depicts the as-built unit and the inset measurement chamber, relative position of the LED array, and camera assembly within the integrating hemisphere. The system automatically calibrates the illumination level for each band at the start of each scanning process.

Table 4 Detailed wavelength information used in the A-MSI system.

Full size table

Cassava cultivation and virus inoculation

Cassava cuttings (Manihot esculenta cultivar TME204 or Kiroba) were originally provided by J. Ndunguru (Tanzania Agricultural Research Institute). Import and containment of plant cuttings and the UCBSV infectious clone (pLX-UCBSVi) followed international, U.S. (Department of Agriculture Animal and Plant Health Inspection Service), and institutional guidelines. Plants were propagated at 28 °C in a 12-h light/12-h dark cycle. For each experiment, 18 plants at the 6-leaf stage were inoculated under the apical meristem area using a microsprayer and 40 psi helium to deliver gold particles coated with plasmid DNA³⁹ (Venganza, Inc.). Each plant was inoculated with 1.67 μg of plasmid DNA corresponding to pLX-UCBSVi (infected)^28,29 or pUC119 (mock). pLX-UCBSVi contained an E35S expression cassette driving transcription of UCBSV Ke_125 (GenBank accession KY825166²⁷). Untreated plants (18) were not subjected to the inoculation treatment. Plants were monitored for symptoms and were scored on a scale of 1 (no symptoms), 2 (small yellow blotches on 1 leaf), 3 (yellow blotches on two leaves), and 4 (yellow blotches and yellowing along veins on multiple leaves).

TME204 samples (1 mg) were collected at 88 dpi near the petiole of leaf 2 (relative to the plant apex), flash-frozen in liquid nitrogen, and stored at − 18 °C. Leaf samples were ground in a Retsch Mixer Mill, followed by RNA extraction using a Qiagen RNeasy Plant Mini kit. RNA concentration was measured using the Qubit RNA BR Assay Kit (Invitrogen). cDNA was synthesized in reactions containing 0.5 μg total RNA, oligo d(T)18 mRNA primer (60 μM), RNasin (40 U/μL), dNTPs (10 mM), and M-MuLV reverse transcriptase (200 U/μL) using the following conditions: 5 min at 25 μC , 60 min at 42 μC, 20 min at 65 μC. cDNA (5 μL) was amplified in PCR reactions comprising 10 pMol primers, 10 mM dNTPs, and Hot Start Taq polymerase (NEBioLabs). UCSBV RNA was amplified across the 3′ NIa-Pro and 5′ NIb coding sequences using the primer pair—UCBSV-F (GGGTTCCATAGTGGTGTGATTAG) and UCBSV-F (CTCGAACTGGCTCATTGTACTT). The cassava RbcS transcript (Manes.05G137400.1), which served as a positive control for mRNA and cDNA quality, was amplified using the primer pair—RBC-1 (CTACTATGGTGGCTCCGTTC) and RBC-2 (CCGTTCAGTCGGAGAAACTC). Both sequences were amplified for 30 cycles using the following conditions: 95 °C denaturation for 60 s, 51 °C annealing for 60 s, 68 °C extension for 60 s. The PCR products were resolved on 1% agarose gels. The UCBSV products were purified using the Qiagen PCR purification kit and verified by Sanger sequencing.

Cassava leaf MSI acquisition

Leaves were detached from the TME204 plants at 7, 28, 53 and 88 dpi and from Kiroba plants at 14, 26, 53 and 91 dpi. The adaxial and abaxial surfaces of each leaf were scanned using a handheld multispectral imaging instrument. The leaves were sampled from the same position on each plant (leaf 2 or leaf 6 counting from the plant apex). The plants were also scored for symptom development at the same dpi using a scale from 1 (no visible symptoms) to 4 (severe symptoms). Details of the experimental design are shown in Table 1.

Data preprocessing

In the experiments, multispectral scans of cassava leaves were sampled by automatically cropping out patches. Twelve patches were cropped out at random spatial locations of the leaf region from each leaf scan. For spectral analysis, the number of pixels from the cropped leaf area were averaged over each wavelength range to reduce the variability in pixel intensities and produce the spectral signature. Examples are shown in Fig. 2. Such patch-based spectral information represents the simplest approach to consider spatial variation. Averaged reflectance across the entire leaf was used for comparisons, in which leaf segmentation was performed and average reflectance calculated.

Vegetation indices (VIs) calculation

The spectral signatures from each cropped patch were extracted and averaged to calculate empirical VIs. Based on the 14 wavelength bands provided by the A-MSI, six empirical indices were extracted and analysed to study plant properties and conditions. The primary formulation is the Carter index (CI), a strong indicator for plant stress, which measures the ratio between reflectance at 695 nm and 420 nm⁴⁰,

$$\begin{aligned} \text {CI}=\frac{R_{695}}{R_{420}}. \end{aligned}$$

(1)

The modified chlorophyll absorption in reflectance index (MCARI) measures the depth of chlorophyll absorption at 670 nm relative to the green reflectance peak at 550 nm and the reflectance 700 nm⁴¹,

$$\begin{aligned} \text {MCARI}=[(R_{700}-R_{670}-0.2(R_{700}-R_{550})]\frac{R_{700}}{R_{670}}. \end{aligned}$$

(2)

The optimised index transformed chlorophyll absorption in reflectance index (TCARI) was studied as being more sensitive to chlorophyll content, thus avoiding the influence by canopy and soil reflectance values⁴²,

$$\begin{aligned} \text {TCARI}=3\left[\left(R_{700}-R_{670}-0.2(R_{700}-R_{550}\right)\left(\frac{R_{700}}{R_{670}}\right)\right]. \end{aligned}$$

(3)

The photochemical reflectance index (PRI) measures the normalised difference VI of reflectivity at 531 nm and 570 nm and has been developed for disease detection⁴³,

$$\begin{aligned} \text {PRI}=\frac{(R_{531}-R_{570})}{(R_{531}+R_{570})}. \end{aligned}$$

(4)

The disease water stress index (DSWI) is the ratio between reflectances at 550 nm and 680 nm⁴⁴,

$$\begin{aligned} \text {DSWI}=\frac{R_{550}}{R_{680}}. \end{aligned}$$

(5)

And the healthy index (HI)⁴⁵ can be expressed by,

$$\begin{aligned} \text {HI}=\frac{(R_{534}-R_{698})}{(R_{534}+R_{698})}-0.5R_{704}. \end{aligned}$$

(6)

Conventional decision fusion techniques

The general idea of classifier/decision fusion can be summarised as merging multiple learners or classifiers to produce the best possible decision so as to enhance the prediction performance over a single classifier. By taking into account the outputs of all classifiers, combinations of multiple classifiers minimise the risk of choosing a weak classifier, stabilise results of random classifiers and increase the robustness of the decisions⁴⁶. Classifier/decision fusion has been an active research topic in machine learning since the late twentieth century and much of the effort has been devoted to combining classifiers for decision making in several pattern recognition applications^47,48,49,50. Typically, in a multiple classifier system, there are two general approaches to obtaining the final decision⁴⁶:

1.
Selection: Assuming complementary classifiers, only a single selected classifier contributes to the final decision.
2.
Fusion: Assuming competitive classifiers, the integration of all classifiers determines the final decision.

Based on the output information of classifiers, fusion can be divided into three levels⁵¹:

1.
Abstract level: Each classifier only outputs the predicted class label for each input. An abstract level combiner includes weighted or unweighted versions of the majority vote.
2.
Rank level: For each input, classifiers rank all labels or classes and produce a list of possible predictions.
3.
Measurement level: Instead of class labels, each classifier outputs the probability or confidence score for each class. The measurement level contains the most information among these three levels, making it possible to incorporate with various combiners (e.g. average, maximum, minimum and product), by using either selection or fusion methods.

Various methods in the literature are also concerned with how the final outputs can be combined. Majority vote is the simplest and most used combiner, in which the ensemble of classifiers choose the class that receives the highest number of votes. The fusion scheme for the unweighted majority voting can be described as,

$$\begin{aligned} {\hat{y}}=\arg \max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}\sum _{i=1}^{L}{\hat{y}}_{i,j}, \end{aligned}$$

(7)

where $\{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}$ are the C possible classes that an input is to be assigned to, L denotes the total number of classifiers, ${\hat{y}}_{i,j}$ is the predicted output of the ith classifier for the jth class, and ${\hat{y}}$ represents the final decision. In cases where each classifier contributes unequally to the fusion output, a weighted majority vote scheme can be employed by associating a weighting $w_{i}$ for ith classifier, and the decision becomes,

$$\begin{aligned} {\hat{y}}=\arg \max _{j \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}\sum _{i=1}^{L}w_{i}{\hat{y}}_{i,j}. \end{aligned}$$

(8)

Apart from the majority voting, multiple rules can be applied at the measurement level^49,52. The maximum, minimum or average rule finds the maximum, minimum or average probability of each class among the classifiers and assigns the input to the class with the maximum score among the maximum, minimum or average scores, respectively. These rules can be expressed as,

$$\begin{aligned} {\hat{y}}= \arg \max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}\max _{L}P(\theta _{j}|x_{i}), \end{aligned}$$

(9)

$$\begin{aligned} {\hat{y}}= \arg \max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}(1-\min _{L}P(\theta _{j}|x_{i})), \end{aligned}$$

(10)

$$\begin{aligned} {\hat{y}}= \arg \max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}\frac{1}{L}\sum _{i=1}^{L}P(\theta _{j}|x_{i}), \end{aligned}$$

(11)

where $P(\theta _{j}|x_{i})$ represents the estimated probability for input x that the ith classifier output $x_{i}$ belongs to the jth class $\theta _{j}$. Similarly, the product rule multiplies the probabilities or confidence scores generated by each classifier and assigns the class label with the maximum score to given input pattern,

$$\begin{aligned} {\hat{y}}=\arg \max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}\prod _{i=1}^{L}P(\theta _{j}|x_{i}). \end{aligned}$$

(12)

Markov random field texture analysis

As a fundamental image property descriptor, image texture models brightness variations in a local neighbourhood. Furthermore, image texture features are associated with various image properties such as orientation, coarseness and smoothness and quantify the spatial arrangements of pixel intensities in an image or an image region. Texture-based image analysis has been shown to be helpful in various applications such as remote sensing, medical imaging and industrial inspection.

MRFs are generative, flexible and stochastic image texture models, in which contextual dependencies and spatial interrelationships are established among image pixels or other correlated features⁵³. Due to the random nature of imaging and noise, pixels are naturally considered as random variables that are conditionally related to neighbouring variables. As undirected probabilistic graph models, MRFs not only specify the conditional dependencies between these random variables, but also interpolate the joint probability distributions with useful potential functions⁵⁴. MRF based texture analysis plays an important role in modern texture modelling and synthesis^55,56,57 as well as helps visual interpolation and image understanding^54,58,59.

A typical Gaussian MRF model is a stationary noncausal two-dimensional autoregressive process that can be expressed by a set of difference equations^53,60 as

$$\begin{aligned} f_{s}=\sum _{s+r\in N_{s}}\beta _{r}f_{s+r}+e_{s}, \end{aligned}$$

(13)

where r is the relative position with respect to central pixel s, and $\{e_{s}\}$ is a stationary Gaussian noise sequence with zero mean and standard deviation $\sigma ^{2}$ characterised by

$$\begin{aligned} E(e_{s}e_{s+r})=\left\{ \begin{array}{lll} \sigma ^{2}&{}\;\text {if } r=(0,0) \\ -\sigma ^{2}\beta _{r}&{}\;\text {if } r\ne (0,0)\\ 0&{}\;\text {otherwise} \end{array}\right. , \end{aligned}$$

(14)

where $\beta _{r}$ is the model parameter describing the relationship between pixels $f_{s}$ and $f_{s+r}$. All the parameters $\beta _{r}$ in the neighbourhood system $N_{s}$ form the parameter vector ${\varvec{\beta }}$.

In model-based texture methods, model parameters can be used as features for distinguishing textures. Model parameter estimation plays an significant role in analysing image properties and the least squares estimation is commonly used to estimate Gaussian MRF models⁶⁰. The quadratic difference $\Theta $ between the centre pixel and its neighbours can be defined as

$$\begin{aligned} \Theta =\sum _{s}\Big (f_{s}-\sum _{s+r\in N_{s}}\beta _{r}f_{s+r}\Big )^{2}. \end{aligned}$$

(15)

The least squares problem can be resolved by a close-form solution,

$$\begin{aligned} \hat{{\varvec{\beta }}}=(F_{s+r}^{T}F_{s+r})^{-1}F_{s+r}^{T}\Theta , \end{aligned}$$

(16)

where $f_{s+r}$ represents the neighbouring pixels of $f_{s}$.

ProbDecFus: probabilistic decision fusion

Support vector machines (SVMs) are commonly used machine learning algorithms for classification and regression. Given training vectors $\varvec{X}=\{\varvec{x}^{(1)},\varvec{x}^{(2)},\ldots ,\varvec{x}^{(N)}\}=\{\varvec{x}^{(k)}\}_{k=1}^{N}\in {\mathbf {R}}^{M\times N}$ and its corresponding class labels $\varvec{Y}=\{y^{(1)},y^{(2)},\ldots ,y^{(N)}\}=\{y^{(k)}\}_{k=1}^{N}$, the $\nu $-SVM⁶¹ solves the quadratic optimisation problem

$$\begin{aligned} &\min _{\varvec{\omega },b,\varvec{\xi },\rho } \frac{1}{2}\varvec{\omega }^{T}\varvec{\omega }-\nu \rho +\frac{1}{N}\sum _{k=1}^{N}\xi _{k}\\ &\text {s.t.}\;\;y^{(k)}(\varvec{\omega }^{T}\phi (\varvec{x}^{(k)})+b)\geqslant \rho -\xi _{k},\\ &\quad \nu \in (0,1], \; \xi _{k}\geqslant 0,\; \rho >0\\ \end{aligned} $$

(17)

where $\varvec{\omega }$ denotes the weight vector, b is the learning bias, $\xi $ is a non-zero slack variable, and $\nu $ is the regularisation parameter that controls the trade-off between smaller training errors and larger margins. $\nu \in (0,1]$ represents an upper bound on the fraction of training margin errors as well as a lower bound of the fraction of support vectors^61,62. Training vectors $\varvec{x}_{i}$ are mapped into a high-dimensional space by function $\phi $ though the kernel trick $K(\varvec{x}^{(i)},\varvec{x}^{(j)})=\phi (\varvec{x}^{(i)})^{T}\phi (\varvec{x}^{(j)})$. A radial basis function (RBF) is a typical kernel function

$$\begin{aligned} K\big (\varvec{x}^{(i)},\varvec{x}^{(j)}\big )=\exp \big (-\gamma \Vert \varvec{x}^{(i)}-\varvec{x}^{(j)}\Vert ^{2}\big ),\;\gamma >0, \end{aligned}$$

(18)

where $\gamma $ is the kernel parameter. Hence the predicted class labels $\varvec{{\hat{Y}}}=\{{\hat{y}}^{(1)},{\hat{y}}^{(2)},\ldots ,{\hat{y}}^{(N)}\}=\{{\hat{y}}^{(k)}\}_{k=1}^{N}$ can be obtained through the decision function,

$$\begin{aligned} {\hat{y}}=\text {sigmoid}\bigg (\sum _{k=1}^{N}\alpha _{k}y^{(k)}K(\varvec{x}^{(k)},\varvec{x})+b\bigg ), \end{aligned}$$

(19)

where $\alpha _{k}$ is the Lagrange multiplier.

In addition to predicted class labels, it is also possible to obtain an estimated probability for each class, $P(\theta _{j}|\varvec{x}^{(k)})$, by minimising the negative log likelihood and optimising the quadratic problem^62,63,64. In this study, three independent SVM classifiers were constructed based on the spectral reflectances, VIs and MRF spatial features, respectively. The spectral reflectance profiles, $\varvec{x}_\text {0}^{(k)}$, extracted from selected areas of leaves, were averaged within the regions over the entire wavelengths. The empirical VI information, $\varvec{x}_\text {VI}^{(k)}$, refers to the concatenation of six VIs (CI, MCARI, TCARI, PRI, DWSI and HI) calculated on the spectral reflectance. The MRF spatial features, $\varvec{x}_\text {MRF}^{(k)}$, were produced by estimating the texture parameters in each of the selected area. The classifier built on the spectral reflectances was considered as the baseline model, and we proposed a probabilistic decision fusion scheme, ProbDecFus, for integrating spectral and spatial information for further classification. Firstly, we calculated a threshold value $\mu $ based on the classification accuracy of the validation set ($\hbox {acc}_\text {val}$),

$$\begin{aligned} \mu =\mu _{1}*\text {acc}_\text {val} +\mu _{2}*(1-\text {acc}_\text {val}) \end{aligned}$$

(20)

where,

$$\begin{aligned} \mu _{1}=\min _{k \in \{y^{(k)}={\hat{y}}^{(k)}\}}\max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}P_\text {val}(\theta _{j}|\varvec{x}^{(k)}_{0}) \end{aligned}$$

(21)

$$\begin{aligned} \mu _{2}=\max _{k \in \{y^{(k)}\ne {\hat{y}}^{(k)}\}}\max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}P_\text {val}(\theta _{j}|\varvec{x}_{0}^{(k)}) \end{aligned}$$

(22)

Then according to the probability estimations, the final classification results were generated by weighting and fusing the three classifiers using

$$\begin{aligned} w_0^{(k)}= 1-\frac{\min _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}P(\theta _{j}|\varvec{x}_{0}^{(k)})}{\max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}P(\theta _{j}|\varvec{x}_{0}^{(k)})} \end{aligned}$$

(23)

$$\begin{aligned} w_{i}^{(k)}=\left\{ \begin{array}{ll} 1-\frac{\min _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}P(\theta _{j}|\varvec{x}_{i}^{(k)})}{\max _{\theta _{j} \in \{\theta _{1},\theta _{j},\ldots ,\theta _{C}\}}P(\theta _{j}|\varvec{x}_{i}^{(k)})} &{} w_{0}^{(k)} \leqslant w_{\mu }\\ 0 &{}w_{0}^{(k)} > w_{\mu } \end{array}\right. \end{aligned}$$

(24)

Data availability

The MSI dataset of these three trials (Cassava-TME204-UCBSV) are available at https://doi.org/10.5281/zenodo.4636968.

References

Adams, I. et al. High throughput real-time RT-PCR assays for specific detection of cassava brown streak disease causal viruses, and their application to testing of planting material. Plant. Pathol. 62, 233–242 (2012).
Article Google Scholar
Hatfield, L. J., Gitelson, A. A., Schepers, S. J. & Walthall, L. C. Application of spectral remote sensing for agronomic decisions. Agron. J. 100, 117–131 (2008).
Article Google Scholar
Lowe, A., Harrison, N. & French, A. P. Hyperspectral image analysis techniques for the detection and classification of the early onset of plant disease and stress. Plant Methods 13, 1–12 (2017).
Article ADS Google Scholar
Gates, D. M., Keegan, H. J., Schleter, J. C. & Weidner, V. R. Spectral properties of plants. Appl. Opt. 4, 11–20 (1965).
Article ADS Google Scholar
Mahlein, A. K. Plant disease detection by imaging sensors-parallels and specific demands for precision agriculture and plant phenotyping. Plant Dis. 100, 241–251 (2016).
Article PubMed Google Scholar
Owomugisha, G., Melchert, F., Mwebaze, E., Quinn, J. A. & Biehl, M. Matrix relevance learning from spectral data for diagnosing cassava diseases. IEEE Access 9, 83355–83363 (2021).
Article Google Scholar
Thresh, J. M. Control of tropical plant virus diseases. Adv. Virus Res. 67, 245–295 (2006).
Article CAS PubMed Google Scholar
Patil, B. L., Legg, J. P., Kanju, E. & Fauquet, C. M. Cassava brown streak disease: A threat to food security in Africa. J. Gen. Virol. 96, 956–968 (2015).
Article CAS PubMed Google Scholar
Legg, J. P., Owor, B., Sseruwagi, P. & Ndunguru, J. Cassava mosaic virus disease in East and Central Africa: Epidemiology and management of a regional pandemic. Adv. Virus Res. 67, 355–418 (2006).
Article CAS PubMed Google Scholar
Akano, A. O., Dixon, A. G. O., Mba, C., Barrera, E. & Fregene, M. Genetic mapping of a dominant gene conferring resistance to cassava mosaic disease. Theor. Appl. Genet. 105, 521–525 (2002).
Article CAS PubMed Google Scholar
Legg, J. P. & Fauquet, C. M. Cassava mosaic geminiviruses in Africa. Plant Mol. Biol. 56, 585–599 (2004).
Article CAS PubMed Google Scholar
Lokko, Y., Danquah, E., Offei, S., Dixon, A. G. & Gedil, M. Molecular markers associated with a new source of resistance to the cassava mosaic disease. Afr. J. Biotechnol. 4 (2005).
Patil, B. L. & Fauquet, C. M. Cassava mosaic geminiviruses: Actual knowledge and perspectives. Mol. Plant Pathol. 10, 685–701 (2009).
Article CAS PubMed PubMed Central Google Scholar
Legg, J. P. et al. Comparing the regional epidemiology of the cassava mosaic and cassava brown streak virus pandemics in Africa. Virus Res. 159, 161–170 (2011).
Article CAS PubMed Google Scholar
Okogbenin, E. et al. Molecular marker analysis and validation of resistance to cassava mosaic disease in elite cassava genotypes in Nigeria. Crop Sci. 52, 2576–2586 (2012).
Article CAS Google Scholar
Rabbi, I. Y. et al. High-resolution mapping of resistance to cassava mosaic geminiviruses in cassava using genotyping-by-sequencing and its implications for breeding. Virus Res. 186, 87–96 (2014).
Article CAS PubMed Google Scholar
Storey, H. H. Virus diseases of East African plants. VI. A progress report on studies of disease of cassava. East Afr. Agric. J. 2, 34–9 (1936).
Legg, J. P. et al. Spatio-temporal patterns of genetic change amongst populations of cassava Bemisia tabaci whiteflies driving virus pandemics in East and Central Africa. Virus Res. 186, 61–75 (2014).
Article CAS PubMed Google Scholar
Beyene, G. et al. A virus-derived stacked RNAi construct confers robust resistance to cassava brown streak disease. Front. Plant Sci. 7, 2052 (2017).
Article PubMed PubMed Central Google Scholar
Kawuki, R. S. et al. Alternative approaches for assessing cassava brown streak root necrosis to guide resistance breeding and selection. Front. Plant Sci. 10, 1461 (2019).
Article PubMed PubMed Central Google Scholar
Sheat, S., Fuerholzner, B., Stein, B. & Winter, S. Resistance against cassava brown streak viruses from Africa in cassava germplasm from South America. Front. Plant Sci. 10, 567 (2019).
Article PubMed PubMed Central Google Scholar
Maruthi, M. N. et al. Transmission of cassava brown streak virus by Bemisia tabaci (Gennadius). J. Phytopathol. 153, 307–312 (2005).
Article Google Scholar
Nichols, R. F. J. The brown streak disease of cassava: Distribution, climatic effects and diagnostic symptoms. East Afr. Agric J. 15, 154–160 (1965).
Google Scholar
Mohammed, I., Abarshi, M., Muli, B., Hillocks, R. & Maruthi, M. The symptom and genetic diversity of cassava brown streak viruses infecting cassava in East Africa. Adv. Virol. 2012 (2012).
AlSuwaidi, A., Grieve, B. & Yin, H. Feature-ensemble-based novelty detection for analyzing plant hyperspectral datasets. IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens. 11, 1041–1055 (2018).
AlSuwaidi, A., Grieve, B. & Yin, H. Combining spectral and texture features in hyperspectral image analysis for plant monitoring. Meas. Sci. Tech. 29, 104001 (2018).
Article ADS Google Scholar
Winter, S. et al. Analysis of cassava brown streak viruses reveals the presence of distinct virus species causing cassava brown streak disease in East Africa. J. Gen. Virol. 91, 1365–1372 (2010).
Article CAS PubMed Google Scholar
Pasin, F. et al. Multiple T-DNA delivery to plants using novel mini binary vectors with compatible replication origins. ACS Synth. Biol. 6, 1962–1968 (2017).
Article PubMed Google Scholar
Valli, A. A. et al. Maf/ham1-like pyrophosphatases, host-specific partners of viral RNA-dependent RNA polymerases. bioRxiv https://doi.org/10.1101/2021.05.18.444600.
Kaweesi, T. et al. Field evaluation of selected cassava genotypes for cassava brown streak disease based on symptom expression and virus load. Virol. J. 11, 1–15 (2014).
Article Google Scholar
Panno, S. et al. Loop mediated isothermal amplification: Principles and applications in plant virology. Plants 9, 461 (2020).
Article CAS PubMed Central Google Scholar
Baumgardner, M. F., Biehl, L. L. & Landgrebe, D. A. 220 band AVIRIS hyperspectral image data set: June 12, 1992 Indian Pine test site 3 (2015).
Chen, Y., Jiang, H., Li, C., Jia, X. & Ghamisi, P. Deep feature extraction and classification of hyperspectral images based on convolutional neural networks. IEEE Trans. Geosci. Remote Sens. 54, 6232–6251 (2016).
Article ADS Google Scholar
Mou, L., Ghamisi, P. & Zhu, X. X. Deep recurrent neural networks for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 55, 3639–3655 (2017).
Article ADS Google Scholar
Cao, X. et al. Hyperspectral image classification with Markov random fields and a convolutional neural network. IEEE Trans. Image Process. 27, 2354–2367 (2018).
Article ADS MathSciNet PubMed Google Scholar
Wang, Y. et al. Self-supervised feature learning with CRF embedding for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 57, 2628–2642 (2018).
Article ADS Google Scholar
Yu, W., Zhang, M. & Shen, Y. Spatial revising variational autoencoder-based feature extraction method for hyperspectral images. IEEE Trans. Geosci. Remote Sens. 59, 1410–1423 (2021).
Article ADS Google Scholar
Park, B. & Lu, R. (eds.) Hyperspectral Imaging Technology in Food and Agriculture (Springer, 2015).
Cabrera-Ponce, J. L. et al. An efficient particle bombardment system for the genetic transformation of asparagus (Asparagus officinalis L.). Plant Cell Rep.16, 255–260 (1997).
Carter, G. A. Ratios of leaf reflectances in narrow wavebands as indicators of plant stress. Remote Sens. 15, 697–703 (1994).
Article MathSciNet Google Scholar
Daughtry, C., Walthall, C., Kim, M., De Colstoun, E. B. & McMurtrey, J. III. Estimating corn leaf chlorophyll concentration from leaf and canopy reflectance. Remote Sens. Environ. 74, 229–239 (2000).
Article ADS Google Scholar
Haboudane, D., Miller, J. R., Tremblay, N., Zarco-Tejada, P. J. & Dextraze, L. Integrated narrow-band vegetation indices for prediction of crop chlorophyll content for application to precision agriculture. Remote Sens. Environ. 81, 416–426 (2002).
Article ADS Google Scholar
Gamon, J., Penuelas, J. & Field, C. A narrow-waveband spectral index that tracks diurnal changes in photosynthetic efficiency. Remote Sens. Environ. 41, 35–44 (1992).
Article ADS Google Scholar
Apan, A., Held, A., Phinn, S. & Markley, J. Formulation and assessment of narrow-band vegetation indices from EO-1 Hyperion imagery for discriminating sugarcane disease. In Proc. Spatial Sci. Conf., 1–13 (2003).
Xue, J. & Su, B. Significant remote sensing vegetation indices: A review of developments and applications. J. Sensors https://doi.org/10.1155/2017/1353691 (2017).
Ponti Jr, M. P. Combining classifiers: from the creation of ensembles to the decision fusion. In Proc. SIBGRAPI Conf. Graph. Patterns Images Tuts, 1–10 (IEEE, 2011).
Selfridge, O. G. Pandemonium: A paradigm for learning. In Neurocomputing: Foundations of Research, 115–122 (1988).
Jacobs, R. A., Jordan, M. I., Nowlan, S. J. & Hinton, G. E. Adaptive mixtures of local experts. Neural Comput. 3, 79–87 (1991).
Article PubMed Google Scholar
Kittler, J., Hatef, M., Duin, R. P. & Matas, J. On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20, 226–239 (1998).
Article Google Scholar
Jain, A. K., Duin, R. P. W. & Mao, J. Statistical pattern recognition: A review. IEEE Trans. Pattern Anal. Mach. Intell. 22, 4–37 (2000).
Article Google Scholar
Xu, L., Krzyzak, A. & Suen, C. Y. Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Trans. Syst., Man, Cybern. 22, 418–435 (1992).
Alexandre, L. A., Campilho, A. C. & Kamel, M. On combining classifiers using sum and product rules. Pattern Recogn. Lett. 22, 1283–1289 (2001).
Article ADS MATH Google Scholar
Geman, S. & Geman, D. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6, 721–741 (1984).
Article CAS PubMed MATH Google Scholar
Li, S. Z. A Markov random field model for object matching under contextual constraints. In Proc. IEEE Conf. Comput. Vis. Pattern Recogn., 866–866 (1994).
Cross, G. R. & Jain, A. K. Markov random field texture models. IEEE Trans. Pattern Anal. Mach. Intell. 5, 25–39 (1983).
Article CAS PubMed Google Scholar
Li, C. & Wand, M. Combining Markov random fields and convolutional neural networks for image synthesis. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2479–2486 (2016).
Zhirong Wu, X. T., Dahua Lin. Deep Markov random field for image modeling. In Proc. Eur. Conf. Comput. Vis., 295–312 (2016).
Julesz, B. Visual pattern discrimination. IRE Trans. Inf. Theory 8, 84–92 (1962).
Article Google Scholar
Julesz, B. Textons, the elements of texture perception, and their interactions. Nature 290, 91–97 (1981).
Article ADS CAS PubMed Google Scholar
Kashyap, R. & Chellappa, R. Estimation and choice of neighbors in spatial-interaction models of images. IEEE Trans. Inf. Theory 29, 60–72 (1983).
Article MATH Google Scholar
Schölkopf, B., Smola, A. J., Williamson, R. C. & Bartlett, P. L. New support vector algorithms. Neural Comput. 12, 1207–1245 (2000).
Article PubMed Google Scholar
Chang, C.-C. & Lin, C.-J. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Tech. 2, 1–27 (2011).
Article Google Scholar
Wu, T.-F., Lin, C.-J. & Weng, R. C. Probability estimates for multi-class classification by pairwise coupling. J. Mach. Learn. Res. 5, 975–1005 (2004).
MathSciNet MATH Google Scholar
Lin, H.-T., Lin, C.-J. & Weng, R. C. A note on Platt’s probabilistic outputs for support vector machines. Mach. Learn. 68, 267–276 (2007).
Article MATH Google Scholar

Download references

Acknowledgements

We thank Adrián Valli, Fabio Pasin, and Juan Antonio Garcáa (Spanish National Center for Biotechnology, Madrid, Spain) for the gift of the pLX-UCBSVi infectious clone.

Author information

Authors and Affiliations

Department of Electrical and Electronic Engineering, University of Manchester, Manchester, UK
Yao Peng, Bruce Grieve & Hujun Yin
Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
Mary M. Dallas & Linda Hanley-Bowdoin
Department of Molecular and Structural Biochemistry, North Carolina State University, Raleigh, NC, USA
José T. Ascencio-Ibáñez
Department of Ecology, Evolution, and Natural Resources, Rutgers University, New Brunswick, NJ, USA
J. Steen Hoyer
International Institute of Tropical Agriculture (IITA), Dar es Salaam, Tanzania
James Legg

Authors

Yao Peng
View author publications
You can also search for this author in PubMed Google Scholar
Mary M. Dallas
View author publications
You can also search for this author in PubMed Google Scholar
José T. Ascencio-Ibáñez
View author publications
You can also search for this author in PubMed Google Scholar
J. Steen Hoyer
View author publications
You can also search for this author in PubMed Google Scholar
James Legg
View author publications
You can also search for this author in PubMed Google Scholar
Linda Hanley-Bowdoin
View author publications
You can also search for this author in PubMed Google Scholar
Bruce Grieve
View author publications
You can also search for this author in PubMed Google Scholar
Hujun Yin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.M.D., J.T.A-I., J.S.H. and L.H.B. designed the laboratory experiments. M.M.D. collected the data. Y.P., B.G. and H.Y. analysed the data and designed ML methods, and B.G. designed the MSI device. All authors contributed to analysis of the results and drafting and approving the submitted manuscript.

Corresponding author

Correspondence to Hujun Yin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Peng, Y., Dallas, M.M., Ascencio-Ibáñez, J.T. et al. Early detection of plant virus infection using multispectral imaging and spatial–spectral machine learning. Sci Rep 12, 3113 (2022). https://doi.org/10.1038/s41598-022-06372-8

Download citation

Received: 23 July 2021
Accepted: 21 January 2022
Published: 24 February 2022
DOI: https://doi.org/10.1038/s41598-022-06372-8

This article is cited by

Early detection of Solanum lycopersicum diseases from temporally-aggregated hyperspectral measurements using machine learning
- Michał Tomaszewski
- Jakub Nalepa
- Krzysztof Smykała
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Early detection of Solanum lycopersicum diseases from temporally-aggregated hyperspectral measurements using machine learning

Early warning and diagnostic visualization of Sclerotinia infected tomato based on hyperspectral imaging

Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections

Introduction

Results

Experimental settings

Scanning of cassava leaves

Performance for cassava disease detection

Performance on a tolerant cultivar

Discussion

Methods

Active multispectral imaging (A-MSI) system

Cassava cultivation and virus inoculation

Cassava leaf MSI acquisition

Data preprocessing

Vegetation indices (VIs) calculation

Conventional decision fusion techniques

Markov random field texture analysis

ProbDecFus: probabilistic decision fusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Early detection of Solanum lycopersicum diseases from temporally-aggregated hyperspectral measurements using machine learning

Comments

Search

Quick links