Automated quantification of avian influenza virus antigen in different organs

Landmann, Maria; Scheibner, David; Gischke, Marcel; Abdelwhab, Elsayed M.; Ulrich, Reiner

doi:10.1038/s41598-024-59239-5

Download PDF

Article
Open access
Published: 16 April 2024

Automated quantification of avian influenza virus antigen in different organs

Maria Landmann¹,
David Scheibner²,
Marcel Gischke²,
Elsayed M. Abdelwhab² &
…
Reiner Ulrich ORCID: orcid.org/0000-0002-9403-1224¹

Scientific Reports volume 14, Article number: 8766 (2024) Cite this article

297 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

As immunohistochemistry is valuable for determining tissue and cell tropism of avian influenza viruses (AIV), but time-consuming, an artificial intelligence-based workflow was developed to automate the AIV antigen quantification. Organ samples from experimental AIV infections including brain, heart, lung and spleen on one slide, and liver and kidney on another slide were stained for influenza A-matrixprotein and analyzed with QuPath: Random trees algorithms were trained to identify the organs on each slide, followed by threshold-based quantification of the immunoreactive area. The algorithms were trained and tested on two different slide sets, then retrained on both and validated on a third set. Except for the kidney, the best algorithms for organ selection correctly identified the largest proportion of the organ area. For most organs, the immunoreactive area assessed following organ selection was significantly and positively correlated to a manually assessed semiquantitative score. In the validation set, intravenously infected chickens showed a generally higher percentage of immunoreactive area than chickens infected oculonasally. Variability between the slide sets and a similar tissue texture of some organs limited the ability of the algorithms to select certain organs. Generally, suitable correlations of the immunoreactivity data results were achieved, facilitating high-throughput analysis of AIV tissue tropism.

Bridging clinic and wildlife care with AI-powered pan-species computational pathology

Article Open access 26 April 2023

Inter-species cell detection - datasets on pulmonary hemosiderophages in equine, human and feline specimens

Article Open access 03 June 2022

Deep Learning-Based Quantification of Pulmonary Hemosiderophages in Cytology Slides

Article Open access 03 August 2020

Introduction

Avian influenza viruses (AIV) infect a wide range of bird species and other animals^1,2. Influenza viruses belong to the family Orthomyxoviridae. Depending on the surface antigens hemagglutinin (HA) and neuraminidase (NA), different HxNy-subtypes are defined³. AIV in birds can be further divided into highly pathogenic AIV (HPAIV) and low pathogenic AIV (LPAIV). While HPAIV infection causes severe systemic disease, LPAIV infection often causes no or only mild clinical disease, usually limited to the respiratory and/or gastrointestinal tract in chickens, respectively⁴. Influenza viruses have a high variability due to multiple mechanisms such as mutations (antigenic drift), and reassortment, i.e., an exchange of whole gene segments (antigenic shift)^1,5. Due to this variability and many other factors, such as host species or host age, the tissue tropism and pathology of different AIV subtypes in different infected species are also highly variable^6,7,8 and HPAIV infection is often associated with necrotizing lesions and lymphoid depletion in many organs^4,9. Especially in the case of HPAIV infections in chickens, a considerable amount of viral antigen is often also present in endothelial cells¹⁰, contributing to the systemic spread of the virus⁴.

In pathology, the viral antigen distribution is often assessed by immunohistochemical staining of the antigen followed by microscopic examination. This method allows for a qualitative assessment of the tissue tropism with identification of the different affected cell types in various organs (e.g., parenchymal, endothelial or immune cells), the preference for a specific localization inside the cells (e.g., nuclear or cytoplasmic) in these organs and the examination of their association with lesions such as necrosis¹¹. Also, immunohistochemistry detects viral antigens (i.e., proteins), thus enabling conclusions about the active virus replication in specific cells. In addition, RT-PCR data are influenced by viruses circulating in the blood that do not represent organ infection per se. Immunohistochemistry is often used as a valuable supplementation for these analyses to further elucidate AIV tissue tropism and in vivo pathogenesis. Furthermore, a semiquantitative score is often applied to facilitate a comparison between different viruses and animals⁸. Both of these methods are relatively time-consuming and thus often only a small number of animals are examined. This in turn limits the statistical analysis of the collected data.

In contrast, digital quantification of AIV antigen can enable automated, high-throughput analysis of tissue tropism. The advantages of automated analysis, in addition to reduced workload, include consistency of assessment and the ability to measure and/or count many structures not only in representatively chosen areas of the slides but on whole slide images and for a large number of samples^12,13. However, there are limitations to computer-based analysis regarding the complexity of the task as well as the variability and quality of the slides analyzed^12,13,14.

QuPath is an open-source software specially designed for whole slide image analysis, which provides a large number of tools for slide assessment and has many options for integrating different steps of analysis in one workflow, thus enabling largely automated image analysis¹⁵.

This study aimed to create and validate a method for automated analysis of tissue samples, detecting and quantifying influenza virus antigen, enable a high-throughput evaluation of tissue samples, and reduce the associated workload for the examining pathologists. Automated organ selection was established as a first step since such studies often strive to examine many different tissues and therefore, a histopathologic slide frequently contains samples of multiple different organs. As often only a subset of organs is subjected to and fit for specific analyses, the presence of additional tissue samples on one slide was used in this study to assess possible limitations of the automated organ selection. In the second step, the immunoreactive area was quantified by applying threshold-based pixel classifiers in the respective organ regions selected in the first step.

Material and methods

AIV infection experiments

For the training and validation of the algorithms and the threshold-based antigen quantification, organ samples from multiple AIV infection studies (studies 1–3) conducted at the Friedrich-Loeffler-Institut (FLI; Greifswald-Insel Riems, Germany) were used. In these studies, approximately 6-week-old white leghorn chickens from Lohmann Animal Health (Cuxhaven, Germany) were inoculated with 12 different AIVs either oculonasally (studies 1, 2, 3) with 0.2 ml containing 10⁵ plaque-forming units (PFU) per bird, or intravenously (study 3) with 0.1 ml of 1:10 diluted allantoic fluid¹⁶, according to WOAH/OIE recommendations¹⁷, and died or were euthanized after inhalation of Isoflurane® (CP-Pharma, Germany) at 1–4 days post inoculation (dpi). Seven non-infected chickens were used as a negative control group. All experiments were approved by the State Office of Agriculture, Food Safety and Fishery in Mecklenburg-Western Pomerania, Germany (LALLF M-V, registration numbers LALLF MV 7221.3-1-060/17 and 7221.3–1.1-051-12) and performed in accordance with all relevant guidelines and regulations, as well as in compliance with the ARRIVE guidelines.

Histology and immunohistochemistry

Animals were necropsied under biosafety level-3 (BSL-3) conditions following standard procedures. Organ samples of the brain, heart, lung, spleen, liver, and kidney, as well as multiple other organs (skin, nasal cavity, trachea, thymus, glandular stomach, gizzard, duodenum, jejunum, caecum, pancreas, and bursa fabricii), were taken, fixed in 4% neutral-buffered formaldehyde for > 7 days, processed and embedded in paraffin wax. For the study groups, one paraffin block (block A) contained brain, heart, lung, and spleen and another block (block B) contained liver, kidney, glandular stomach, gizzard, thymus, and trachea. 4–5 µm microtome slices were mounted on glass slides. Immunohistochemical examination was conducted with the avidin–biotin-peroxidase complex (ABC) method (Vectastain PK-6100; Vector Laboratories, Newark, CA, USA) with citric buffer pretreatment (pH 6.0), a primary monoclonal mouse antibody targeting an epitope of the influenza A matrixprotein (ATCC clone HB-64, 1:100), a secondary biotinylated goat-anti-mouse IgG (BA-9200, Vector Laboratories, Newark, USA, 1:200), 3-amino-9-ethylcarbazol (AEC) as chromogen (Agilent Technologies, Santa Clara, CA, USA and Nichirei Biosciences Inc., Tokyo, Japan), and hematoxylin counterstain as done before^18,19. Controls included validated positive and negative archival tissues¹⁹, as well as replacement of the primary antibody with an anti-isotype IgG antibody.

Setup for image analysis

Before scanning, the slides were checked for artifacts such as dust, contaminants, severe tissue folds, or air bubbles. If present, these were minimized as much as possible, with the aim of keeping artifacts to less than 0.5% of the organ area. Slides were scanned using the AxioScan 7 with ZEN blue software (Carl Zeiss Microscopy GmbH, Jena, Germany), a 20 × objective with a numeric aperture of 0.45 and a pixel size of the scanned slides of 0.1725 × 0.1725 µm (detailed settings see Supplementary Table S1) and stored via ZEN data storage (Carl Zeiss Microscopy GmbH, Jena, Germany). Image analysis was conducted with QuPath v0.2.3¹⁵ on desktop computers with at least 16 GB RAM and a 3.30 GHz processor and monitors with at least 59.8 Hz/1920 × 1080.

First, one training set consisting of 21 animals and one test set consisting of 12 animals was selected from the chickens of studies 1 and 2. The 15 chickens of study 3 were used as a validation set, as they were independent of the chickens of studies 1 and 2.

Automated selection of organs

In short, two separate classifiers were trained for the automated organ selection on the immunohistochemical tissue sections using the “object classification” tools: one for block A/slide A to select the brain, heart, lung, and spleen and one for block B/slide B to select the liver and kidney among other tissue samples, which were present on slide B, but not used for quantitative analysis. For a short overview of the training and evaluation of the classifiers, see Supplementary Fig. S1, for a detailed workflow see Supplementary Figs. S2–S6.

For training of the classifiers, representative regions of the different organs were manually annotated on the slides of the training set and sometimes of the test set. Classes used for the annotations were brain, heart, lung, spleen, and background, including clear space and artifacts, for slide A, and liver, kidney, and background, including clear space, artifacts, and other organs present on the slide, for slide B. Furthermore, for training and evaluation, slides of the training set and test set were divided into tiles of 1000 × 1000 µm using the “Create tiles” tool. Of these created tiles, one tile for each organ and one tile for the background were selected randomly using Microsoft Excel (Version 2108, Microsoft 365) and annotated at 6× display magnification and the rest of the tiles were discarded (Supplementary Fig. S2). This was done for 12 randomly selected animals of the training set and all 12 animals of the test set. For 7 animals of the validation set, one 1000 × 1000 µm tile was manually annotated for evaluation, likewise. Additionally, on all slides of all sets a rough outline of each organ was manually annotated at 1–2× display magnification for evaluation purposes.

Stain vectors were set via the “Estimate stain vectors” tool based on one slide from study 1, and stain vectors were then transferred to all other slides (Supplementary Table S2). Subsequently, the “SLIC superpixel segmentation” tool was used on the whole slides to create multiple small sub-regions consisting of similar pixels. Intensity features were then calculated for each superpixel (Supplementary Tables S3 and S4).

Using the annotations of representative regions and tiles in the training set, a set of initial classifiers was trained via the “Train object classifier” tool. All trained classifiers were random trees classifiers and default settings for features were used, as they were found to work best in visual pre-evaluation. These initial classifiers were then applied to all slides of the training set as well as the test set, each consisting of animals from studies 1 and 2, to assess performance on slides known to the classifier as well as on slides unknown, but originating from the same studies. This resulted in one class being assigned to each superpixel (Supplementary Fig. S3).

The classification of the superpixels was then transformed into larger annotations for each class using the “Tile classifications to annotations” tool. Annotations and holes in annotations smaller than 1,000,000 µm² were removed via the “Remove fragments & holes” tool to exclude smaller, falsely classified regions, resulting in one large, cohesive, annotated region for each organ sample (Supplementary Fig. S3).

Results of the classification of the slides were then evaluated visually as well as using the 1000 × 1000 µm tiles and the rough outlines of the organs to allow for assessment at different levels of detail. For the tile evaluation, labeled images downsampled to a pixel size of 1 × 1 µm were extracted for each tile from the automated as well as the manually annotated slides. These labeled images were compared using the MorphoLibJ plugin²⁰ for Fiji/ImageJ²¹ (ImageJ version 1.53q) to calculate the Jaccard similarity index (Supplementary Fig. S4). For further evaluation of the whole slides, the percentage of correctly classified tissue (e.g., brain correctly classified as brain) or falsely classified tissue (e.g., brain falsely classified as heart, lung, spleen, and/or background) was calculated for each rough organ outline (Supplementary Fig. S5).

The initial classifiers trained with the training set were evaluated as described above and the best-performing classifiers were identified. These classifiers were then re-trained with the manual annotations from the test set in addition to those from the training set to further improve performance. These refined classifiers were then applied to the validation set, consisting of animals from study 3, as well as to the training set and test set, and evaluated as described above. Therefore, the performance on slides unknown to the classifier and originating from a study independent of the studies used in the training set and the test set was assessed.

Threshold-based quantification of immunoreactive area

The immunoreactive area was then measured using threshold-based pixel classifiers via the “create thresholder” tool in the selected organ regions (Supplementary Fig. S6). Organ-specific suitable thresholds for the AEC signal (immunoreactive area) were defined by visually assessing selected representative slides from at least 7 animals of the training set as well as slides from the 7 non-infected control animals for each organ (Table 1). A moderate resolution (1.38 µm/pixel) was chosen for the classification of the immunoreactive area. Those organ-specific thresholds were then applied to the respective selected organ region on the slides of the training set, test set, and validation set. The immunoreactive area was then transformed into an annotation with a minimum object and hole size of 3 µm². The immunoreactive area and immunonegative area were then measured and the percentage of the immunoreactive area was calculated per organ. This detection of the immunoreactive area was done for the manually selected regions as well as the regions selected by the established organ classifiers.

Table 1 Threshold for detection of immunoreactive area.

Full size table

As the immunohistochemical slides of the liver and kidney showed variable and often pronounced false positive immunoreactivity in peripheral regions, the margins of the annotations for the liver and kidney were excluded before the threshold-based antigen quantification. Margin width was chosen based on the estimation of the peripheral false positive area of the training set and the non-infected control animals, which resulted in a circumferential subtraction of 400 µm for the liver and 500 µm for the kidney. This was applied to all slide sets using the “expand annotation” tool.

Creation of a coherent workflow

All the steps necessary for pre-processing, organ selection, post-processing, and threshold-based quantification of the immunoreactive area were compiled into one coherent script to enable unsupervised batch-processing of multiple slides with QuPath.

Correlation of quantitative and semiquantitative analysis data

A semiquantitative scoring of parenchymal and endothelial antigen was conducted⁸. In short, the distribution of viral antigen was scored as follows: for parenchymal cells 0 = no, 1 = focal to oligofocal, 2 = multifocal, 3 = coalescing to diffuse antigen and for endothelial cells 0 = no antigen, 1 = antigen in single blood vessels, 2 = antigen in multiple blood vessels, 3 = diffuse immunoreactivity. For this study, the score was used as a parenchymal antigen score only or as the sum of the scores for parenchymal and endothelial antigen. Spearman’s correlation analysis was done with GraphPad Prism (Prism 8 for Windows, version 8.4.3, GraphPad Software, San Diego, CA, USA) to compare the percentage of immunoreactive area with the semiquantitative score.

In cases where the classifiers falsely selected no organ area when the respective organ was present on the slide, or selected organ area on a slide when the respective organ was not present, the pair of values was excluded.

Cutoff value for immunoreactive and immunonegative organs, sensitivity, and specificity

Unspecific staining and artifacts could not be fully excluded by the pixel classifier based on an AEC threshold alone without losing the majority of the true immunoreactive signal of the viral antigen. Therefore, an additional cutoff value was determined for the percentage of the threshold-based area to discern positive (i.e., displaying viral antigen) and negative organs (i.e., displaying no true viral antigen, but sometimes unspecific staining or artifacts). For this, the semiquantitative score described above⁸ was used as a ground truth as follows: If the sum of the parenchymal score and the endothelial score was 0, the organ was defined as negative ground truth and otherwise as positive ground truth. For the establishment of the cutoff value, the percentage of the immunoreactive area measured inside the manual organ selections was compared to this ground truth as follows: For the training set and the negative control animals, different cutoff values and the associated number of false positive and false negative samples for each organ were calculated using Microsoft Excel (Version 2108, Microsoft 365). One appropriate cutoff value per organ was selected accordingly and applied to the other data sets, including the slides with automated organ selection. As described for the correlation analysis, the pair of values was excluded if no organ area was selected for a present organ or organ area was falsely selected without the respective organ being present on the slide. The sensitivity and specificity were then calculated for organs of the training set, test set, and validation set, likewise.

Comparison of immunoreactive area for the validation set

For the 15 chickens of the validation set infected with the same H5N1 HPAIV¹⁶, the percentage of the immunoreactive area was compared between the animals with intravenous and oculonasal inoculation routes at 2 dpi using Mann–Whitney U tests and between the different organs within one inoculation group using Friedman tests followed by Dunn’s post-hoc-tests with GraphPad Prism (Prism 8 for Windows, version 8.4.3, GraphPad Software, San Diego, CA, USA). Due to missing values, the kidney was fully excluded from the automated organ selection and subsequently only animals with a full set of measurements from all organs were included in the statistical analysis.

RT-qPCR

For the animals of study 1, samples of brain, heart, lung, spleen, liver, and kidney (n = 94) were analyzed by reverse transcription-quantitative polymerase chain reaction (RT-qPCR), as described before²². Samples were homogenized and RNA extraction was done with NucleoMag VET kit (Macherey–Nagel GmbH, Germany) and KingFisher (Thermo Fisher Scientific, USA), following the manufacturer’s instructions. RT-qPCR was done using 1-Step RT-qPCR ToughMix for M1 (Quantabio, MA, USA). The relative virus amount (equivalent log10 PFU/ml) was calculated using standard curves. A Spearman’s correlation analysis and Pearson’s correlation analysis was performed to correlate the RT-qPCR data with the semiquantitative score and the percentage of the immunoreactive area, respectively, using GraphPad Prism (Prism 8 for Windows, version 8.4.3, GraphPad Software, San Diego, CA, USA).

Results

Evaluation of automated organ selection

Classifiers for automated organ selection were trained, evaluated, refined, and applied. For the best-performing classifiers, the Jaccard similarity index was calculated for selected tiles, comparing the automated organ selection to a detailed manual selection. Tiles of the training set and test set were used for evaluation of the initial classifiers and tiles of the combined training and test set and the validation set were used for evaluation of the refined classifiers. For each classifier, a post-processing exclusion of regions smaller than 1,000,000 µm² was conducted and compared to the same regions without this optimization. In most cases, the mean Jaccard indices were considerably higher for the post-processed organ selections than for the selections without this processing step (Fig. 1). Generally, the mean Jaccard indices of the slide sets used for the training of the initial classifiers, i.e., the training set, and the refined classifiers, i.e., the combined training and test set, were higher than those for the slide sets, which were not used for training, i.e., test set and validation set, respectively. With exception of the kidney and the background in the validation set, mean Jaccard indices for post-processed organ selections by the best classifiers were above 0.6 throughout all slide sets (Fig. 1).

Furthermore, for the best-performing classifiers, the percentage of correctly classified tissue was calculated for the whole slides after post-processing utilizing the rough, manual outline. Similar to the Jaccard index, the mean correctly selected percentage of each organ was generally higher for the slide sets used for training the classifiers than for the slide sets not used for training the initial and the refined classifiers, likewise (Figs. 2, 3). The mean correctly selected percentage was mostly above 65% except for the kidney of the validation set (Figs. 2, 3). In case of the kidney, only 11.99% were classified correctly (Fig. 3 f). Most of the misclassified organ area was classified as background or was left unclassified due to the removal of small fragments during post-processing. The mean percentage of the area of one organ, which was misclassified as another organ, was less than 5% per slide set for most organs and sets, except for the kidney in the validation set as selected by the refined classifier and the heart in the test set and validation set as selected by the initial and refined classifier, respectively (Figs. 2 b, 3b, f). In the case of the kidney, 10.59% were falsely classified as liver and 65.13% were falsely classified as background (Fig. 3f), and in the case of the heart, 12.71% for the test set and 8.05% for the validation set were falsely classified as brain (Figs. 2b, 3b).

Evaluation of quantification of immunoreactive area

The automated organ selection of the best refined classifiers was used as a basis for the threshold-based quantification of the immunoreactive area (Fig. 4). For evaluation, the same classifiers were additionally applied to the manual organ selections. The results of both analyses were compared to the semiquantitative score for parenchymal antigen only and the sum of the scores for parenchymal and endothelial antigen, respectively. Throughout all analyses, the percentage of the immunoreactive area showed a positive and often significant (p ≤ 0.05) correlation with the semiquantitative scores, except for the kidney in the automated organ selection, which showed a negative correlation (Fig. 5).

Comparison of infection routes for the validation set

The immunoreactive area was compared for the organs of the chickens of the validation set at 2 dpi, which were infected with the same H5N1 virus via IV or ON inoculation route. In general, the immunoreactive area measured in both manually and automatically selected organs exhibited a similar pattern across different organs and infection routes: The IV-infected animals had a larger percentage of immunoreactive area than the ON-infected animals and brain, liver, and kidney had a smaller percentage of immunoreactive area than heart, lung, and spleen. For the automated organ selections, the immunoreactive area of the heart was significantly smaller in the ON-infected animals than in the IV-infected animals (Mann–Whitney U test, p ≤ 0.05). In the IV-infected animals, the immunoreactive area in the liver was significantly smaller than in the lung and heart. For the ON-infected animals, the immunoreactive area in the brain was significantly smaller than in the lung (p ≤ 0.05, Friedman test with Dunn’s post-hoc-tests) (Fig. 6a). Considering the manual organ selection, in the IV-infected animals the liver had a significantly smaller immunoreactive area compared to the heart, lung, and spleen. Additionally, the kidney had a significantly smaller immunoreactive area than the lung (p ≤ 0.05, Friedman tests with Dunn’s post-hoc-tests). The immunoreactive area of the six organs of the ON-infected animals differed significantly with the Friedman test (p ≤ 0.05), but no significant difference between any pair of two organs was detected with Dunn’s post-hoc-test (Fig. 6b).

Cutoff value for organ positivity

To allow for an additional quick assessment of the overall immunoreactivity of an organ and to account for occasional false positive signal due to unspecific staining and artifacts, a cutoff value was determined for the whole organ. This was based on the measured percentage of immunoreactive area to distinguish “positive” from “negative” organs. Appropriate cutoff values were determined using the negative control animals and the training set with the manual organ selection. These cutoff values varied between the organs and ranged from ≥ 0.06 to ≥ 0.13% of the immunoreactive area. The selected cutoff values were then applied to the training set, test set, and validation set both with manual and automated organ selection and were evaluated using a semiquantitative score as a ground truth measurement for positivity and negativity (Tables 2 and 3). Sensitivity, specificity, and accuracy for the cutoff value were often suitable, but overall showed high variability.

Table 2 Evaluation of cutoff value for immunoreactive and immunonegative organs—manual organ selection.

Full size table

Table 3 Evaluation of cutoff value for immunoreactive and immunonegative organs—automated organ selection.

Full size table

Correlation of immunohistochemical data with RT-qPCR

A correlation analysis of immunohistochemical and RT-qPCR data was done for samples of brain, heart, lung, spleen, liver, and kidney (n = 94) from the animals of study 1 (Supplementary Fig. S7). For the automatically detected percentage of immunoreactive area and the RT-qPCR data r was 0.4678 (Pearson’s correlation analysis, p < 0.0001). For the sum of the semiquantitative scores for parenchymal and endothelial antigen and the RT-qPCR results r was 0.5638 (Spearman’s correlation analysis, p < 0.0001).

Discussion

This study aimed to develop a method for automated quantification of AIV antigen in different organs to reduce the workload for the involved pathologists. Based on our experience, the manual semiquantitative scoring of AIV antigen⁸ in the organs chosen for this study takes on average approximately 30 min per animal. In this study, the manual pre-selection of organs with a rough outline took about 15 min per animal, with an additional time of about 2 min for the unsupervised automated antigen quantification. In contrast, the fully automated, unsupervised analysis took about 9 min per animal. For the (partly) automated analyses, quality control of the results requires a small amount of extra time, which was approximately 2 min per animal for this study. Nonetheless, the time needed for the involved pathologist is markedly reduced, as the digitization of slides and the starting of the automated analysis can be done by trained technical staff. However, the training of the classifiers and the selection of the thresholds requires some time. Hence, the method presented here might not be the method of choice for small studies with low numbers of animals. Although it was only applied to a limited sample size in this study, this method might be useful for the analysis of larger sample sizes, as the automated part of the slide analysis takes only a fraction of the time needed for manual scoring after successful classifier training.

For most samples, the automated organ selection worked quite well, as was evaluated in detail for the small image regions with the Jaccard similarity index and for the whole slide visually and through comparison with a rough manual organ outline. The post-processing removal of smaller classified fragments turned out to be an important step for improving the selected regions. For some organs, particularly the kidney and the heart, the classifier-based organ selection was performing notably worse on the slide sets not used for classifier training (i.e., the test or validation set) compared to most other organs. Classifiers are known to often be limited by variability between different slides^14,23. Furthermore, on the analyzed slides, multiple organ samples were present due to the design of the original studies and due to maintaining an efficient preparation and staining workflow. We found that the distinction of organs with a similar texture most likely presents a challenge to these classifiers, especially on immunohistochemical slides only stained with hematoxylin and the chromogen. On these slides, the organ structure itself is relatively low in contrasting colors as opposed to slides stained with hematoxylin and eosin. The kidney samples were often falsely classified as liver and as background. On the slide with the kidney sample, there was also a sample of glandular stomach present, which has a similar, tubule-like texture. As other, non-analyzed organs on the slide were labeled with the background class during training, this is most likely often responsible for the misclassification of the kidney as background. Therefore, to improve future studies, we suggest that slides for automated organ detection should not contain additional tissues not included in the analysis. In addition, the relatively similar structure of the brain and heart as well as of liver and kidney, respectively, probably led to the misclassification of the one for the other organ sometimes. This was mostly the case for slide sets not used for classifier training. Regarding the liver and the kidney, the selected organ area was further reduced by the automated exclusion of annotation margins due to the false immunoreactivity of some samples. These findings demonstrate the limitations of the automated organ selection for this study and should be considered in sample processing for further studies.

Despite digitization with standardized settings, the brightness of the background often varied between different slides and organs. Therefore, a post-processing step of automated small-scale background exclusion was omitted and only background areas of a size of 1,000,000 µm² or larger were excluded from the surrounding organ area. This might have led to a certain amount of underestimation of the true immunoreactive area, especially in organs with many small, air-filled spaces such as the lung. Nonetheless, variability of selected areas for quantitative analysis is to be expected to some degree, regardless of the post-processing. For example, the airways might be filled with a varying amount of edematous fluid or erythrocytes due to pathological processes or euthanasia- and/or preparation-related artifacts, thus influencing the measured area.

To reduce the time required for the analysis, only one threshold was established per organ for measuring the immunoreactive area. Therefore, variation in staining intensity between different studies or batches of slides could affect the results, as a balance between true immunoreactive signal and background signal from other tissues must be found that works as best as possible over multiple slides. In this case, the validation set had a rather low staining intensity for AEC, which tended to underestimate the true immunoreactive area. In addition, in some slides of all sets, erythrocytes showed a false positive signal of low intensity, making it impossible to set a lower threshold. Furthermore, the resolution of the threshold-based pixel classifiers was set to “moderate” to shorten the analysis time, which may also have influenced the results of the antigen quantification.

Emphasis should also be put on the slide quality, as artifacts such as air bubbles, tissue folds or dust can easily impair the digitization²⁴ as well as influence the image analysis itself¹⁴ and may lead to a false positive signal, as the computer cannot discern these from a true positive signal with the same ease as the human eye. Therefore, it is necessary to critically assess the quantification results and ensure the quality of the automated analysis²⁴.

Spearman’s correlation analysis was done for automated and manual organ selection as well as for the semiquantitative parenchymal score only and the sum of the parenchymal and endothelial scores. In general, the correlation coefficients for the parenchymal scores were slightly higher than for the sum of parenchymal and endothelial scores, likely due to the smaller area taken up by immunoreactive endothelial cells compared to immunoreactive parenchymal cells. Furthermore, correlation coefficients for the liver and kidney were sometimes higher in the manual organ selection, especially for the validation set. This was probably influenced by the limitations of the automated organ selection regarding these organs.

Especially for the heart, liver, and kidney, the correlation coefficients in some slide sets were considerably lower than for the rest of the organs and sets. This might be due to the relatively high presence of background signals or to organ-specific differences in scoring criteria.

To further assess the additional cutoff value for whole organ positivity defined in this study, a larger scale validation is necessary, as the slide sets in this study show no equal distribution of truly positive and negative samples for all organs. Nonetheless, such a cutoff can give a quick overview of the organ tropism of different AIVs.

A comparison of the chickens of study 3 infected with H5N1 HPAIV and inoculated IV or ON showed a tendency towards larger immunoreactive areas and marked involvement of multiple organs for the IV-infected animals. The ON-infected animals generally had a smaller immunoreactive area. Interestingly, the largest amount of immunoreactive area for these animals was found in the lung, suggesting a possible influence of the oculonasal infection route. To our knowledge, so far no major differences in histopathological lesions and antigen distribution between IV and intranasal infection routes have been described for H5N1 HPAIV strains in chickens and both inoculation routes caused a systemic distribution of viral antigen and lesions^25,26. Nonetheless, a shorter mean time to death has been reported for the IV route of inoculation compared to the intranasal route of inoculation^25,27.

As immunohistochemical examination of AIV is often used in addition to RT-qPCR, we conducted further correlation analyses to compare these methods. We found a significant positive correlation between viral loads in multiple organs of the ON-infected animals from study 1 as detected by RT-qPCR and the corresponding immunohistochemical data. Although some studies have failed to demonstrate a coherent relationship between immunohistochemical and RT-PCR data²⁸, our results are consistent with several other studies that have found a positive correlation. For instance, in ducks infected with H5N1 HPAIV^29,30 or chickens and ducks infected with H5N8 HPAIV⁸. It is important to note that both techniques provide unique and valuable information contributing to a better understanding of AIV. For a fast and quantitative assessment of the overall amount of viral genome in an organ, including the intravascular blood, RT-qPCR is the preferred method. On the other hand, for topographic assessment of the relationship of the virus antigen to specific cell types and its association with lesions, immunohistochemistry is the preferred method¹¹.

In summary, the automated random trees classifiers correctly selected most of the organ areas, and a suitable correlation was achieved between the immunoreactive area per organ and a manual semiquantitative score. A functional workflow was created and successfully applied to slides from various experimental studies of avian influenza. A great advantage of the open-source software QuPath is its ability to create one coherent script to perform multiple, consecutive steps. This allows for an easy automated processing of multiple slides, thus providing an easy and fast method for assessment of AIV tissue tropism and reducing the workload for the involved scientists. In conclusion, we have explored a method for high-throughput analysis of immunohistochemically stained whole slide images. This method can be used as tool to characterize the tissue tropism of avian influenza viruses in future studies.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Webster, R. G., Bean, W. J., Gorman, O. T., Chambers, T. M. & Kawaoka, Y. Evolution and ecology of influenza A viruses. Microbiol. Rev. 56, 152–179 (1992).
Article CAS PubMed PubMed Central Google Scholar
Reperant, L. A., Rimmelzwaan, G. F. & Kuiken, T. Avian influenza viruses in mammals. Rev. Sci. Tech. 28, 137–159 (2009).
Article CAS PubMed Google Scholar
Medina, R. A. & García-Sastre, A. Influenza A viruses: New research developments. Nat. Rev. Microbiol. 9, 590–603 (2011).
Article CAS PubMed PubMed Central Google Scholar
Pantin-Jackwood, M. J. & Swayne, D. E. Pathogenesis and pathobiology of avian influenza virus infection in birds. Rev. Sci. Tech. 28, 113–136 (2009).
Article CAS PubMed Google Scholar
Mostafa, A., Abdelwhab, E. M., Mettenleiter, T. C. & Pleschka, S. Zoonotic potential of influenza A viruses: A comprehensive overview. Viruses 10, 497 (2018).
Article PubMed PubMed Central Google Scholar
Alexander, D. J., Parsons, G. & Manvell, R. J. Experimental assessment of the pathogenicity of eight avian influenza A viruses of H5 subtype for chickens, turkeys, ducks and quail. Avian Pathol. 15, 647–662 (1986).
Article CAS PubMed Google Scholar
Horimoto, T. & Kawaoka, Y. Influenza: Lessons from past pandemics, warnings from current incidents. Nat. Rev. Microbiol. 3, 591–600 (2005).
Article CAS PubMed Google Scholar
Landmann, M. et al. A semiquantitative scoring system for histopathological and immunohistochemical assessment of lesions and tissue tropism in avian influenza. Viruses 13, 868. https://doi.org/10.3390/v13050868 (2021).
Article PubMed PubMed Central Google Scholar
Perkins, L. E. & Swayne, D. E. Pathobiology of A/chicken/Hong Kong/220/97 (H5N1) avian influenza virus in seven gallinaceous species. Vet. Pathol. 38, 149–164 (2001).
Article CAS PubMed Google Scholar
Short, K. R., Veldhuis-Kroeze, E. J. B., Reperant, L. A., Richard, M. & Kuiken, T. Influenza virus and endothelial cells: A species specific relationship. Front. Microbiol. 5, 653. https://doi.org/10.3389/fmicb.2014.00653 (2014).
Article PubMed PubMed Central Google Scholar
Hooper, P. & Selleck, P. Pathology of low and high virulent influenza virus infections. Avian Dis. 47, 134–141 (2003).
Google Scholar
Hamilton, P. W. et al. Digital pathology and image analysis in tissue biomarker research. Methods 70, 59–73 (2014).
Article CAS PubMed Google Scholar
Aeffner, F. et al. The gold standard paradox in digital image analysis: Manual versus automated scoring as ground truth. Arch. Pathol. Lab. Med. 141, 1267–1275 (2017).
Article PubMed Google Scholar
Riber-Hansen, R., Vainer, B. & Steiniche, T. Digital image analysis: A review of reproducibility, stability and basic requirements for optimal results. APMIS 120, 276–289 (2012).
Article PubMed Google Scholar
Bankhead, P. et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 7, 16878. https://doi.org/10.1038/s41598-017-17204-5 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Gischke, M. et al. Insertion of basic amino acids in the hemagglutinin cleavage site of H4N2 avian influenza virus (AIV)–reduced virus fitness in chickens is restored by reassortment with highly pathogenic H5N1 AIV. Int. J. Mol. Sci. 21, 2353. https://doi.org/10.3390/ijms21072353 (2020).
Article CAS PubMed PubMed Central Google Scholar
World Organisation for Animal Health. Avian influenza (including infection with high pathogenicity avian influenza viruses). https://www.woah.org/fileadmin/Home/eng/Health_standards/tahm/3.03.04_AI.pdf (2021).
Graaf, A. et al. A viral race for primacy: Co-infection of a natural pair of low and highly pathogenic H7N7 avian influenza viruses in chickens and embryonated chicken eggs. Emerg. Microbes Infect. 7, 204. https://doi.org/10.1038/s41426-018-0204-0 (2018).
Article CAS PubMed PubMed Central Google Scholar
Koethe, S. et al. Modulation of lethal HPAIV H5N8 clade 2.3.4.4B infection in AIV pre-exposed mallards. Emerg. Microbes Infect. 9, 180–193. https://doi.org/10.1080/22221751.2020.1713706 (2020).
Article CAS PubMed PubMed Central Google Scholar
Legland, D., Arganda-Carreras, I. & Andrey, P. MorphoLibJ: Integrated library and plugins for mathematical morphology with ImageJ. Bioinformatics 32, 3532–3534 (2016).
Article CAS PubMed Google Scholar
Schindelin, J. et al. Fiji: An open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
Article CAS PubMed Google Scholar
Hoffmann, B., Hoffmann, D., Henritzi, D., Beer, M. & Harder, T. C. Riems influenza a typing array (RITA): An RT-qPCR-based low density array for subtyping avian and mammalian influenza a viruses. Sci. Rep. 6, 27211. https://doi.org/10.1038/srep27211 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Bertram, C. A. & Klopfleisch, R. The pathologist 2.0: An update on digital pathology in veterinary medicine. Vet. Pathol. 54, 756–766 (2017).
Article PubMed Google Scholar
Zuraw, A. & Aeffner, F. Whole-slide imaging, tissue image analysis, and artificial intelligence in veterinary pathology: An updated introduction and review. Vet. Pathol. 59, 6–25 (2022).
Article PubMed Google Scholar
Lee, C.-W. et al. Characterization of highly pathogenic H5N1 avian influenza A viruses isolated from South Korea. J. Virol. 79, 3692–3702 (2005).
Article CAS PubMed PubMed Central Google Scholar
Nakamura, K. et al. Pathology of specific-pathogen-free chickens inoculated with H5N1 avian influenza viruses isolated in Japan in 2004. Avian Dis. 52, 8–13 (2008).
Article PubMed Google Scholar
Jeong, O.-M. et al. Experimental infection of chickens, ducks and quails with the highly pathogenic H5N1 avian influenza virus. J. Vet. Sci. 10, 53–60 (2009).
Article PubMed PubMed Central Google Scholar
Bingham, J. et al. Infection studies with two highly pathogenic avian influenza strains (Vietnamese and Indonesian) in Pekin ducks (Anas platyrhynchos), with particular reference to clinical disease, tissue tropism and viral shedding. Avian Pathol. 38, 267–278 (2009).
Article PubMed Google Scholar
Löndt, B. Z. et al. Pathogenesis of highly pathogenic avian influenza A/turkey/Turkey/1/2005 H5N1 in Pekin ducks (Anas platyrhynchos) infected experimentally. Avian Pathol. 37, 619–627 (2008).
Article ADS PubMed Google Scholar
Wasilenko, J. L. et al. Pathogenicity of two Egyptian H5N1 highly pathogenic avian influenza viruses in domestic ducks. Arch. Virol. 156, 37–51 (2011).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors are grateful to Elfi Quente, Hilke Gräfe, and Maritta Wipplinger for excellent histotechnological support, to Angele Breithaupt and Silvia Schuparis for archival work, and to Benjamin Diehl for discussion of the manuscript.

Funding

Open Access funding enabled and organized by Projekt DEAL. M. Landmann received support for this work through a doctoral scholarship from the European Social Fund (ESF) in the Free State of Saxony. Supported by the Open Access Publishing Fund of Leipzig University.

Author information

Authors and Affiliations

Institute of Veterinary Pathology, Leipzig University, Leipzig, Germany
Maria Landmann & Reiner Ulrich
Institute of Molecular Virology and Cell Biology, Friedrich-Loeffler-Institut, Greifswald-Insel Riems, Germany
David Scheibner, Marcel Gischke & Elsayed M. Abdelwhab

Authors

Maria Landmann
View author publications
You can also search for this author in PubMed Google Scholar
David Scheibner
View author publications
You can also search for this author in PubMed Google Scholar
Marcel Gischke
View author publications
You can also search for this author in PubMed Google Scholar
Elsayed M. Abdelwhab
View author publications
You can also search for this author in PubMed Google Scholar
Reiner Ulrich
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.U. supervised and conceptualized this study; R.U., M.G., D.S. and E.M.A. planned and conducted the animal experiments; M.L. performed the image analysis and statistical analysis, wrote the manuscript and prepared the figures and tables. All authors reviewed and approved the manuscript.

Corresponding author

Correspondence to Reiner Ulrich.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Landmann, M., Scheibner, D., Gischke, M. et al. Automated quantification of avian influenza virus antigen in different organs. Sci Rep 14, 8766 (2024). https://doi.org/10.1038/s41598-024-59239-5

Download citation

Received: 10 August 2023
Accepted: 08 April 2024
Published: 16 April 2024
DOI: https://doi.org/10.1038/s41598-024-59239-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Bridging clinic and wildlife care with AI-powered pan-species computational pathology

Inter-species cell detection - datasets on pulmonary hemosiderophages in equine, human and feline specimens

Deep Learning-Based Quantification of Pulmonary Hemosiderophages in Cytology Slides

Introduction

Material and methods

AIV infection experiments

Histology and immunohistochemistry

Setup for image analysis

Automated selection of organs

Threshold-based quantification of immunoreactive area

Creation of a coherent workflow

Correlation of quantitative and semiquantitative analysis data

Cutoff value for immunoreactive and immunonegative organs, sensitivity, and specificity

Comparison of immunoreactive area for the validation set

RT-qPCR

Results

Evaluation of automated organ selection

Evaluation of quantification of immunoreactive area

Comparison of infection routes for the validation set

Cutoff value for organ positivity

Correlation of immunohistochemical data with RT-qPCR

Discussion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links