Predicting acute pancreatitis severity with enhanced computed tomography scans using convolutional neural networks

Liang, Hongyin; Wang, Meng; Wen, Yi; Du, Feizhou; Jiang, Li; Geng, Xuelong; Tang, Lijun; Yan, Hongtao

doi:10.1038/s41598-023-44828-7

Download PDF

Article
Open access
Published: 16 October 2023

Predicting acute pancreatitis severity with enhanced computed tomography scans using convolutional neural networks

Hongyin Liang^1,2,
Meng Wang³,
Yi Wen^1,2,
Feizhou Du⁴,
Li Jiang⁵,
Xuelong Geng⁴,
Lijun Tang^1,2 &
…
Hongtao Yan⁶

Scientific Reports volume 13, Article number: 17514 (2023) Cite this article

1136 Accesses
1 Citations
2 Altmetric
Metrics details

Subjects

Abstract

This study aimed to evaluate acute pancreatitis (AP) severity using convolutional neural network (CNN) models with enhanced computed tomography (CT) scans. Three-dimensional DenseNet CNN models were developed and trained using the enhanced CT scans labeled with two severity assessment methods: the computed tomography severity index (CTSI) and Atlanta classification. Each labeling method was used independently for model training and validation. Model performance was evaluated using confusion matrices, areas under the receiver operating characteristic curve (AUC-ROC), accuracy, precision, recall, F1 score, and respective macro-average metrics. A total of 1,798 enhanced CT scans met the inclusion criteria were included in this study. The dataset was randomly divided into a training dataset (n = 1618) and a test dataset (n = 180) with a ratio of 9:1. The DenseNet model demonstrated promising predictions for both CTSI and Atlanta classification-labeled CT scans, with accuracy greater than 0.7 and AUC-ROC greater than 0.8. Specifically, when trained with CT scans labeled using CTSI, the DenseNet model achieved good performance, with a macro-average F1 score of 0.835 and a macro-average AUC-ROC of 0.980. The findings of this study affirm the feasibility of employing CNN models to predict the severity of AP using enhanced CT scans.

Segment anything in medical images

Article Open access 22 January 2024

Screening and diagnosis of cardiovascular disease using artificial intelligence-enabled cardiac magnetic resonance imaging

Article Open access 13 May 2024

AI in health and medicine

Article 20 January 2022

Introduction

Acute pancreatitis (AP) is a common acute abdominal disease in clinical practice¹. Mild acute pancreatitis (MAP) has a good prognosis, while severe acute pancreatitis (SAP) is often associated with complications such as pancreatic necrosis and organ failure, resulting in a high mortality rate². As the clinical course of AP strongly depends on the early management of the disease, accurate assessment of the severity of AP can facilitate early intervention and contribute to improved clinical outcomes^{3, 4}.

Several assessment systems that utilize clinical manifestations, laboratory tests have been developed and evaluated in predicting the severity and prognosis of AP, including the Acute Physiology and Chronic Health Evaluation (APACHE II)⁵, the Ranson system⁶, the Bedside Index for Severity in Acute Pancreatitis (BISAP)⁷, the Marshall score⁸, the Sepsis-related Organ Failure Assessment (SOFA)⁹. However, these systems, either individually or in combination, have not provided satisfactory predictions for SAP¹⁰. Machine learning and other artificial intelligence methods have shown promise in forecasting the severity of AP¹¹. Studies have shown that machine learning models based on patient demographics and biochemical markers can enhance prediction accuracy^12,13,14.

In clinical practice, CT scans play a vital role in assessing the severity of pancreatitis. Several CT scan-related assessment systems, such as the Computed Tomography Severity Index (CTSI)¹⁵, the Modified Computed Tomography Severity Index (MCTSI)¹⁶, and the extrapancreatic inflammation on computed tomography (EPIC) score¹⁷, have shown associations with the severity and prognosis of AP. Several radiomics studies have also been applied in the prediction of AP^{18, 19}. However, the use of deep learning models based on CT images for evaluating AP severity is still in its early stages. A recent study by Chen et al. constructed a deep learning model based on MobileNetV2 using non-enhanced CT images obtained from AP patients within 72 h after onset²⁰. The results demonstrated that the CT image-based deep learning model achieved a prediction accuracy of 72.3% with an AUC-ROC of 0.741 for MAP, and an accuracy of 79.5% with an AUC-ROC of 0.896 for SAP. These findings suggest that using deep learning models with CT scans to predict the severity of AP is feasible and holds great promise for future applications.

Notably, the use of non-enhanced CT images obtained upon admission may be insufficient for a comprehensive evaluation of AP severity. In the initial stages of the disease, the pancreas undergoes rapid morphological changes and necrosis, which may remain undetectable or underestimated in non-enhanced CT scans²¹. Relevant guidelines^{22, 23} recommend that the optimal timing for CT scans used in assessing the severity of AP is at least 72–96 h after the onset of symptoms, and enhanced CT scans should be utilized.

In this study, we developed a convolutional neural network (CNN) model using 3D DenseNet, to predict the severity of AP using enhanced CT scans. Additionally, we investigated two distinct approaches for severity grading of AP, CTSI and the Atlanta classification, to label the enhanced CT scans. The CTSI can be derived solely from CT images, whereas the Atlanta classification, being more commonly used, incorporates factors beyond CT images. Each labeling method was used independently for model training and validation, facilitating a comprehensive comparison of the predictive performance of the models.

Methods

Study design

This was a single-center retrospective study conducted in a tertiary care hospital in western China. The study was approved by the Institutional Ethics Committee (No. A20200212008), and a waiver of informed consent was obtained.

CT scan data from an AP database established in 2009 were utilized²⁴, comprising enhanced CT scans of patients diagnosed with AP from 2009 to 2022. The database includes 2,571 abdomen-enhanced CT scans from 1,945 patients diagnosed with AP. Exclusions included patients under 18 years old, patients who have undergone retroperitoneal puncture and catheterization, those with chronic pancreatitis, a history of upper abdominal surgery (except cholecystectomy and bile duct exploration), or tumors. Ultimately, 1,798 enhanced CT scans were included in this study.

Definition

Diagnosis of AP

A diagnosis of AP was made according to the 2012 revised Atlanta classification and definitions of AP²; patients had to meet any two of the following conditions: (1) abdominal pain consistent with the characteristics of AP; (2) serum amylase (or lipase) greater than three times the upper limit of normal; and (3) characteristic findings of AP on imaging.

Definition of CTSI

Balthazar proposed the CTSI score based on enhanced CT images, considering pancreatic inflammation and the area proportion of pancreatic necrosis¹⁵. The detailed scoring criteria are provided in Table 1.

Table 1 Computed tomography severity index.

Full size table

Classification of AP severity

In this study, we used two approaches for assessing AP severity. Firstly, classification based on CTSI scores: a total CTSI of 0–3 indicated MAP, 4–6 indicated moderately severe AP (MSAP), and 7–10 indicated SAP. Secondly, the classification based on the 2012 revised Atlanta classification, which also defined three degrees of AP severity, as outlined in Table 2.

Table 2 Grades of acute pancreatitis severity based on the 2012 revised Atlanta classification.

Full size table

CT scans acquisition and labeling

All patients underwent standard contrast-enhanced abdominal CT examinations using a single-source, 64-multidetector CT scanner. Specific parameters were as follows: slice thickness of 1.0 mm and a matrix size of 512 × 512. CT scans usually consisted of 300–350 slices. Following non-enhanced CT acquisition, enhanced CT images were obtained after intravenous administration of nonionic iodinated contrast material (300 mg/mL of iodine) at a dose of 1.2 mL/kg and an injection rate of 2.5 mL/s using an automatic power injector. CT scans were saved in digital imaging and communications in medicine (DICOM) format to the picture archiving and communication system (PACS).

Portal venous phase CT images (50–70 s after contrast injection) were extracted from the PACS and used in this study. The CT image window width was adjusted to 200, and the window position was set at 45. Raw Hounsfield unit (HU) values were rescaled to a range of 0 to 1. Each CT scan included 256 manually selected slices encompassing the pancreas, and the CT scan image size was reshaped to 64 × 128 × 128.

CT scans were labeled for AP severity based on CTSI or the 2012 revised Atlanta classification. CTSI scores were determined by radiologists (Du and Geng, each with more than 10 years of experience) using the CTSI criteria according to the CTSI criteria. The radiologists were blinded to patient clinical symptoms and treatment. AP severity based on CTSI was determined by the calculated CTSI score, while severity based on the Atlanta classification was extracted from the database, recorded during patient hospitalization. These data were classified into MAP, MSAP, and SAP groups accordingly. Notably, the AP severity classification based on CTSI did not exactly match that based on the Atlanta classification. Each labeling method was used independently for model training and validation.

Model development and evaluation

Training and test datasets

The 1,798 CT scans were randomized into the training dataset (n = 1,618) and the test dataset (n = 180) at a ratio of 9:1. To enhance the training process, data augmentation techniques such as random rotation and translation were applied to the CT scans. As the CT scans were represented as rank-3 shape tensors (samples, depth, height, width), an additional dimension of size 1 at axis 4 was added to enable 3D convolutions (samples, depth, height, width, 1). The training dataset included both the raw CT scans and augmented CT scans, while only the raw CT scans were used for model evaluation in the test dataset.

DenseNet model

A three-dimensional DenseNet CNN model was developed for this study, utilizing the network architecture presented in Fig. 1. The model consisted of four modules, each comprising a dense block and a transition block. Within the dense block, the output Xi of layer i satisfied expression (1), where the nonlinear transformation function Hi(·) incorporated batch normalization and convolution. The last module connected the fully connected layers and applied the Softmax function to produce the final predictions.

$${\text{X}}_{{\text{i}}} = {\text{H}}_{{\text{i}}} \left( {\left[ {{\text{x}}_{0} ,{\text{x}}_{{1}} , \ldots ,{\text{x}}_{{{\text{i}} - {1}}} } \right]} \right).$$

(1)

Model evaluation

Confusion matrices were used to assess the accuracy of pairwise classification between different categories of patients. Since the task involved multiclassification prediction, the metrics such as AUC-ROC, precision, recall, and F1 score were calculated. The macro-average values of these metrics, computed as the arithmetic mean across individual classes, were used to evaluate the model's performance. In this study, macro-average metrics were employed instead of micro-average metrics to evaluate model performance in the triple classification task²⁵. Macro-average metrics, which assign equal importance to each class, are considered more suitable for imbalanced datasets compared to micro-average metrics²⁶. By giving equal weight to each class, macro-average metrics provide objective results for imbalanced datasets, allowing for reliable evaluation.

Visual interpretation of the models

The interpretation of model predictions was achieved by employing Gradient-weighted Class Activation Mappings (Grad-CAMs) extended to the 3D setting²⁷. These visual explanations represent heat maps superimposed on each slice, providing insights into the model's decision-making process. To visualize the Grad-CAMs, we overlay the Grad-CAMs on each input slice, offering a comprehensive view of the prediction rationale.

Ethics statement and informed consent statement

The study was approved by the Institutional Ethics Committee of the General Hospital of Western Theater Command (No. A20200212008). The requirement for obtaining written informed consent from patients was waived by the Institutional Ethics Committee of the General Hospital of Western Theater Command due to the retrospective nature of this study. Our study was conducted according to the ethical standards of the 1964 Declaration of Helsinki and its later amendments.

Methods statement

All methods were carried out in accordance with relevant guidelines and regulations.

Experimental environment and statistical analysis

This study was conducted on a computer with an NVIDIA(R) RTX(R) 3090 TI GPU and Intel(R) Core(R) CPU i9-12900 K processor. Python 3.9.0 (Python Software Foundation, Wilmington, DE, USA) was used for data extraction and preprocessing, model development and validation, and visualization and statistical analysis. To calculate the Ninety-five percent confidence intervals (CIs) for performance evaluation metrics such as accuracy and F1 score, we implemented bootstrapping with 1,000 iterations²⁸. This allowed us to derive values from these iterations, upon which the CIs were computed. Statistical significance was computed with the same bootstrapping method²⁹. P < 0.05 was considered statistically significant.

Results

In this study, a total of 1,798 enhanced CT scans from 1,561 patients were included (Fig. 2). These CT scans were labeled according to both the CTSI (MAP: 769, 42.8%; MSAP: 619, 34.4%; SAP: 410, 22.8%) and the 2012 revised Atlanta classification (MAP: 629, 35.0%; MSAP: 709, 39.4%; SAP: 460, 25.6%) to determine the severity of AP. Notably, there were 173 instances (9.6%) where the severity determination based on the CTSI did not correspond with the Atlanta classification. Of these, 154 instances were allocated to the training dataset, and 19 to the test dataset. Specifically, 123 instances categorized as MAP based on the CTSI were classified as MSAP according to the Atlanta classification. 17 instances categorized as MAP by CTSI were later classified as SAP under the Atlanta criteria. Furthermore, 33 instances categorized as MSAP by CTSI were classified as SAP according to the Atlanta classification. The dataset was randomly divided into a training dataset (n = 1,618) and a test dataset (n = 180) with a ratio of 9:1.

The demographic characteristics and clinical outcomes of the patients in both the training and test datasets are summarized in Table 3. There were no significant differences between the two groups in terms of age, gender, etiology, length of hospital stay, and mortality rate (P > 0.05). Furthermore, the time interval from the onset of symptoms to the CT examination for both groups was 5.4 ± 0.9 days and 5.4 ± 0.8 days, respectively, demonstrating no significant variance (P > 0.05).

Table 3 Demographic characteristics and clinical outcomes of the training and validation cohort.

Full size table

The performance of the trained models are summarized in Table 4, and the confusion matrices depicting the prediction results are presented in Fig. 3. The results revealed that the DenseNet model achieved favorable predictions for both the CTSI- and Atlanta classification-labeled CT scans, with accuracy exceeding 0.7 and AUC-ROC exceeding 0.7. Notably, the model trained with CTSI-labeled CT scans demonstrated particularly favorable performance, with a macro-average accuracy of 0.899, macro-average F1 score of 0.835, and macro-average AUC-ROC of 0.980.

Table 4 Predictive performance of the models trained using CT scans labeled with CTSI and Atlanta classification.

Full size table

The ROC curves of the DenseNet model predictions for the two different labeling methods are illustrated in Fig. 4. Both methods exhibited the best prediction performance for SAP, likely due to the more pronounced CT image changes observed in patients with SAP, making them more easily distinguishable by the model. Furthermore, the model demonstrated superior predictions with CTSI labeling compared to the Atlanta classification (macro-average AUC-ROC: 0.980 vs. 0.864, P < 0.05; macro-average F1 score: 0.835 vs. 0.670, P < 0.05).

The visualization results of our model can be seen in Fig. 5. We selected three representative CT scan slices of AP and employed Grad-CAMs to visualize the regions influencing the decision-making process in our trained DenseNet models. Our findings reveal that in the case of MAP with a pancreatic enlargement (Fig. 5A), both DenseNet models trained with the two annotation methods focused on the pancreas and the surrounding peripancreatic region. However, in instances with more noticeable pancreatic morphological changes (Fig. 5B and C), the DenseNet model trained using CTSI-labeled CT scans more effectively accentuated the areas corresponding to pancreatic necrosis and peripancreatic accumulation.

Discussion

Our study confirms the feasibility of using the CNN models, grounded on 3D DenseNet, to predict the severity of AP using enhanced CT scans. Regardless of whether the models were trained on CT scans labeled CTSI or Atlanta classification, the trained model consistently yielded robust classification performance, with a macro-average AUC-ROC score surpassing 0.8.

In recent years, advancements in AI and deep learning have enabled the application of CNN models, such as ResNet, DenseNet, Inception, and VGG, in the automatic analysis of CT scans^30,31,32,33. DenseNet has demonstrated favorable predictive performance in studies on the prediction of various conditions, including COVID-19³⁴. Previous studies predominantly utilized 2D CT slices for training and testing CNN models³⁵. However, since CT scans inherently provide 3D information, processing them using 2D models may lead to the loss of valuable information and compromise predictive efficacy. With the advancement of computing capabilities, the use of 3D models for direct processing of CT scans has become feasible. Studies involving the automated identification of COVID-19 patients have shown promising results using 3D CNN models³⁶. Therefore, in this study, we employed a 3D DenseNet model based on these previous findings.

To the best of our knowledge, there is limited research on using deep learning models to predict the severity of AP from CT scans. In a recent study, Chen et al. employed an image-deep learning model based on MobileNetV2 and trained it on non-enhanced CT scans obtained within 72 h of onset²⁰. In this study, we trained a CNN model based on 3D DenseNet using enhanced CT scans from patients with AP, typically taken around 5.4 days after symptom onset. The focus of the two studies varies, and the superior predictive performance of our models demonstrates the potential advantage of using enhanced CT scans for severity prediction of AP through CNN models, aligning with pancreatitis treatment guidelines and clinical experience. However, both studies preliminarily suggest the promising potential of CNN models in predicting the severity of AP using CT scans. However, there remains a considerable gap between the current capabilities of these models and their potential clinical applications, emphasizing the need for continued research.

The Atlanta classification is currently the most prevalent method for severity classification in AP. While there is a correlation between the CTSI and Atlanta classification, they are not identical^{37, 38}. The primary rationale for employing the CTSI in our research stems from its exclusive reliance on CT imaging, which sets it apart from the Atlanta Classification which also factors in additional data beyond the scope of CT imaging. Given these distinctions, CT images may not carry sufficient information for a precise Atlanta classification, and the utilization of Atlanta classification labels in training the CNN model may compromise the model's generalization capability.

Indeed, in this research, we amassed as many enhanced CT scans of patients with AP as possible. Several patients underwent multiple CT scans, and if these scans met the study's inclusion and exclusion criteria, they were included. As pancreatic necrosis can develop during the early stages of AP, the CTSI classification for these patients may fluctuate over different time points. However, their classifications according to the Atlanta criteria remained consistent. Training the model on distinct CT images carrying identical labels may detrimentally impact the model's predictive capacity. Our results showed that the predictive performance of the model trained on CT scans labeled with CTSI outperformed the model trained on CT scans labeled with the Atlanta classification (macro-average AUC-ROC: 0.980 vs. 0.864). These findings indicated that training a CNN model using enhanced CT images based on the CTSI can achieve better predictive performance, and it demonstrates the ability of the CNN model to capture information about changes in the severity of AP from CT images.

In this study, we did not perform an a priori extraction of the region of interest (ROI) pertaining to the pancreas. Currently, there are no mature algorithms for accurately classifying pancreatic necrosis, peripancreatic necrotic accumulation, and normal pancreas³⁹, and manual ROI labeling or training a separate model for automated segmentation would require substantial time and effort⁴⁰.

One of the key advantages of deep learning models is their ability to automatically extract quantitative features from high-throughput images, analyze image data in-depth, and translate microscopic lesion changes into quantitative measures⁴¹. In this study, referred to some previous CNN model research^42,43,44, we adopted an end-to-end approach that bypasses ROI extraction and directly employs the entire CT scan for model development. Fortunately, our results affirmed the feasibility and effectiveness of this direct approach in predicting the severity of pancreatitis. Regardless, employing an appropriate pancreatic segmentation algorithm or extracting ROI could potentially enhance the predictive performance of the model. This issue can be further explored in subsequent research.

To further evaluate whether our model effectively focuses on the pancreas and peripancreatic necrosis, we employed the Grad-CAMs visualization technique to highlight potentially decision-related areas in CT scan slices²⁷. The Grad-CAM results confirmed that the model successfully attends to areas of both pancreatic and extrapancreatic necrosis, further supporting the model's applicability.

In the field of artificial intelligence for pancreatitis, one of the ultimate goals might be to dynamically predict the severity of a patient's condition and their clinical prognosis based on various collected data, thereby guiding clinical diagnosis and treatment. In future research, through model improvements, such as adopting attention-based transformer models and incorporating time-series based recurrent neural networks, we may not only further enhance the predictive performance of the model and reduce computational load, but also potentially achieve a dynamic evaluation of pancreatitis severity using heterogeneous data from different time points. This could enable us to rapidly and accurately determine the severity of AP using available data.

However, this study has certain limitations inherent to its design and objective conditions. Firstly, it was a single-center study, which limits its generalizability, although sample consistency was high. Conducting multicenter studies and external validation would further strengthen the predictive efficacy of the model. Secondly, as mentioned earlier, ROI extraction was not attempted in this study, and it remains unclear whether performing ROI extraction would improve the model's prediction performance. This aspect can be explored in future research. Thirdly, in this study, we focused on developing a 3D DenseNet CNN model. Future studies can investigate additional CNN models and cross-modal hybrid models that integrate both imaging information and clinical data to enhance the model's performance in predicting AP severity.

In summary, our findings demonstrate that the constructed 3D DenseNet CNN model exhibits reliable predictive capability in classifying AP severity after training with enhanced CT scans, highlighting the feasibility of using CNN models for automatic AP severity classification based on imaging data. Moreover, this study provides insights for the development of more comprehensive models that incorporate both imaging information and clinical data for predicting the severity of pancreatitis. Further advancements in this area can lead to improved clinical decision-making and better patient outcomes.

Data availability

The data generated and analyzed during the current study are not publicly available due to privacy laws and policies, but are available from the corresponding author on reasonable request.

References

Al-Hadeedi, S., Fan, S. T. & Leaper, D. APACHE-II score for assessment and monitoring of acute pancreatitis. Lancet (London, England) 2, 738. https://doi.org/10.1016/s0140-6736(89)90795-2 (1989).
Article CAS PubMed Google Scholar
Banks, P. A. et al. Classification of acute pancreatitis–2012: Revision of the Atlanta classification and definitions by international consensus. Gut 62, 102–111. https://doi.org/10.1136/gutjnl-2012-302779 (2013).
Article PubMed Google Scholar
Baron, T. H., DiMaio, C. J., Wang, A. Y. & Morgan, K. A. American Gastroenterological Association clinical practice update: Management of pancreatic necrosis. Gastroenterology 158, 67-75.e61. https://doi.org/10.1053/j.gastro.2019.07.064 (2020).
Article CAS PubMed Google Scholar
Tenner, S., Baillie, J., DeWitt, J. & Vege, S. S. American College of Gastroenterology guideline: Management of acute pancreatitis. Am. J. Gastroenterol. 108, 1400–1415. https://doi.org/10.1038/ajg.2013.218 (2013).
Article CAS PubMed Google Scholar
Knaus, W. A., Draper, E. A., Wagner, D. P. & Zimmerman, J. E. APACHE II: A severity of disease classification system. Crit. Care Med. 13, 818–829 (1985).
Article CAS PubMed Google Scholar
Ranson, J. H. et al. Objective early identification of severe acute pancreatitis. Am. J. Gastroenterol. 61, 443–451 (1974).
CAS PubMed Google Scholar
Singh, V. K. et al. A prospective evaluation of the bedside index for severity in acute pancreatitis score in assessing mortality and intermediate markers of severity in acute pancreatitis. Am. J. Gastroenterol. 104, 966–971. https://doi.org/10.1038/ajg.2009.28 (2009).
Article ADS PubMed Google Scholar
Marshall, J. C. et al. Multiple organ dysfunction score: A reliable descriptor of a complex clinical outcome. Crit. Care Med. 23, 1638–1652. https://doi.org/10.1097/00003246-199510000-00007 (1995).
Article CAS PubMed Google Scholar
Vincent, J. L. et al. The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure. On behalf of the working group on sepsis-related problems of the european society of intensive care medicine. Intensive Care Med. 22, 707–710. https://doi.org/10.1007/bf01709751 (1996).
Article CAS PubMed Google Scholar
Cho, J. H., Kim, T. N., Chung, H. H. & Kim, K. H. Comparison of scoring systems in predicting the severity of acute pancreatitis. World J. Gastroenterol. 21, 2387–2394. https://doi.org/10.3748/wjg.v21.i8.2387 (2015).
Article PubMed PubMed Central Google Scholar
Tarján, D. & Hegyi, P. Acute pancreatitis severity prediction: It is time to use artificial intelligence. J. Clin. Med. 12, 62. https://doi.org/10.3390/jcm12010290 (2022).
Article Google Scholar
Zhou, Y. et al. Machine learning predictive models for acute pancreatitis: A systematic review. Int. J. Med. Inf. 157, 104641. https://doi.org/10.1016/j.ijmedinf.2021.104641 (2022).
Article Google Scholar
Li, J. N. et al. Machine learning improves prediction of severity and outcomes of acute pancreatitis: A prospective multi-center cohort study. Sci. China. Life Sci. https://doi.org/10.1007/s11427-022-2333-8 (2023).
Article PubMed PubMed Central Google Scholar
İnce, A. T. et al. Early prediction of the severe course, survival, and ICU requirements in acute pancreatitis by artificial intelligence. Pancreatol. Off. J. Int. Assoc. Pancreatol. 23, 176–186. https://doi.org/10.1016/j.pan.2022.12.005 (2023).
Article Google Scholar
Balthazar, E. J. et al. Acute pancreatitis: Prognostic value of CT. Radiology 156, 767–772. https://doi.org/10.1148/radiology.156.3.4023241 (1985).
Article CAS PubMed Google Scholar
Mortele, K. J. et al. A modified CT severity index for evaluating acute pancreatitis: Improved correlation with patient outcome. AJR. Am. J. Roentgenol. 183, 1261–1265. https://doi.org/10.2214/ajr.183.5.1831261 (2004).
Article PubMed Google Scholar
De Waele, J. J. et al. Extrapancreatic inflammation on abdominal computed tomography as an early predictor of disease severity in acute pancreatitis: Evaluation of a new scoring system. Pancreas 34, 185–190. https://doi.org/10.1097/mpa.0b013e31802d4136 (2007).
Article PubMed Google Scholar
Zhao, Y. et al. Early prediction of acute pancreatitis severity based on changes in pancreatic and peripancreatic computed tomography radiomics nomogram. Quant. Imaging Med. Surg. 13, 1927–1936. https://doi.org/10.21037/qims-22-821 (2023).
Article PubMed PubMed Central Google Scholar
Lin, Q. et al. Radiomics model of contrast-enhanced MRI for early prediction of acute pancreatitis severity. J. Magnet. Resonance Imaging JMRI 51, 397–406. https://doi.org/10.1002/jmri.26798 (2020).
Article PubMed Google Scholar
Chen, Z. et al. Deep learning models for severity prediction of acute pancreatitis in the early phase from abdominal nonenhanced computed tomography images. Pancreas 52, e45–e53. https://doi.org/10.1097/mpa.0000000000002216 (2023).
Article PubMed Google Scholar
Rocha, A. P. C., Schawkat, K. & Mortele, K. J. Imaging guidelines for acute pancreatitis: When and when not to image. Abdom. Radiol. (New York) 45, 1338–1349. https://doi.org/10.1007/s00261-019-02319-2 (2020).
Article Google Scholar
Association, C. P. S. Guidelines for diagnosis and treatment of acute pancreatitis in China (2021). Zhonghua wai ke za zhi [Chin. J. Surg.] 59, 578–587. https://doi.org/10.3760/cma.j.cn112139-20210416-00172 (2021).
Article Google Scholar
IAP/APA evidence-based guidelines for the management of acute pancreatitis. Pancreatol. Off. J. Int. Assoc. Pancreatol. 13, e1–15, https://doi.org/10.1016/j.pan.2013.07.063 (2013).
Liu, W. H. et al. Abdominal paracentesis drainage ahead of percutaneous catheter drainage benefits patients attacked by acute pancreatitis with fluid collections: A retrospective clinical cohort study. Crit. Care Med. 43, 109–119. https://doi.org/10.1097/ccm.0000000000000606 (2015).
Article ADS CAS PubMed Google Scholar
Fremond, S. et al. Interpretable deep learning model to predict the molecular classification of endometrial cancer from haematoxylin and eosin-stained whole-slide images: A combined analysis of the PORTEC randomised trials and clinical cohorts. The Lancet 5, e71–e82. https://doi.org/10.1016/s2589-7500(22)00210-2 (2023).
Article CAS PubMed Google Scholar
Schultebraucks, K., Choi, K. W., Galatzer-Levy, I. R. & Bonanno, G. A. Discriminating heterogeneous trajectories of resilience and depression after major life stressors using polygenic scores. JAMA Psychiat. 78, 744–752. https://doi.org/10.1001/jamapsychiatry.2021.0228 (2021).
Article Google Scholar
Selvaraju, R. R. et al. in Grad-cam: Visual explanations from deep networks via gradient-based localization. 618–626.
Rutter, C. M. Bootstrap estimation of diagnostic accuracy with patient-clustered data. Acad. Radiol. 7, 413–419. https://doi.org/10.1016/s1076-6332(00)80381-5 (2000).
Article CAS PubMed Google Scholar
Samuelson, F. W., Petrick, N. & Paquerault, S. in Advantages and examples of resampling for CAD evaluation. 492–495 (IEEE).
Yamashita, R., Nishio, M., Do, R. K. G. & Togashi, K. Convolutional neural networks: An overview and application in radiology. Insights Imaging 9, 611–629. https://doi.org/10.1007/s13244-018-0639-9 (2018).
Article PubMed PubMed Central Google Scholar
Yahya, A. A., Liu, K., Hawbani, A., Wang, Y. & Hadi, A. N. A novel image classification method based on residual network, inception, and proposed activation function. Sensors (Basel, Switzerland) https://doi.org/10.3390/s23062976 (2023).
Article PubMed Google Scholar
Zhang, W. et al. Detecting individuals with severe mental illness using artificial intelligence applied to magnetic resonance imaging. EBioMedicine 90, 104541. https://doi.org/10.1016/j.ebiom.2023.104541 (2023).
Article PubMed PubMed Central Google Scholar
Ren, K., Hong, G., Chen, X. & Wang, Z. A COVID-19 medical image classification algorithm based on Transformer. Sci. Rep. 13, 5359. https://doi.org/10.1038/s41598-023-32462-2 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
de Vente, C. et al. Automated COVID-19 grading with convolutional neural networks in computed tomography scans: A systematic comparison. IEEE Trans. Artif. Intell. 3, 129–138. https://doi.org/10.1109/tai.2021.3115093 (2022).
Article PubMed Google Scholar
Song, Y. et al. Deep learning enables accurate diagnosis of novel coronavirus (COVID-19) with CT images. IEEE/ACM Trans. Comput. Biol. Bioinform. 18, 2775–2780. https://doi.org/10.1109/tcbb.2021.3065361 (2021).
Article CAS PubMed Google Scholar
Wang, S. et al. A fully automatic deep learning system for COVID-19 diagnostic and prognostic analysis. Eur. Respirat. J. https://doi.org/10.1183/13993003.00775-2020 (2020).
Article Google Scholar
Bollen, T. L. et al. Comparative evaluation of the modified CT severity index and CT severity index in assessing severity of acute pancreatitis. AJR. Am. J. Roentgenol. 197, 386–392. https://doi.org/10.2214/ajr.09.4025 (2011).
Article PubMed Google Scholar
Bollen, T. L. et al. A comparative evaluation of radiologic and clinical scoring systems in the early prediction of severity in acute pancreatitis. Am. J. Gastroenterol. 107, 612–619. https://doi.org/10.1038/ajg.2011.438 (2012).
Article ADS PubMed Google Scholar
Lim, S. H. et al. Automated pancreas segmentation and volumetry using deep neural network on computed tomography. Sci. Rep. 12, 4075. https://doi.org/10.1038/s41598-022-07848-3 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Mashayekhi, R. et al. Radiomic features of the pancreas on CT imaging accurately differentiate functional abdominal pain, recurrent acute pancreatitis, and chronic pancreatitis. Eur. J. Radiol. 123, 108778. https://doi.org/10.1016/j.ejrad.2019.108778 (2020).
Article PubMed Google Scholar
Chetoui, M. et al. Explainable COVID-19 detection based on chest x-rays using an end-to-end RegNet architecture. Viruses https://doi.org/10.3390/v15061327 (2023).
Article PubMed PubMed Central Google Scholar
Ardila, D. et al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat. Med. 25, 954–961. https://doi.org/10.1038/s41591-019-0447-x (2019).
Article CAS PubMed Google Scholar
Lesage, M. et al. An end-to-end pipeline based on open source deep learning tools for reliable analysis of complex 3D images of ovaries. Development (Cambridge, England) https://doi.org/10.1242/dev.201185 (2023).
Article PubMed Google Scholar
Si, K. et al. Fully end-to-end deep-learning-based diagnosis of pancreatic tumors. Theranostics 11, 1982–1990. https://doi.org/10.7150/thno.52508 (2021).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the National Clinical Key Subject of China (Grant No. 41732113). The funder of this study had no role in study design, data collection, data analysis, data interpretation, or writing of the report.

Author information

Authors and Affiliations

Department of General Surgery, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
Hongyin Liang, Yi Wen & Lijun Tang
Sichuan Provincial Key Laboratory of Pancreatic Injury and Repair, Chengdu, 610083, China
Hongyin Liang, Yi Wen & Lijun Tang
Department of Traditional Chinese Medicine, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
Meng Wang
Department of Radiology, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
Feizhou Du & Xuelong Geng
Department of Cardiac Surgery, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
Li Jiang
Department of Liver Transplantation and Hepato-biliary-pancreatic Surgery, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610016, China
Hongtao Yan

Authors

Hongyin Liang
View author publications
You can also search for this author in PubMed Google Scholar
Meng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Wen
View author publications
You can also search for this author in PubMed Google Scholar
Feizhou Du
View author publications
You can also search for this author in PubMed Google Scholar
Li Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Xuelong Geng
View author publications
You can also search for this author in PubMed Google Scholar
Lijun Tang
View author publications
You can also search for this author in PubMed Google Scholar
Hongtao Yan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L. and H.Y. proposed the ideas; M.W., Y.W., and L.J. collected data; H.L. designed the model; F.D. and X.G. implemented CTSI scoring; H.L., M.W., and L.T. analyzed and interpreted data; H.L., L.T., and H.Y. drafted and revised the article. All authors reviewed the manuscript.

Corresponding author

Correspondence to Hongtao Yan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liang, H., Wang, M., Wen, Y. et al. Predicting acute pancreatitis severity with enhanced computed tomography scans using convolutional neural networks. Sci Rep 13, 17514 (2023). https://doi.org/10.1038/s41598-023-44828-7

Download citation

Received: 13 May 2023
Accepted: 12 October 2023
Published: 16 October 2023
DOI: https://doi.org/10.1038/s41598-023-44828-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Segment anything in medical images

Screening and diagnosis of cardiovascular disease using artificial intelligence-enabled cardiac magnetic resonance imaging

AI in health and medicine

Introduction

Methods

Study design

Definition

Diagnosis of AP

Definition of CTSI

Classification of AP severity

CT scans acquisition and labeling

Model development and evaluation

Training and test datasets

DenseNet model

Model evaluation

Visual interpretation of the models

Ethics statement and informed consent statement

Methods statement

Experimental environment and statistical analysis

Results

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links