Mask R-CNN assisted 2.5D object detection pipeline of ⁶⁸Ga-PSMA-11 PET/CT-positive metastatic pelvic lymph node after radical prostatectomy from solely CT imaging

Xu, Di; Ma, Martin; Cao, Minsong; Kishan, Amar U.; Nickols, Nicholas G.; Scalzo, Fabien; Sheng, Ke

doi:10.1038/s41598-023-28669-y

Article
Open access
Published: 30 January 2023

Scientific Reports volume 13, Article number: 1696 (2023)

Abstract

Prostate-specific membrane antigen (PSMA) positron emission tomography (PET)/computed tomography (CT) is a molecular and functional imaging modality with better restaging accuracy than conventional imaging for detecting prostate cancer in men suspected of lymph node (LN) progression after definitive therapy. However, the availability of PSMA PET/CT is limited, both in low-resource settings and for repeated imaging surveillance. In contrast, CT is widely available, cost-effective, and routinely performed as part of patient follow-up or the radiotherapy workflow. Compared with the molecular activity seen on PET, the morphological and texture changes of subclinical LNs on CT are subtle, making manual detection of positive LNs infeasible. Instead, we harness artificial intelligence for automated LN detection on CT. We examined ⁶⁸Ga-PSMA-11 PET/CT images from 88 patients (including 739 PSMA PET/CT-positive pelvic LNs) who experienced a biochemical recurrence after radical prostatectomy and presented for salvage radiotherapy with prostate-specific antigen < 1 ng/mL. Scans were divided into a training set (nPatient = 52, nNode = 400), a validation set (nPatient = 18, nNode = 143), and a test set (nPatient = 18, nNode = 196). Using PSMA PET/CT as the ground truth and consensus pelvic LN clinical target volumes as search regions, a 2.5-dimensional (2.5D) Mask R-CNN based object detection framework was trained. The entire framework contained whole slice imaging pretraining, masked-out region fine-tuning, prediction post-processing, and “window bagging”. Following an additional preprocessing step, pelvic LN clinical target volume extraction, our pipeline located positive pelvic LNs based solely on CT scans. On the 196 positive pelvic LNs from 18 patients in the test set, our pipeline achieved a sensitivity of 83.351% and a specificity of 58.621%, and most of the false positives can be removed afterwards by radiologists. Our tool may aid CT-based detection of pelvic LN metastasis and triage patients most likely to benefit from the PSMA PET/CT scan.

Introduction

Prostate cancer is the second most frequently diagnosed cancer in men worldwide¹. Radical prostatectomy (RP) is a standard of care option for all men with localized disease². Unfortunately, about 20–40% of patients treated with RP will develop a biochemical recurrence (BCR), a rise in prostate-specific antigen (PSA) whose source may be the prostate bed, pelvic lymph nodes (LNs), or distant metastases. Early detection of the disease could improve the efficacy of intervention and reduce treatment-related toxicity. Conventional imaging studies are thought to have low sensitivity at low PSA levels, which poses a challenge since earlier salvage radiotherapy is known to be more effective than late salvage radiotherapy^3,4. Advanced nuclear medicine tests, such as fluciclovine⁵ and prostate-specific membrane antigen (PSMA) positron emission tomography (PET)⁶, have a much higher sensitivity and can detect the location of recurrences at much lower PSA values. Studies have reported patient-based sensitivity and specificity of 98.7–100% and 88.2–100%, respectively^7,8. Recently, the landmark EMPIRE-1 trial showed improved event-free survival with the incorporation of fluciclovine PET into radiation planning after RP⁹. A head-to-head trial has shown that the detection rate and sensitivity of PSMA are superior to those of Axumin for pelvic and extrapelvic disease¹⁰.

Unfortunately, PSMA PET/computed tomography (CT) carries significantly higher average overall costs than CT scans¹⁰. The cost can be prohibitive in low-resource settings and/or if repeated scans are needed. Therefore, significant barriers currently exist to the widespread use of ⁶⁸Ga-PSMA-11 PET/CT for detecting prostate cancer recurrence after radical prostatectomy.

Unprecedented progress has been made in artificial intelligence in the past decade, which has demonstrated great promise in many fields, including computer-aided diagnosis (CAD) of metastatic tumor spread. Lately, researchers have proposed numerous solutions for the classification of various types of metastases¹¹. For example, Zhou et al. demonstrated the feasibility of breast cancer metastasis classification using convolutional neural networks (CNNs)¹², while Ariji et al. designed a CNN for nodal metastasis classification¹³. For metastatic prostate cancer (PCa), Hartenstein et al. presented work on PCa LN metastasis classification¹⁴.

Nevertheless, most current CAD metastasis detection methods are limited to binary patch classification with an evenly balanced mix of positive and negative cases (50%/50%), which would be difficult to apply in the clinical setting¹⁵. The reasons are two-fold. First, extracting patches or voxels from each incoming patient's scans and then feeding them into classification algorithms is too labor-intensive to be included in a clinical workflow. Second, an artificially balanced ratio of positive to negative cases bears little resemblance to the ratio seen in the real-world setting.

Compared to most classification methodologies, modern object detection networks are more powerful tools that can identify and localize abnormalities from the entire input feature maps¹⁶. Lately, Zhao et al. proposed a triple-combining 2.5D U-Net pipeline for metastatic pelvis bone and LN lesion segmentation on ⁶⁸Ga-PSMA-11 PET/CT. This framework consisted of three 2.5D U-Nets, which extracted features from axial, coronal, and sagittal planes and predicted tumor masks based on majority voting¹⁷. They assessed the regime with the input of CT/PET alone or a fusion of the two.

Recent object detection and localization methods can be divided into one-stage and two-stage approaches. One-stage models, including the YOLO series¹⁸ and U-Net derivatives¹⁹, are more efficient, whereas two-stage ones, including the R-CNN family, are more accurate²⁰. Since most tasks in clinical practice place stricter demands on accuracy than on speed, two-stage detectors are more favorable for learning from medical images²¹.

In the present study, building on Mask R-CNN²², we investigated the feasibility of detecting PCa LN metastases based solely on diagnostic CT images with contours of the pelvic lymph node clinical target volumes (CTVs).

Materials and methods

Dataset

Patients and data management

In total, 88 PCa patients who showed positive lymph nodes in PSMA PET/CT at 4 institutions (the Technical University of Munich, the University of California at Los Angeles, Ludwig-Maximilians-University of Munich, and the University of Essen) were included. All patients underwent radical prostatectomy, had BCR without prior radiotherapy and underwent ⁶⁸Ga-PSMA-11 PET/CT at a serum PSA level of less than 1 ng/mL between August 2013 and May 2017 to detect the sites of recurrence. All patients gave written consent to undergo the procedures. The clinical data and Digital Imaging and Communications in Medicine (DICOM) files of all patients were anonymized and imported onto a dedicated radiotherapy contouring workstation at UCLA (MIM, version 6.7.5; MIM Software Inc., location of the company). This post hoc retrospective analysis was approved by the UCLA Institutional Review Board (#12-001882), and the requirement to obtain informed consent was waived. All experiments were performed in accordance with relevant guidelines and regulations.

⁶⁸Ga-PSMA-11 PET/CT image acquisition

⁶⁸Ga-PSMA-11 PET/CT imaging was performed according to recent guidelines²³. Images were acquired on the Siemens Biograph 128 mCT (68%), Siemens Biograph 64 (19%), Siemens Biograph 64 mCT (9%), and GE Healthcare Discovery 690 (5%). The ⁶⁸Ga-PSMA tracer was used at all sites. The median injected dose was 154 MBq (range 65–267 MBq). To reduce bladder activity, patients received 20 mg of furosemide at the time of tracer injection if there was no contraindication²⁴. The median uptake period was 59 min (range 37–132 min). A diagnostic CT scan (200–240 mAs, 120 kV) was performed after intravenous injection of contrast agent, followed by whole-body PET image acquisition (2–4 min/bed position)²⁵.

Pelvic lymph node clinical target volumes and PET lesion contouring

Pelvic lymph node CTVs were contoured on the CT dataset of the PET/CT scan for all 88 patients by an experienced radiation oncologist who was masked to the PET findings, in accordance with the Radiation Therapy Oncology Group (RTOG) consensus contouring guidelines^26,27. CTV is a term commonly used in radiotherapy; in this context, it includes all at-risk LNs plus a margin for microscopic disease. We noticed that in certain cases the pelvic LNs were located at the boundary of the pelvic nodal CTVs drawn following RTOG guidelines, falling 1–2 pixels outside the RTOG contours. To ensure that the pelvic LN masks cover all pixels of the LN metastases and, more importantly, to overcome the weak learning capability of CNN filters at the edges of a feature map, we isotropically expanded the pelvic LN CTVs by 10 pixels (l = 6.48 mm). These wider contours introduced false positives (FPs) within the expansion zone, which were then eliminated at the post-processing stage (see details in “Modeling pipeline”). ⁶⁸Ga-PSMA-11 positive lesions were contoured on the CT images by radiation oncologists. These contours were subsequently used to define ⁶⁸Ga-PSMA-11-based target volumes²⁵.
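
The isotropic CTV expansion can be illustrated with a small sketch. It assumes the expansion is applied slice by slice to a binary mask using a disk-shaped neighbourhood; the function name `expand_mask` is hypothetical, and the source does not state whether the expansion was done in 2D or 3D. The stated 10 pixels for 6.48 mm implies an in-plane spacing of about 0.648 mm per pixel.

```python
import numpy as np


def expand_mask(mask, radius_px=10):
    """Dilate a 2D binary mask by a disk of radius `radius_px` pixels.

    Assumption: in-plane (per-slice) expansion; the paper does not say
    whether the margin was also applied across slices.
    """
    mask = np.asarray(mask, dtype=bool)
    height, width = mask.shape
    # Zero-pad so that shifted views never wrap around the image border.
    padded = np.pad(mask, radius_px, mode="constant", constant_values=False)
    expanded = np.zeros_like(mask)
    for dy in range(-radius_px, radius_px + 1):
        for dx in range(-radius_px, radius_px + 1):
            if dy * dy + dx * dx <= radius_px * radius_px:
                # View of the mask shifted by (dy, dx); OR-ing all shifts
                # inside the disk gives the dilated mask.
                expanded |= padded[
                    radius_px + dy : radius_px + dy + height,
                    radius_px + dx : radius_px + dx + width,
                ]
    return expanded
```

A library routine such as a morphological binary dilation would normally replace the explicit loops; the loop form is used here only to keep the sketch dependency-light.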

Data split

The patients were divided into training (nPatient = 52, nNode = 400, split ratio = 3/5), validation (nPatient = 18, nNode = 143, split ratio = 1/5), and test (nPatient = 18, nNode = 196, split ratio = 1/5) sets, balanced on their National Comprehensive Cancer Network (NCCN) risk groups at initial diagnosis. Details of the split by NCCN risk group are given in Table 1.
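
A risk-group-balanced split of this kind can be sketched as follows. This is only an illustration under assumptions: the function name `stratified_split`, the fixed seed, and the per-group rounding rule are hypothetical, and the source does not describe how the balancing was actually carried out.

```python
import random
from collections import defaultdict


def stratified_split(patient_groups, ratios=(0.6, 0.2, 0.2), seed=0):
    """Split patient IDs into train/val/test within each risk group.

    patient_groups maps a patient ID to its risk-group label.
    """
    by_group = defaultdict(list)
    for patient_id, group in sorted(patient_groups.items()):
        by_group[group].append(patient_id)

    rng = random.Random(seed)
    train, val, test = [], [], []
    for group in sorted(by_group):
        ids = by_group[group]
        rng.shuffle(ids)
        n_train = round(len(ids) * ratios[0])
        n_val = round(len(ids) * ratios[1])
        train += ids[:n_train]
        val += ids[n_train:n_train + n_val]
        test += ids[n_train + n_val:]  # remainder goes to the test set
    return train, val, test
```

Splitting at the patient level, as here, keeps all nodes of one patient in the same set, which matches the patient and node counts reported per set.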

Table 1 Patient split of training, validation, and test sets on NCCN risk groups.

Windowing analysis

To narrow down the area of metastatic LN detection and accentuate the morphological features of metastases, we focused on the area inside the pelvic CTVs and carefully selected windowing strategies for Hounsfield units (HUs) during training. Table 2 lists the representative statistics of window width. Notably, the various window widths were selected by first analyzing the distribution of the HUs of all positive node pixels in the training set and then gradually and symmetrically excluding extreme pixel values at the left and right tails of the distribution based on quantile analysis. In the modeling pipeline below, we explore these PCa LN metastasis HU window widths along with the standard soft tissue HU window (− 125, 225). This windowing logic is referred to as quantile windowing strategies in the following sections.
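
The quantile windowing idea can be sketched in a few lines. The names `quantile_window` and `apply_window` and the `tail` parameter are hypothetical, and rescaling the clipped image to [0, 1] is an assumption made for illustration; the actual quantile ranges are those listed in Table 2.

```python
import numpy as np

# Standard soft tissue window quoted in the text (lower, upper) in HU.
SOFT_TISSUE_WINDOW = (-125.0, 225.0)


def quantile_window(positive_hu, tail=0.05):
    """Return an HU window that symmetrically drops `tail` of each tail."""
    values = np.asarray(positive_hu, dtype=float)
    lower, upper = np.quantile(values, [tail, 1.0 - tail])
    return float(lower), float(upper)


def apply_window(image_hu, window):
    """Clip an HU image to the window and rescale it to [0, 1]."""
    lower, upper = window
    clipped = np.clip(np.asarray(image_hu, dtype=float), lower, upper)
    return (clipped - lower) / (upper - lower)
```

Increasing `tail` step by step yields progressively narrower windows, one per voter in the later bagging stage.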

Table 2 Descriptive statistics of HU distribution with different quantile ranges for PCa LNs masks and metastases.

2.5-Dimensional (2.5D) object detection pipeline

Data preprocessing

As shown in Fig. 1, our data preprocessing pipeline consists of two paths, for images fed into the pretrained network and the fine-tuned model, respectively. For the pretraining path, we sequentially performed 2.5D concatenation, HU transformation, black border crop-out, and soft tissue windowing. For the fine-tuning path, we performed 2.5D concatenation, HU transformation, LN CTV contour mask- and crop-out, and quantile windowing. Specifically, 2.5D here means that we concatenate the central CT slice channel-wise with its adjacent superior and inferior slices. HU transformation converts the DICOM pixel values stored in the bundled “three-channel” images into HUs. The LN CTV contour mask- and crop-out operation sets the pixels outside the expanded central pelvic nodal contours on CT to zero and crops the image to keep only the CTV region, so as to ease the fine-tuning process.
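
The fine-tuning preprocessing path can be sketched as below. The function names are hypothetical, the default rescale slope and intercept are placeholders (real values come from each DICOM header's RescaleSlope and RescaleIntercept), and repeating the edge slice at the top and bottom of the volume is an assumption not stated in the source.

```python
import numpy as np


def stack_25d(volume, index):
    """Turn slice `index` of a (slices, H, W) volume into an (H, W, 3) image
    whose channels are the inferior, central, and superior slices."""
    last = volume.shape[0] - 1
    # Assumption: at the volume boundary the edge slice is repeated.
    neighbours = [max(index - 1, 0), index, min(index + 1, last)]
    return np.stack([volume[i] for i in neighbours], axis=-1)


def to_hu(raw, slope=1.0, intercept=-1024.0):
    """Convert stored DICOM pixel values to Hounsfield units."""
    return np.asarray(raw, dtype=np.float32) * slope + intercept


def mask_and_crop(image, ctv_mask):
    """Zero pixels outside the CTV mask, then crop to its bounding box."""
    if not ctv_mask.any():
        raise ValueError("CTV mask is empty for this slice")
    masked = np.where(ctv_mask[..., None], image, 0)
    rows = np.flatnonzero(ctv_mask.any(axis=1))
    cols = np.flatnonzero(ctv_mask.any(axis=0))
    return masked[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]
```

Windowing and normalization would follow these steps, using either the soft tissue window or one of the quantile windows.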

After the above procedures, both paths finish with uniform normalization and data augmentation of the images. The data were geometrically augmented using random resizing (image largest width to 640–800), horizontal flipping (p = 0.5) and random rotation (angle 0–180°), and morphologically augmented using random Gaussian noise (kernel = 5, sigma = 1) and random brightness.

Modeling pipeline

As shown in Fig. 2, the complete workflow includes three steps: the initial pretrained whole slice imaging (WSI)-Mask R-CNN, the further fine-tuned Regional Mask R-CNN, and “window bagging”. Our rationale is elaborated below.

Pretraining to fine tuning

For Mask R-CNN with ResNet-X²⁸ backbones, researchers commonly use weights pretrained on ImageNet²⁹. Nevertheless, given our small training set, we suspected that training directly from the ImageNet-pretrained weights might not lead to model convergence but would instead overfit the training set. Therefore, we designed a pretraining-to-fine-tuning workflow to maximize the information our model could extract from the limited training set. In the pretraining stage, we fed the detection network with WSIs (CT scans without LN CTV mask-out) and performed a brief, coarse training to let the model grasp the initial coarse morphological structures in the patients’ pelvic CT dataset (WSI-Mask R-CNN). Next, in the fine-tuning stage, we trained the detector on the pelvic nodal masked-out slices, initialized with the WSI-pretrained weights (Regional-Mask R-CNN), which improved the starting point and gave a better chance of reaching the global optimum with back-propagation. Since we only perform object detection in this task, we blocked off the mask branch of the standard Mask R-CNN network in both the pretraining and fine-tuning stages. Modeling details can be seen in Fig. 2.

Prediction post-processing

During experiments, we found that our Regional-Mask R-CNN still suffered from two types of false positives that could benefit from post-processing: predictions near the outer boundary of the expansion zone, and predictions on vascular/bowel structures. Three hyper-parameters (see ${\tau }_{4-6}$ in Table 3) were cross-validated to automate the post-processing. For FPs at the expansion zone boundary, we set ${\tau }_{4}$ to restrict the valid LN expansion zone for predictions to a width within the range of 1–10 pixels. For vascular/bowel structure FPs, we set ${\tau }_{5-6}$: ${\tau }_{6}$ determines the quantile of all HUs within the predicted detection box, and a prediction whose HU at that quantile exceeds the threshold ${\tau }_{5}$ is excluded from the final prediction set, since vessel and bowel patterns both have higher HUs than pelvic nodes on contrast-enhanced CTs.
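
The two post-processing filters can be sketched as a single keep/discard test per detection box. This is one reading of the description, not the authors' code: the function name `keep_detection` is hypothetical, the valid zone is assumed to be supplied as a boolean mask (the CTV expanded by ${\tau }_{4}$ pixels), and a box is assumed to be judged by its center pixel.

```python
import numpy as np


def keep_detection(ct_hu, box, valid_zone, tau5, tau6):
    """Return True if a detection survives both post-processing filters.

    ct_hu      : 2D array of HU values for the central slice
    box        : (x0, y0, x1, y1) in pixel coordinates, x1/y1 exclusive
    valid_zone : boolean mask of the CTV expanded by tau4 pixels
    tau5       : HU threshold; tau6: quantile in [0, 1]
    """
    x0, y0, x1, y1 = box
    center_y, center_x = (y0 + y1) // 2, (x0 + x1) // 2
    # Filter 1: drop boxes centered outside the valid expansion zone.
    if not valid_zone[center_y, center_x]:
        return False
    # Filter 2: drop boxes whose tau6-quantile HU exceeds tau5, which
    # suggests contrast-enhanced vessel or bowel rather than a node.
    box_hu = ct_hu[y0:y1, x0:x1]
    return float(np.quantile(box_hu, tau6)) <= tau5
```

In practice ${\tau }_{4-6}$ would be chosen by cross-validation, as the text describes, rather than fixed by hand.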

Table 3 Tunable hyper-parameters elaboration for the paragraphs of prediction post-processing and “bagging” in Sect. 2.2.2.

“Window bagging”

To further enhance model performance, we bagged multiple post-processed Regional Mask R-CNNs trained with different quantile windowing inputs, the so-called “window bagging”, to count the votes from the crowd. Notably, bootstrapping of the dataset was not conducted for each voter, since we believe that inputs with different quantile windowing diversify the training information and therefore avoid collinearity. Details of our “window bagging” workflow can be seen in Fig. 2.

${\tau }_{1-3}$ are cross-validated hyper-parameters for “window bagging” tuning. ${\tau }_{1}$ is the intersection over union (IoU) threshold for determining the detection boxes generated from different voters as the final “window bagging” prediction. ${\tau }_{2}$ decides the number of voters in the final “window bagging” models. ${\tau }_{3}$ is the IoU for recognition of whether the bagged prediction hits ground truths (GTs).
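
A minimal sketch of IoU-based voting across voters is given below. It is an illustration under assumptions rather than the published procedure: the function names are hypothetical, boxes are `(x0, y0, x1, y1)` tuples on one slice, and the `min_votes` agreement rule is introduced here for illustration (in the text, ${\tau }_{2}$ is the number of voters, not a vote threshold).

```python
def iou(a, b):
    """Intersection over union of two (x0, y0, x1, y1) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def window_bagging(voter_boxes, tau1=0.5, min_votes=2):
    """Keep boxes that at least `min_votes` voters agree on (IoU >= tau1).

    voter_boxes: one list of boxes per voter (one voter per window setting).
    """
    kept = []
    for i, boxes in enumerate(voter_boxes):
        for box in boxes:
            # One vote from the proposing voter plus one per agreeing voter.
            votes = 1 + sum(
                any(iou(box, other) >= tau1 for other in others)
                for j, others in enumerate(voter_boxes)
                if j != i
            )
            already_kept = any(iou(box, k) >= tau1 for k in kept)
            if votes >= min_votes and not already_kept:
                kept.append(box)
    return kept
```

For example, two voters proposing nearly identical boxes produce one bagged box, while a box proposed by a single voter is dropped when `min_votes=2`.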

Loss function

Although hybrid loss functions have been used recently in various deep networks^30,31,32, our loss function was kept the same as in the original Mask R-CNN due to its efficiency with the dataset.

$$L={L}_{cls}+{L}_{box}+{L}_{mask} \tag{1}$$

where ${L}_{cls}$ and ${L}_{box}$ follow the definitions in Faster R-CNN³³ and ${L}_{mask}$ is the average binary cross entropy loss proposed in Mask R-CNN²².
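
For reference, the three terms can be written out as follows. This is a sketch based on the standard formulations in the R-CNN family papers, not a statement of this pipeline's exact implementation. The symbols are notation introduced here: $p_u$ is the predicted probability of the true class $u$, $t^u$ and $v$ are the predicted and target box offsets, and $\hat{y}_{ij}$ and $y_{ij}$ are the predicted and ground-truth mask values on an $m \times m$ grid.

$$L_{cls} = -\log p_u, \qquad L_{box} = \sum_{i \in \{x, y, w, h\}} \mathrm{smooth}_{L_1}\left(t_i^u - v_i\right)$$

$$\mathrm{smooth}_{L_1}(x) = \begin{cases} 0.5\,x^2 & \text{if } |x| < 1 \\ |x| - 0.5 & \text{otherwise} \end{cases}$$

$$L_{mask} = -\frac{1}{m^2} \sum_{1 \le i, j \le m} \left[ y_{ij} \log \hat{y}_{ij} + \left(1 - y_{ij}\right) \log\left(1 - \hat{y}_{ij}\right) \right]$$

The box term applies only to foreground regions of interest. Because the mask branch was blocked off in this work, the $L_{mask}$ term would presumably not contribute during training.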

Model training

Our 2.5D object detection pipeline was implemented with the detectron2 project (https://github.com/facebookresearch/detectron2) using PyTorch and run on a GPU cluster with 4 × RTX A6000 GPUs. Figure 3 shows the two training processes in detail.

For WSI Mask R-CNN, we trained on the three-channel whole slice images with the stochastic gradient descent (SGD) optimizer for 3 k iterations, with a batch size of 64 (4 $\times$ 16), a learning rate (LR) of 0.01 decreasing tenfold at 2 k iterations, a momentum of 0.9, and a weight decay of 0.0001.

For Regional Mask R-CNN, we fine-tuned the pelvic nodal contour masked-out three-channel images using SGD for 6 k iterations with a batch size of 64 (4 $\times$ 16), LR of 0.005 decreasing by tenfold at 4 k and 5 k iterations, respectively, a momentum of 0.9, and weight decay of 0.0001. The final training loss decreased to around 0.4.
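
The two training schedules can be summarized as plain dictionaries together with the step-decay rule they imply. The keys mirror detectron2 configuration names, but whether the authors set exactly these keys is an assumption, as is mapping "blocking the mask branch" to `MODEL.MASK_ON = False`. The helper `learning_rate` is hypothetical and ignores any warm-up phase.

```python
from bisect import bisect_right

# Values taken from the text; key names follow detectron2's config layout.
WSI_PRETRAIN = {
    "SOLVER.IMS_PER_BATCH": 64,
    "SOLVER.BASE_LR": 0.01,
    "SOLVER.MAX_ITER": 3000,
    "SOLVER.STEPS": (2000,),
    "SOLVER.GAMMA": 0.1,
    "SOLVER.MOMENTUM": 0.9,
    "SOLVER.WEIGHT_DECAY": 0.0001,
    "MODEL.MASK_ON": False,  # assumption: detection only, no mask branch
}

REGIONAL_FINETUNE = dict(
    WSI_PRETRAIN,
    **{
        "SOLVER.BASE_LR": 0.005,
        "SOLVER.MAX_ITER": 6000,
        "SOLVER.STEPS": (4000, 5000),
    }
)


def learning_rate(iteration, cfg):
    """Step-decayed LR: multiply by GAMMA once per milestone reached."""
    decays = bisect_right(cfg["SOLVER.STEPS"], iteration)
    return cfg["SOLVER.BASE_LR"] * cfg["SOLVER.GAMMA"] ** decays
```

Under this rule the fine-tuning LR is 0.005 up to iteration 4 k, 0.0005 until 5 k, and 0.00005 for the final 1 k iterations.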

Model evaluation

We reported the best performance, tuned on individual criteria including sensitivity, precision, and F-1 score, for the steps of prediction post-processing and “window bagging”. Sensitivity is defined at the metastasis level: if the model locates at least one slice of a metastatic LN, we count the entire metastasis as a hit. Precision is defined at the slice level, counting each slice of metastases captured by the detection box predictions. All metrics are evaluated at the node level rather than the patient level.
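
The metric definitions can be made concrete with a short sketch. It reflects one interpretation of the text: a prediction counts as a true positive when it overlaps a ground-truth box on the same slice with IoU of at least ${\tau }_{3}$, sensitivity is the fraction of nodes hit on at least one slice, and precision is the fraction of predicted boxes that are true positives. The function names and data layout are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two (x0, y0, x1, y1) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def evaluate(ground_truths, predictions, tau3=0.5):
    """Return (sensitivity, precision, f1).

    ground_truths: list of (node_id, slice_index, box)
    predictions  : list of (slice_index, box)
    """
    hit_nodes = set()
    true_positives = 0
    for pred_slice, pred_box in predictions:
        matched = [
            node
            for node, gt_slice, gt_box in ground_truths
            if gt_slice == pred_slice and iou(pred_box, gt_box) >= tau3
        ]
        if matched:
            true_positives += 1
            hit_nodes.update(matched)  # one hit slice marks the whole node
    all_nodes = {node for node, _, _ in ground_truths}
    sensitivity = len(hit_nodes) / len(all_nodes) if all_nodes else 0.0
    precision = true_positives / len(predictions) if predictions else 0.0
    total = sensitivity + precision
    f1 = 2 * sensitivity * precision / total if total else 0.0
    return sensitivity, precision, f1
```

Because a node spanning several slices needs only one hit, metastasis-level sensitivity can be high even when slice-level precision is modest.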

Results

Positive pelvic LN GTs with the CTV contours are visualized in Fig. 4. Qualitative and quantitative results are presented in Fig. 5 and Table 4, respectively. Figure 5 enlarges representative 2D images to highlight the sub-regions near the predicted or ground-truth positive LNs, together with the detection boxes. Note that a positive LN can be found in multiple adjacent 2D slices, and several positive LNs can appear in one slice. Visually, Fig. 5 shows no clear difference between true positives (TPs), FPs, and false negatives (FNs), illustrating the challenge of using CT directly for manual lymph node detection and classification.

Table 4 Performance comparison and ablation studies on the test set of ⁶⁸Ga-PSMA-11 PET/CT.

Table 4 shows a quantitative comparison of detection methods. The single ImageNet-pretrained Regional Mask R-CNN achieved robust sensitivity, with ~ 80% AUC and > 60% of the positive LNs detected, but low precision, under 30%. Fine-tuning individual Regional Mask R-CNNs from the weights of the WSI Mask R-CNN improved the precision by ~ 5% without compromising sensitivity. Prediction post-processing improved each learner by another 15%. Lastly, by combining “window bagging” of Regional Mask R-CNNs pretrained on WSIs with prediction post-processing, we obtained another 5% gain in precision, with a high sensitivity of 83.351% and an AUC of 90.034%.

Discussion and conclusion

In this study, we developed a 2.5D deep learning pipeline for detecting metastatic prostate cancer LNs. As shown in Fig. 5, the differences between negative and positive nodes are subtle on CT, making it impractical for human observers to perform the detection task. However, after supervised learning based on PSMA PET, our AI pipeline located the majority of positive pelvic LNs based solely on the pelvic LN region extracted from CT scans, achieving an AUC of 90.034%, a sensitivity of 83.351% and a specificity of 58.621% on the 196 positive pelvic LNs (18 patients) in the test set. Our results compare favorably with the triple-combining 2.5D U-Net proposed by Zhao et al., for which a specificity of 54.8% and a positive predictive value of 59.7% were reported when CT alone was input to their network¹⁷.

Object detection of metastatic PCa lymph nodes using WSI CT scans is a challenging task, mainly due to the enormous class imbalance between positive and negative voxels, the almost identical morphological patterns of abnormal and normal LNs, the large variance in the appearance of normal and abnormal tissues, the interference from complex pelvic structures (vascular, bowel, and pelvic bone structures), the infeasibility of balancing positive and negative LNs on a WSI, and, in this specific task, a relatively small dataset for training the deep learning network. Nevertheless, our object detection pipeline still achieved superior sensitivity and relatively lower specificity than the easier binary classification problem.

We addressed these challenges with four strategies: transfer learning from WSIs, fine-tuning on regional pelvic LN CTVs, prediction post-processing, and “window bagging”. Our results show an additive and progressive improvement, indicating that these strategies act through independent mechanisms: (1) pretraining on entire CT slices provides more background information; (2) precise regional searching within CTVs greatly simplifies the complexity of feature learning; (3) prediction post-processing with tuned hyper-parameters helps refine the spatial and pixel-wise search regions; (4) “window bagging” of voters synthesizes individual training cohorts to reduce FPs while improving the robustness of sensitivity.

The present study has important clinical implications. Pelvic LN recurrence after definitive local therapy can be treated with external beam radiation therapy with or without androgen deprivation therapy. Many studies have demonstrated the good efficacy and safety profile of whole pelvic radiation with a simultaneous integrated boost to lymph nodes with gross disease^3,34. Another more targeted yet experimental approach is to deliver stereotactic body radiation therapy specifically to the individual involved lymph nodes without irradiating the pelvic lymph node region comprehensively^35,36,37. In either approach, detailed information regarding the location of pelvic LNs harboring PCa is essential for treatment planning. Traditional CT-based detection largely relies on morphological characteristics of the LNs, such as size (≥ 9–10 mm), presence of a fatty hilum, shape (oval vs. round), and the short/long axis ratio³⁸. PSMA PET/CT was able to detect LN metastasis in nodes under 10 mm in size, with one study reporting a 60% detection rate for nodes between 2 and 5 mm³⁹. Patients with a lower Gleason score (GS) tended to have smaller PSMA-positive LNs (mean 7.7 mm) than patients in the intermediate (mean 9.4 mm) and high GS cohorts. Based on the CT morphology criteria, only 34% of low GS patients, 56% of intermediate GS patients, and 53% of high GS patients were considered CT positive⁴⁰. The examples shown in Fig. 5 confirm the challenge of visually detecting positive lymph nodes.

As PSMA PET/CT has yet to become widely available due to financial and availability barriers, a low-cost and easily accessible alternative approach that can help predict the presence and location of potential pelvic LN involvement based solely on conventional diagnostic CT is extremely appealing. The method developed here is not intended to replace PSMA PET/CT. Rather, it may help clinicians select patients who may benefit the most from PSMA PET/CT. The high accuracy of classifying patients with or without positive LNs is conducive to such a task.

The current dataset with 52 training patients is still far from sufficient, leaving room to further reduce the FPs and FNs with more training data. Additionally, the current pipeline benefits from manual pelvic LN CTV segmentation, which helps focus on a smaller and more relevant search volume. However, manual labeling of the structure can be inconsistent. Moreover, LN CTVs for radiotherapy purposes do not precisely delineate the individual pelvic lymph nodes. Additional non-LN tissues are included in the CTV, complicating the detection task. In the future, an automated pelvic LN segmentation network can be trained to improve both aspects based on curated CT with detailed labeling of the structure, such as the data released by the CAMELYON17 challenges. We also plan to apply more complex z-dimensional slice fusion strategies to provide more context information for the network and to add more background information via pretraining on other datasets, including DeepLesion⁴¹, Luna16⁴² and others. In addition, adding attention gating to the network is another direction to explore. Lastly, as an extension of this work, the performance of our proposed approach can be compared with that of a capsule network, since capsule networks can preserve spatial relationships of learned features and have been proposed recently for image classification tasks^43,44,45.

Another limitation of the study is that the PSMA PET is not a perfect ground truth for training and validation. PSMA PET detection sensitivity has been reported between 40 and 60% in a study⁴⁶ for patients with low PSA levels. However, the same method used in the study should be applicable as enhanced diagnostic information from histopathology and complementary imaging modalities, e.g., hyperpolarized C-13 MRI, becomes available.

Data availability

The datasets generated and/or analysed during the current study are not publicly available due to a confidentiality agreement associated with using these data and institutional policy but are available from the corresponding author on reasonable request with a legal data transfer agreement between institutions.

Code availability

The code for implementing this project is open-sourced at https://github.com/FluteXu/PSMA-Detection.

References

Torre, L. A. et al. Global cancer statistics, 2012: Global Cancer Statistics, 2012. CA. Cancer J. Clin. 65, 87–108 (2015).
Article Google Scholar
Schaeffer, E. et al. NCCN guidelines insights: Prostate cancer, Version 1. 2021: Featured updates to the NCCN guidelines. J. Natl. Compr. Canc. Netw. 19, 134–143 (2021).
Article CAS Google Scholar
Briganti, A. et al. Early salvage radiation therapy does not compromise cancer control in patients with pT3N0 prostate cancer after radical prostatectomy: Results of a match-controlled multi-institutional analysis. Eur. Urol. 62, 472–487 (2012).
Article Google Scholar
Fossati, N. et al. Impact of early salvage radiation therapy in patients with persistently elevated or rising prostate-specific antigen after radical prostatectomy. Eur. Urol. 73, 436–444 (2018).
Article Google Scholar
Parent, E. E. & Schuster, D. M. Update on ¹⁸F-fluciclovine PET for prostate cancer imaging. J. Nucl. Med. 59, 733–739 (2018).
Article CAS Google Scholar
De Visschere, P. J. L. et al. A systematic review on the role of imaging in early recurrent prostate cancer. Eur. Urol. Oncol. 2, 47–76 (2019).
Article Google Scholar
Moradi, F., Farolfi, A., Fanti, S. & Iagaru, A. Prostate cancer: Molecular imaging and MRI. Eur. J. Radiol. 143, 109893 (2021).
Article Google Scholar
Pyka, T. et al. Comparison of bone scintigraphy and ⁶⁸Ga-PSMA PET for skeletal staging in prostate cancer. Eur. J. Nucl. Med. Mol. Imaging 43, 2114–2121 (2016).
Article CAS Google Scholar
Jani, A. B. et al. 18F-fluciclovine-PET/CT imaging versus conventional imaging alone to guide postprostatectomy salvage radiotherapy for prostate cancer (EMPIRE-1): A single centre, open-label, phase 2/3 randomised controlled trial. Lancet Lond. Engl. 397, 1895–1904 (2021).
Article CAS Google Scholar
Calais, J. et al. 18F-fluciclovine PET-CT and ⁶⁸Ga-PSMA-11 PET-CT in patients with early biochemical recurrence after prostatectomy: A prospective, single-centre, single-arm, comparative imaging trial. Lancet Oncol. 20, 1286–1294 (2019).
Article CAS Google Scholar
Zheng, Q. et al. Artificial intelligence performance in detecting tumor metastasis from medical radiology imaging: A systematic review and meta-analysis. EClinicalMedicine 31, 100669 (2021).
Article Google Scholar
Zhou, L.-Q. et al. Lymph node metastasis prediction from primary breast cancer US images using deep learning. Radiology 294, 19–28 (2020).
Article Google Scholar
Ariji, Y. et al. Contrast-enhanced computed tomography image assessment of cervical lymph node metastasis in patients with oral cancer by using a deep learning system of artificial intelligence. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. 127, 458–463 (2019).
Article Google Scholar
Hartenstein, A. et al. Prostate cancer nodal staging: Using deep learning to predict ⁶⁸Ga-PSMA-positivity from CT imaging alone. Sci. Rep. 10, 3398 (2020).
Article ADS CAS Google Scholar
Roy, K., Banik, D., Bhattacharjee, D. & Nasipuri, M. Patch-based system for Classification of Breast Histology images using deep learning. Comput. Med. Imaging Graph. 71, 90–103 (2019).
Article Google Scholar
Zhao, Z.-Q., Zheng, P., Xu, S.-T. & Wu, X. Object detection with deep learning: A review. IEEE Trans. Neural Netw. Learn. Syst. 30, 3212–3232 (2019).
Article Google Scholar
Zhao, Y. et al. Deep neural network for automatic characterization of lesions on ⁶⁸Ga-PSMA-11 PET/CT. Eur. J. Nucl. Med. Mol. Imaging 47, 603–613 (2020).
Article CAS Google Scholar
Redmon, J., Divvala, S., Girshick, R. & Farhadi, A. You only look once: Unified, real-time object detection. 779–788 (2016).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. http://arxiv.org/abs/150504597 [Cs] (2015).
Lu, X., Li, Q., Li, B. & Yan, J. MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection. http://arxiv.org/abs/00911528 [Cs] (2020).
Zhang, J., Liu, M. & Shen, D. Detecting anatomical landmarks from limited medical imaging data using two-stage task-oriented deep neural networks. IEEE Trans. Image Process Publ. IEEE Signal Process. Soc. 26, 4753–4764 (2017).
Article ADS Google Scholar
He, K., Gkioxari, G., Dollár, P. & Girshick, R. Mask R-CNN. http://arxiv.org/abs/170306870 [Cs] (2018).
Fendler, W. P. et al. ⁶⁸Ga-PSMA PET/CT: Joint EANM and SNMMI procedure guideline for prostate cancer imaging: Version 1.0. Eur. J. Nucl. Med. Mol. Imaging 44, 1014–1024 (2017).
Article Google Scholar
Haupt, F. et al. ⁶⁸Ga-PSMA-11 PET/CT in patients with recurrent prostate cancer—a modified protocol compared with the common protocol. Eur. J. Nucl. Med. Mol. Imaging 47, 624–631 (2020).
Article CAS Google Scholar
Calais, J. et al. ⁶⁸Ga-PSMA-11 PET/CT mapping of prostate cancer biochemical recurrence after radical prostatectomy in 270 patients with a PSA level of less than 1.0 ng/mL: Impact on salvage radiotherapy planning. J. Nucl. Med. 59, 230–237 (2018).
Article CAS Google Scholar
Lawton, C. A. F. et al. RTOG GU radiation oncology specialists reach consensus on pelvic lymph node volumes for high-risk prostate cancer. Int. J. Radiat. Oncol. 74, 383–387 (2009).
Michalski, J. M. et al. Development of RTOG consensus guidelines for the definition of the clinical target volume for postoperative conformal radiation therapy for prostate cancer. Int. J. Radiat. Oncol. 76, 361–368 (2010).
He, K., Zhang, X., Ren, S. & Sun, J. Deep Residual Learning for Image Recognition. http://arxiv.org/abs/1512.03385 [Cs] (2015).
Deng, J. et al. ImageNet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition 248–255 (IEEE, 2009). https://doi.org/10.1109/CVPR.2009.5206848.
Goceri, E. Diagnosis of skin diseases in the era of deep learning and mobile technology. Comput. Biol. Med. 134, 104458 (2021).
Göçeri, E. An application for automated diagnosis of facial dermatological diseases. İzmir Katip Çelebi Üniv. Sağlık Bilim Fakültesi Derg. 6, 91–99 (2021).
Li, Z. et al. Low-dose CT image denoising with improving WGAN and hybrid loss function. Comput. Math. Methods Med. 2021, 1–14 (2021).
Ren, S., He, K., Girshick, R. & Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. http://arxiv.org/abs/1506.01497 [Cs] (2016).
Fodor, A. et al. Toxicity and efficacy of salvage carbon 11-choline positron emission tomography/computed tomography-guided radiation therapy in patients with lymph node recurrence of prostate cancer. BJU Int. 119, 406–413 (2017).
De Bleser, E. et al. Metastasis-directed therapy in treating nodal oligorecurrent prostate cancer: A multi-institutional analysis comparing the outcome and toxicity of stereotactic body radiotherapy and elective nodal radiotherapy. Eur. Urol. 76, 732–739 (2019).
Lépinoy, A. et al. Salvage extended field or involved field nodal irradiation in 18F-fluorocholine PET/CT oligorecurrent nodal failures from prostate cancer. Eur. J. Nucl. Med. Mol. Imaging 46, 40–48 (2019).
Ost, P. et al. Metastasis-directed therapy of regional and distant recurrences after curative treatment of prostate cancer: A systematic review of the literature. Eur. Urol. 67, 852–863 (2015).
Flechsig, P. et al. Quantitative volumetric CT-histogram analysis in N-staging of 18F-FDG-equivocal patients with lung cancer. J. Nucl. Med. 55, 559–564 (2014).
van Leeuwen, P. J. et al. Prospective evaluation of ⁶⁸Gallium-prostate-specific membrane antigen positron emission tomography/computed tomography for preoperative lymph node staging in prostate cancer. BJU Int. 119, 209–215 (2017).
Vinsensia, M. et al. ⁶⁸Ga-PSMA PET/CT and volumetric morphology of PET-positive lymph nodes stratified by tumor differentiation of prostate cancer. J. Nucl. Med. 58, 1949–1955 (2017).
Yan, K., Wang, X., Lu, L. & Summers, R. M. DeepLesion: Automated mining of large-scale lesion annotations and universal lesion detection with deep learning. J. Med. Imaging Bellingham Wash 5, 036501 (2018).
Setio, A. A. A. et al. Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: The LUNA16 challenge. Med. Image Anal. 42, 1–13 (2017).
Goceri, E. CapsNet topology to classify tumours from brain images and comparative evaluation. IET Image Process. 14, 882–889 (2020).
Goceri, E. Analysis of capsule networks for image classification (2021).
Goceri, E. Capsule neural networks in classification of skin lesions. 29–36 (2021).
Fendler, W. P. et al. Prostate-specific membrane antigen ligand positron emission tomography in men with nonmetastatic castration-resistant prostate cancer. Clin. Cancer Res. 25, 7448–7454 (2019).

Acknowledgements

We thank Drs. Jeremie Calais and Johannes Czernin for the PSMA PET/CT data. The study is supported in part by NIH R01CA259008 and DOD W81XWH2210044.

Author information

Authors and Affiliations

Computer Science, University of California, Los Angeles, CA, 90035, USA
Di Xu
Radiation Oncology, University of California, Los Angeles, CA, 90035, USA
Di Xu, Martin Ma, Minsong Cao, Amar U. Kishan & Ke Sheng
Department of Radiation Oncology, VA Greater Los Angeles Healthcare System, Los Angeles, CA, 90035, USA
Nicholas G. Nickols
Computer Science, Pepperdine University, 24255 Pacific Coast Hwy, Los Angeles, CA, 90263, USA
Fabien Scalzo
Department of Radiation Oncology, University of California, San Francisco, CA, 94115, USA
Ke Sheng

Contributions

D.X. performed the experiments, analyzed the data and drafted the manuscript. M.M., A.U.K. and N.N. reviewed the data, performed manual labeling, verified the results, and helped draft the manuscript. M.C. helped organize and register the images and edited the manuscript. F.S. and K.S. provided technical advice to D.X. K.S. initiated the project, designed the experiments, and helped write the manuscript.

Corresponding author

Correspondence to Ke Sheng.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Xu, D., Ma, M., Cao, M. et al. Mask R-CNN assisted 2.5D object detection pipeline of ⁶⁸Ga-PSMA-11 PET/CT-positive metastatic pelvic lymph node after radical prostatectomy from solely CT imaging. Sci Rep 13, 1696 (2023). https://doi.org/10.1038/s41598-023-28669-y

Received: 02 July 2022
Accepted: 23 January 2023
Published: 30 January 2023
DOI: https://doi.org/10.1038/s41598-023-28669-y

This article is cited by

Improved prostate cancer diagnosis using a modified ResNet50-based deep learning architecture
- Fatma M. Talaat
- Shaker El-Sappagh
- Esraa Hassan
BMC Medical Informatics and Decision Making (2024)