Performance of Deep Learning and Genitourinary Radiologists in Detection of Prostate Cancer Using 3-T Multiparametric Magnetic Resonance Imaging

Ruiming Cao; Xinran Zhong; Sohrab Afshari; Ely Felker; Voraparee Suvannarerg; Teeravut Tubtawee; Sitaram Vangala; Fabien Scalzo; Steven Raman; Kyunghyun Sung

doi:10.1002/jmri.27595

Performance of Deep Learning and Genitourinary Radiologists in Detection of Prostate Cancer Using 3-T Multiparametric Magnetic Resonance Imaging

Ruiming Cao, Xinran Zhong, Sohrab Afshari, Ely Felker, Voraparee Suvannarerg, Teeravut Tubtawee, Sitaram Vangala, Fabien Scalzo, Steven Raman, Kyunghyun Sung

Research output: Contribution to journal › Article › peer-review

18 Scopus citations

Abstract

Background: Several deep learning-based techniques have been developed for prostate cancer (PCa) detection using multiparametric magnetic resonance imaging (mpMRI), but few of them have been rigorously evaluated relative to radiologists' performance or whole-mount histopathology (WMHP). Purpose: To compare the performance of a previously proposed deep learning algorithm, FocalNet, and expert radiologists in the detection of PCa on mpMRI with WMHP as the reference. Study Type: Retrospective, single-center study. Subjects: A total of 553 patients (development cohort: 427 patients; evaluation cohort: 126 patients) who underwent 3-T mpMRI prior to radical prostatectomy from October 2010 to February 2018. Field Strength/Sequence: 3-T, T2-weighted imaging and diffusion-weighted imaging. Assessment: FocalNet was trained on the development cohort to predict PCa locations by detection points, with a confidence value for each point, on the evaluation cohort. Four fellowship-trained genitourinary (GU) radiologists independently evaluated the evaluation cohort to detect suspicious PCa foci, annotate detection point locations, and assign a five-point suspicion score (1: least suspicious, 5: most suspicious) for each annotated detection point. The PCa detection performance of FocalNet and radiologists were evaluated by the lesion detection sensitivity vs. the number of false-positive detections at different thresholds on suspicion scores. Clinically significant lesions: Gleason Group (GG) ≥ 2 or pathological size ≥ 10 mm. Index lesions: the highest GG and the largest pathological size (secondary). Statistical Tests: Bootstrap hypothesis test for the detection sensitivity between radiologists and FocalNet. Results: For the overall differential detection sensitivity, FocalNet was 5.1% and 4.7% below the radiologists for clinically significant and index lesions, respectively; however, the differences were not statistically significant (P = 0.413 and P = 0.282, respectively). Data Conclusion: FocalNet achieved slightly lower but not statistically significant PCa detection performance compared with GU radiologists. Compared with radiologists, FocalNet demonstrated similar detection performance for a highly sensitive setting (suspicion score ≥ 1) or a highly specific setting (suspicion score = 5), while lower performance in between. Level of Evidence: 3. Technical Efficacy Stage: 2.

Original language	English (US)
Pages (from-to)	474-483
Number of pages	10
Journal	Journal of Magnetic Resonance Imaging
Volume	54
Issue number	2
DOIs	https://doi.org/10.1002/jmri.27595
State	Published - Aug 2021

Keywords

automatic cancer detection
deep learning
multiparametric MRI
prostate cancer

ASJC Scopus subject areas

Radiology Nuclear Medicine and imaging

Access to Document

10.1002/jmri.27595

Cite this

@article{197a892cdfc2437896583ac2ec3512cb,

title = "Performance of Deep Learning and Genitourinary Radiologists in Detection of Prostate Cancer Using 3-T Multiparametric Magnetic Resonance Imaging",

abstract = "Background: Several deep learning-based techniques have been developed for prostate cancer (PCa) detection using multiparametric magnetic resonance imaging (mpMRI), but few of them have been rigorously evaluated relative to radiologists' performance or whole-mount histopathology (WMHP). Purpose: To compare the performance of a previously proposed deep learning algorithm, FocalNet, and expert radiologists in the detection of PCa on mpMRI with WMHP as the reference. Study Type: Retrospective, single-center study. Subjects: A total of 553 patients (development cohort: 427 patients; evaluation cohort: 126 patients) who underwent 3-T mpMRI prior to radical prostatectomy from October 2010 to February 2018. Field Strength/Sequence: 3-T, T2-weighted imaging and diffusion-weighted imaging. Assessment: FocalNet was trained on the development cohort to predict PCa locations by detection points, with a confidence value for each point, on the evaluation cohort. Four fellowship-trained genitourinary (GU) radiologists independently evaluated the evaluation cohort to detect suspicious PCa foci, annotate detection point locations, and assign a five-point suspicion score (1: least suspicious, 5: most suspicious) for each annotated detection point. The PCa detection performance of FocalNet and radiologists were evaluated by the lesion detection sensitivity vs. the number of false-positive detections at different thresholds on suspicion scores. Clinically significant lesions: Gleason Group (GG) ≥ 2 or pathological size ≥ 10 mm. Index lesions: the highest GG and the largest pathological size (secondary). Statistical Tests: Bootstrap hypothesis test for the detection sensitivity between radiologists and FocalNet. Results: For the overall differential detection sensitivity, FocalNet was 5.1% and 4.7% below the radiologists for clinically significant and index lesions, respectively; however, the differences were not statistically significant (P = 0.413 and P = 0.282, respectively). Data Conclusion: FocalNet achieved slightly lower but not statistically significant PCa detection performance compared with GU radiologists. Compared with radiologists, FocalNet demonstrated similar detection performance for a highly sensitive setting (suspicion score ≥ 1) or a highly specific setting (suspicion score = 5), while lower performance in between. Level of Evidence: 3. Technical Efficacy Stage: 2.",

keywords = "automatic cancer detection, deep learning, multiparametric MRI, prostate cancer",

author = "Ruiming Cao and Xinran Zhong and Sohrab Afshari and Ely Felker and Voraparee Suvannarerg and Teeravut Tubtawee and Sitaram Vangala and Fabien Scalzo and Steven Raman and Kyunghyun Sung",

note = "Publisher Copyright: {\textcopyright} 2021 International Society for Magnetic Resonance in Medicine.",

year = "2021",

month = aug,

doi = "10.1002/jmri.27595",

language = "English (US)",

volume = "54",

pages = "474--483",

journal = "Journal of Magnetic Resonance Imaging",

issn = "1053-1807",

publisher = "John Wiley and Sons Inc.",

number = "2",

}

TY - JOUR

T1 - Performance of Deep Learning and Genitourinary Radiologists in Detection of Prostate Cancer Using 3-T Multiparametric Magnetic Resonance Imaging

AU - Cao, Ruiming

AU - Zhong, Xinran

AU - Afshari, Sohrab

AU - Felker, Ely

AU - Suvannarerg, Voraparee

AU - Tubtawee, Teeravut

AU - Vangala, Sitaram

AU - Scalzo, Fabien

AU - Raman, Steven

AU - Sung, Kyunghyun

PY - 2021/8

Y1 - 2021/8

N2 - Background: Several deep learning-based techniques have been developed for prostate cancer (PCa) detection using multiparametric magnetic resonance imaging (mpMRI), but few of them have been rigorously evaluated relative to radiologists' performance or whole-mount histopathology (WMHP). Purpose: To compare the performance of a previously proposed deep learning algorithm, FocalNet, and expert radiologists in the detection of PCa on mpMRI with WMHP as the reference. Study Type: Retrospective, single-center study. Subjects: A total of 553 patients (development cohort: 427 patients; evaluation cohort: 126 patients) who underwent 3-T mpMRI prior to radical prostatectomy from October 2010 to February 2018. Field Strength/Sequence: 3-T, T2-weighted imaging and diffusion-weighted imaging. Assessment: FocalNet was trained on the development cohort to predict PCa locations by detection points, with a confidence value for each point, on the evaluation cohort. Four fellowship-trained genitourinary (GU) radiologists independently evaluated the evaluation cohort to detect suspicious PCa foci, annotate detection point locations, and assign a five-point suspicion score (1: least suspicious, 5: most suspicious) for each annotated detection point. The PCa detection performance of FocalNet and radiologists were evaluated by the lesion detection sensitivity vs. the number of false-positive detections at different thresholds on suspicion scores. Clinically significant lesions: Gleason Group (GG) ≥ 2 or pathological size ≥ 10 mm. Index lesions: the highest GG and the largest pathological size (secondary). Statistical Tests: Bootstrap hypothesis test for the detection sensitivity between radiologists and FocalNet. Results: For the overall differential detection sensitivity, FocalNet was 5.1% and 4.7% below the radiologists for clinically significant and index lesions, respectively; however, the differences were not statistically significant (P = 0.413 and P = 0.282, respectively). Data Conclusion: FocalNet achieved slightly lower but not statistically significant PCa detection performance compared with GU radiologists. Compared with radiologists, FocalNet demonstrated similar detection performance for a highly sensitive setting (suspicion score ≥ 1) or a highly specific setting (suspicion score = 5), while lower performance in between. Level of Evidence: 3. Technical Efficacy Stage: 2.

AB - Background: Several deep learning-based techniques have been developed for prostate cancer (PCa) detection using multiparametric magnetic resonance imaging (mpMRI), but few of them have been rigorously evaluated relative to radiologists' performance or whole-mount histopathology (WMHP). Purpose: To compare the performance of a previously proposed deep learning algorithm, FocalNet, and expert radiologists in the detection of PCa on mpMRI with WMHP as the reference. Study Type: Retrospective, single-center study. Subjects: A total of 553 patients (development cohort: 427 patients; evaluation cohort: 126 patients) who underwent 3-T mpMRI prior to radical prostatectomy from October 2010 to February 2018. Field Strength/Sequence: 3-T, T2-weighted imaging and diffusion-weighted imaging. Assessment: FocalNet was trained on the development cohort to predict PCa locations by detection points, with a confidence value for each point, on the evaluation cohort. Four fellowship-trained genitourinary (GU) radiologists independently evaluated the evaluation cohort to detect suspicious PCa foci, annotate detection point locations, and assign a five-point suspicion score (1: least suspicious, 5: most suspicious) for each annotated detection point. The PCa detection performance of FocalNet and radiologists were evaluated by the lesion detection sensitivity vs. the number of false-positive detections at different thresholds on suspicion scores. Clinically significant lesions: Gleason Group (GG) ≥ 2 or pathological size ≥ 10 mm. Index lesions: the highest GG and the largest pathological size (secondary). Statistical Tests: Bootstrap hypothesis test for the detection sensitivity between radiologists and FocalNet. Results: For the overall differential detection sensitivity, FocalNet was 5.1% and 4.7% below the radiologists for clinically significant and index lesions, respectively; however, the differences were not statistically significant (P = 0.413 and P = 0.282, respectively). Data Conclusion: FocalNet achieved slightly lower but not statistically significant PCa detection performance compared with GU radiologists. Compared with radiologists, FocalNet demonstrated similar detection performance for a highly sensitive setting (suspicion score ≥ 1) or a highly specific setting (suspicion score = 5), while lower performance in between. Level of Evidence: 3. Technical Efficacy Stage: 2.

KW - automatic cancer detection

KW - deep learning

KW - multiparametric MRI

KW - prostate cancer

UR - http://www.scopus.com/inward/record.url?scp=85102266898&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85102266898&partnerID=8YFLogxK

U2 - 10.1002/jmri.27595

DO - 10.1002/jmri.27595

M3 - Article

C2 - 33709532

AN - SCOPUS:85102266898

SN - 1053-1807

VL - 54

SP - 474

EP - 483

JO - Journal of Magnetic Resonance Imaging

JF - Journal of Magnetic Resonance Imaging

IS - 2

ER -

Performance of Deep Learning and Genitourinary Radiologists in Detection of Prostate Cancer Using 3-T Multiparametric Magnetic Resonance Imaging

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this