Deep learning from multiple experts improves identification of amyloid neuropathologies

Daniel R. Wong; Ziqi Tang; Nicholas C. Mew; Sakshi Das; Justin Athey; Kirsty E. McAleese; Julia K. Kofler; Margaret E. Flanagan; Ewa Borys; Charles L. White; Atul J. Butte; Brittany N. Dugger; Michael J. Keiser

doi:10.1186/s40478-022-01365-0

Deep learning from multiple experts improves identification of amyloid neuropathologies

Daniel R. Wong, Ziqi Tang, Nicholas C. Mew, Sakshi Das, Justin Athey, Kirsty E. McAleese, Julia K. Kofler, Margaret E. Flanagan, Ewa Borys, Charles L. White, Atul J. Butte, Brittany N. Dugger, Michael J. Keiser

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Pathologists can label pathologies differently, making it challenging to yield consistent assessments in the absence of one ground truth. To address this problem, we present a deep learning (DL) approach that draws on a cohort of experts, weighs each contribution, and is robust to noisy labels. We collected 100,495 annotations on 20,099 candidate amyloid beta neuropathologies (cerebral amyloid angiopathy (CAA), and cored and diffuse plaques) from three institutions, independently annotated by five experts. DL methods trained on a consensus-of-two strategy yielded 12.6–26% improvements by area under the precision recall curve (AUPRC) when compared to those that learned individualized annotations. This strategy surpassed individual-expert models, even when unfairly assessed on benchmarks favoring them. Moreover, ensembling over individual models was robust to hidden random annotators. In blind prospective tests of 52,555 subsequent expert-annotated images, the models labeled pathologies like their human counterparts (consensus model AUPRC = 0.74 cored; 0.69 CAA). This study demonstrates a means to combine multiple ground truths into a common-ground DL model that yields consistent diagnoses informed by multiple and potentially variable expert opinions.

Original language	English (US)
Article number	66
Journal	Acta Neuropathologica Communications
Volume	10
Issue number	1
DOIs	https://doi.org/10.1186/s40478-022-01365-0
State	Published - Dec 2022

Keywords

Algorithms
Amyloid beta
Consensus
Deep learning
Expert annotators
Histopathology

ASJC Scopus subject areas

Pathology and Forensic Medicine
Clinical Neurology
Cellular and Molecular Neuroscience

Access to Document

10.1186/s40478-022-01365-0

Cite this

Wong, D. R., Tang, Z., Mew, N. C., Das, S., Athey, J., McAleese, K. E., Kofler, J. K., Flanagan, M. E., Borys, E., White, C. L., Butte, A. J., Dugger, B. N., & Keiser, M. J. (2022). Deep learning from multiple experts improves identification of amyloid neuropathologies. Acta Neuropathologica Communications, 10(1), Article 66. https://doi.org/10.1186/s40478-022-01365-0

@article{a2e9220b03d74def86a8b49927fbf4ea,

title = "Deep learning from multiple experts improves identification of amyloid neuropathologies",

abstract = "Pathologists can label pathologies differently, making it challenging to yield consistent assessments in the absence of one ground truth. To address this problem, we present a deep learning (DL) approach that draws on a cohort of experts, weighs each contribution, and is robust to noisy labels. We collected 100,495 annotations on 20,099 candidate amyloid beta neuropathologies (cerebral amyloid angiopathy (CAA), and cored and diffuse plaques) from three institutions, independently annotated by five experts. DL methods trained on a consensus-of-two strategy yielded 12.6–26% improvements by area under the precision recall curve (AUPRC) when compared to those that learned individualized annotations. This strategy surpassed individual-expert models, even when unfairly assessed on benchmarks favoring them. Moreover, ensembling over individual models was robust to hidden random annotators. In blind prospective tests of 52,555 subsequent expert-annotated images, the models labeled pathologies like their human counterparts (consensus model AUPRC = 0.74 cored; 0.69 CAA). This study demonstrates a means to combine multiple ground truths into a common-ground DL model that yields consistent diagnoses informed by multiple and potentially variable expert opinions.",

keywords = "Algorithms, Amyloid beta, Consensus, Deep learning, Expert annotators, Histopathology",

author = "Wong, {Daniel R.} and Ziqi Tang and Mew, {Nicholas C.} and Sakshi Das and Justin Athey and McAleese, {Kirsty E.} and Kofler, {Julia K.} and Flanagan, {Margaret E.} and Ewa Borys and White, {Charles L.} and Butte, {Atul J.} and Dugger, {Brittany N.} and Keiser, {Michael J.}",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",

year = "2022",

month = dec,

doi = "10.1186/s40478-022-01365-0",

language = "English (US)",

volume = "10",

journal = "Acta Neuropathologica Communications",

issn = "2051-5960",

publisher = "BioMed Central",

number = "1",

}

TY - JOUR

T1 - Deep learning from multiple experts improves identification of amyloid neuropathologies

AU - Wong, Daniel R.

AU - Tang, Ziqi

AU - Mew, Nicholas C.

AU - Das, Sakshi

AU - Athey, Justin

AU - McAleese, Kirsty E.

AU - Kofler, Julia K.

AU - Flanagan, Margaret E.

AU - Borys, Ewa

AU - White, Charles L.

AU - Butte, Atul J.

AU - Dugger, Brittany N.

AU - Keiser, Michael J.

PY - 2022/12

Y1 - 2022/12

N2 - Pathologists can label pathologies differently, making it challenging to yield consistent assessments in the absence of one ground truth. To address this problem, we present a deep learning (DL) approach that draws on a cohort of experts, weighs each contribution, and is robust to noisy labels. We collected 100,495 annotations on 20,099 candidate amyloid beta neuropathologies (cerebral amyloid angiopathy (CAA), and cored and diffuse plaques) from three institutions, independently annotated by five experts. DL methods trained on a consensus-of-two strategy yielded 12.6–26% improvements by area under the precision recall curve (AUPRC) when compared to those that learned individualized annotations. This strategy surpassed individual-expert models, even when unfairly assessed on benchmarks favoring them. Moreover, ensembling over individual models was robust to hidden random annotators. In blind prospective tests of 52,555 subsequent expert-annotated images, the models labeled pathologies like their human counterparts (consensus model AUPRC = 0.74 cored; 0.69 CAA). This study demonstrates a means to combine multiple ground truths into a common-ground DL model that yields consistent diagnoses informed by multiple and potentially variable expert opinions.

AB - Pathologists can label pathologies differently, making it challenging to yield consistent assessments in the absence of one ground truth. To address this problem, we present a deep learning (DL) approach that draws on a cohort of experts, weighs each contribution, and is robust to noisy labels. We collected 100,495 annotations on 20,099 candidate amyloid beta neuropathologies (cerebral amyloid angiopathy (CAA), and cored and diffuse plaques) from three institutions, independently annotated by five experts. DL methods trained on a consensus-of-two strategy yielded 12.6–26% improvements by area under the precision recall curve (AUPRC) when compared to those that learned individualized annotations. This strategy surpassed individual-expert models, even when unfairly assessed on benchmarks favoring them. Moreover, ensembling over individual models was robust to hidden random annotators. In blind prospective tests of 52,555 subsequent expert-annotated images, the models labeled pathologies like their human counterparts (consensus model AUPRC = 0.74 cored; 0.69 CAA). This study demonstrates a means to combine multiple ground truths into a common-ground DL model that yields consistent diagnoses informed by multiple and potentially variable expert opinions.

KW - Algorithms

KW - Amyloid beta

KW - Consensus

KW - Deep learning

KW - Expert annotators

KW - Histopathology

UR - http://www.scopus.com/inward/record.url?scp=85128893080&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85128893080&partnerID=8YFLogxK

U2 - 10.1186/s40478-022-01365-0

DO - 10.1186/s40478-022-01365-0

M3 - Article

C2 - 35484610

AN - SCOPUS:85128893080

SN - 2051-5960

VL - 10

JO - Acta Neuropathologica Communications

JF - Acta Neuropathologica Communications

IS - 1

M1 - 66

ER -

Deep learning from multiple experts improves identification of amyloid neuropathologies

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this