Development and Validation of an Electronic Medical Record Algorithm to Identify Phenotypes of Rotator Cuff Tear

Chan Gao; Run Fan; Gregory D. Ayers; Ayush Giri; Kindred Harris; Ravi Atreya; Pedro L. Teixeira; Nitin B. Jain

doi:10.1002/pmrj.12367

Development and Validation of an Electronic Medical Record Algorithm to Identify Phenotypes of Rotator Cuff Tear

Chan Gao, Run Fan, Gregory D. Ayers, Ayush Giri, Kindred Harris, Ravi Atreya, Pedro L. Teixeira, Nitin B. Jain

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

Background: A lack of studies with large sample sizes of patients with rotator cuff tears is a barrier to performing clinical and genomic research. Objective: To develop and validate an electronic medical record (EMR)–based algorithm to identify individuals with and without rotator cuff tear. Design: We used a deidentified version of the EMR of more than 2 million subjects. A screening algorithm was applied to classify subjects into likely rotator cuff tear and likely normal rotator cuff groups. From these subjects, 500 likely rotator cuff tear and 500 likely normal rotator cuff were randomly chosen for algorithm development. Chart review of all 1000 subjects confirmed the true phenotype of rotator cuff tear or normal rotator cuff based on magnetic resonance imaging and operative report. An algorithm was then developed based on logistic regression and validation of the algorithm was performed. Results: The variables significantly predicting rotator cuff tear included the number of times a Current Procedural Terminology code related to rotator cuff procedures was used (odds ratio [OR] = 3.3; 95% confidence interval [CI]: 1.6-6.8 for ≥3 vs 0), the number of times a term related to rotator cuff lesions occurred in radiology reports (OR = 2.2; 95% CI: 1.2-4.1 for ≥1 vs 0), and the number of times a term related to rotator cuff lesions occurred in physician notes (OR = 4.5; 95% CI: 2.2-9.1 for 1 or 2 times vs 0). This phenotyping algorithm had a specificity of 0.89 (95% CI: 0.79-0.95) for rotator cuff tear, area under the curve (AUC) of 0.842, and diagnostic likelihood ratios (DLRs), DLR+ and DLR− of 5.94 (95% CI: 3.07-11.48) and 0.363 (95% CI: 0.291-0.453). Conclusion: Our informatics algorithm enables identification of cohorts of individuals with and without rotator cuff tear from an EMR-based data set with moderate accuracy.

Original language	English (US)
Pages (from-to)	1099-1105
Number of pages	7
Journal	PM and R
Volume	12
Issue number	11
DOIs	https://doi.org/10.1002/pmrj.12367
State	Published - Nov 1 2020
Externally published	Yes

ASJC Scopus subject areas

Physical Therapy, Sports Therapy and Rehabilitation
Rehabilitation
Neurology
Clinical Neurology

Access to Document

10.1002/pmrj.12367

Cite this

@article{5c5d5fa4e4864595bc308e8549dab5bf,

title = "Development and Validation of an Electronic Medical Record Algorithm to Identify Phenotypes of Rotator Cuff Tear",

abstract = "Background: A lack of studies with large sample sizes of patients with rotator cuff tears is a barrier to performing clinical and genomic research. Objective: To develop and validate an electronic medical record (EMR)–based algorithm to identify individuals with and without rotator cuff tear. Design: We used a deidentified version of the EMR of more than 2 million subjects. A screening algorithm was applied to classify subjects into likely rotator cuff tear and likely normal rotator cuff groups. From these subjects, 500 likely rotator cuff tear and 500 likely normal rotator cuff were randomly chosen for algorithm development. Chart review of all 1000 subjects confirmed the true phenotype of rotator cuff tear or normal rotator cuff based on magnetic resonance imaging and operative report. An algorithm was then developed based on logistic regression and validation of the algorithm was performed. Results: The variables significantly predicting rotator cuff tear included the number of times a Current Procedural Terminology code related to rotator cuff procedures was used (odds ratio [OR] = 3.3; 95% confidence interval [CI]: 1.6-6.8 for ≥3 vs 0), the number of times a term related to rotator cuff lesions occurred in radiology reports (OR = 2.2; 95% CI: 1.2-4.1 for ≥1 vs 0), and the number of times a term related to rotator cuff lesions occurred in physician notes (OR = 4.5; 95% CI: 2.2-9.1 for 1 or 2 times vs 0). This phenotyping algorithm had a specificity of 0.89 (95% CI: 0.79-0.95) for rotator cuff tear, area under the curve (AUC) of 0.842, and diagnostic likelihood ratios (DLRs), DLR+ and DLR− of 5.94 (95% CI: 3.07-11.48) and 0.363 (95% CI: 0.291-0.453). Conclusion: Our informatics algorithm enables identification of cohorts of individuals with and without rotator cuff tear from an EMR-based data set with moderate accuracy.",

author = "Chan Gao and Run Fan and Ayers, {Gregory D.} and Ayush Giri and Kindred Harris and Ravi Atreya and Teixeira, {Pedro L.} and Jain, {Nitin B.}",

note = "Funding Information: The project described was supported by CTSA award No. UL1TR000445 from the National Center for Advancing Translational Sciences. Its contents are solely the responsibility of the authors and do not necessarily represent official views of the National Center for Advancing Translational Sciences or the National Institutes of Health. Pedro Teixeira was supported by U01HG008672 (VGER, The Vanderbilt Genomic‐Electronic Records Project) for his work on the algorithm. Publisher Copyright: {\textcopyright} 2020 American Academy of Physical Medicine and Rehabilitation",

year = "2020",

month = nov,

day = "1",

doi = "10.1002/pmrj.12367",

language = "English (US)",

volume = "12",

pages = "1099--1105",

journal = "PM and R",

issn = "1934-1482",

publisher = "Elsevier Inc.",

number = "11",

}

TY - JOUR

T1 - Development and Validation of an Electronic Medical Record Algorithm to Identify Phenotypes of Rotator Cuff Tear

AU - Gao, Chan

AU - Fan, Run

AU - Ayers, Gregory D.

AU - Giri, Ayush

AU - Harris, Kindred

AU - Atreya, Ravi

AU - Teixeira, Pedro L.

AU - Jain, Nitin B.

N1 - Funding Information: The project described was supported by CTSA award No. UL1TR000445 from the National Center for Advancing Translational Sciences. Its contents are solely the responsibility of the authors and do not necessarily represent official views of the National Center for Advancing Translational Sciences or the National Institutes of Health. Pedro Teixeira was supported by U01HG008672 (VGER, The Vanderbilt Genomic‐Electronic Records Project) for his work on the algorithm. Publisher Copyright: © 2020 American Academy of Physical Medicine and Rehabilitation

PY - 2020/11/1

Y1 - 2020/11/1

N2 - Background: A lack of studies with large sample sizes of patients with rotator cuff tears is a barrier to performing clinical and genomic research. Objective: To develop and validate an electronic medical record (EMR)–based algorithm to identify individuals with and without rotator cuff tear. Design: We used a deidentified version of the EMR of more than 2 million subjects. A screening algorithm was applied to classify subjects into likely rotator cuff tear and likely normal rotator cuff groups. From these subjects, 500 likely rotator cuff tear and 500 likely normal rotator cuff were randomly chosen for algorithm development. Chart review of all 1000 subjects confirmed the true phenotype of rotator cuff tear or normal rotator cuff based on magnetic resonance imaging and operative report. An algorithm was then developed based on logistic regression and validation of the algorithm was performed. Results: The variables significantly predicting rotator cuff tear included the number of times a Current Procedural Terminology code related to rotator cuff procedures was used (odds ratio [OR] = 3.3; 95% confidence interval [CI]: 1.6-6.8 for ≥3 vs 0), the number of times a term related to rotator cuff lesions occurred in radiology reports (OR = 2.2; 95% CI: 1.2-4.1 for ≥1 vs 0), and the number of times a term related to rotator cuff lesions occurred in physician notes (OR = 4.5; 95% CI: 2.2-9.1 for 1 or 2 times vs 0). This phenotyping algorithm had a specificity of 0.89 (95% CI: 0.79-0.95) for rotator cuff tear, area under the curve (AUC) of 0.842, and diagnostic likelihood ratios (DLRs), DLR+ and DLR− of 5.94 (95% CI: 3.07-11.48) and 0.363 (95% CI: 0.291-0.453). Conclusion: Our informatics algorithm enables identification of cohorts of individuals with and without rotator cuff tear from an EMR-based data set with moderate accuracy.

AB - Background: A lack of studies with large sample sizes of patients with rotator cuff tears is a barrier to performing clinical and genomic research. Objective: To develop and validate an electronic medical record (EMR)–based algorithm to identify individuals with and without rotator cuff tear. Design: We used a deidentified version of the EMR of more than 2 million subjects. A screening algorithm was applied to classify subjects into likely rotator cuff tear and likely normal rotator cuff groups. From these subjects, 500 likely rotator cuff tear and 500 likely normal rotator cuff were randomly chosen for algorithm development. Chart review of all 1000 subjects confirmed the true phenotype of rotator cuff tear or normal rotator cuff based on magnetic resonance imaging and operative report. An algorithm was then developed based on logistic regression and validation of the algorithm was performed. Results: The variables significantly predicting rotator cuff tear included the number of times a Current Procedural Terminology code related to rotator cuff procedures was used (odds ratio [OR] = 3.3; 95% confidence interval [CI]: 1.6-6.8 for ≥3 vs 0), the number of times a term related to rotator cuff lesions occurred in radiology reports (OR = 2.2; 95% CI: 1.2-4.1 for ≥1 vs 0), and the number of times a term related to rotator cuff lesions occurred in physician notes (OR = 4.5; 95% CI: 2.2-9.1 for 1 or 2 times vs 0). This phenotyping algorithm had a specificity of 0.89 (95% CI: 0.79-0.95) for rotator cuff tear, area under the curve (AUC) of 0.842, and diagnostic likelihood ratios (DLRs), DLR+ and DLR− of 5.94 (95% CI: 3.07-11.48) and 0.363 (95% CI: 0.291-0.453). Conclusion: Our informatics algorithm enables identification of cohorts of individuals with and without rotator cuff tear from an EMR-based data set with moderate accuracy.

UR - http://www.scopus.com/inward/record.url?scp=85083993311&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85083993311&partnerID=8YFLogxK

U2 - 10.1002/pmrj.12367

DO - 10.1002/pmrj.12367

M3 - Article

C2 - 32198840

AN - SCOPUS:85083993311

SN - 1934-1482

VL - 12

SP - 1099

EP - 1105

JO - PM and R

JF - PM and R

IS - 11

ER -

Development and Validation of an Electronic Medical Record Algorithm to Identify Phenotypes of Rotator Cuff Tear

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this