Interrater Reliability in Assessing Quality of Diagnostic Accuracy Studies Using the QUADAS Tool. A Preliminary Assessment

William Hollingworth; L. Santiago Medina; Robert E. Lenkinski; Dean K. Shibata; Byron Bernal; David Zurakowski; Bryan Comstock; Jeffrey G. Jarvik

doi:10.1016/j.acra.2006.03.008

Research output: Contribution to journal › Article › peer-review

46 Scopus citations

Abstract

Rationale and Objectives: Quality Assessment of Diagnostic Accuracy Studies (QUADAS) is a new tool to measure the methodological quality of diagnostic accuracy studies in systematic reviews. We used data from a systematic review of magnetic resonance spectroscopy (MRS) in the characterization of suspected brain tumors to provide a preliminary evaluation of the inter-rater reliability of QUADAS.

Materials and Methods: A structured literature search identified 19 diagnostic accuracy studies. These publications were distributed randomly to primary and secondary reviewers for dual independent assessment. Reviewers recorded methodological quality by using QUADAS on a custom-designed spreadsheet. We calculated correlation, percentage of agreement, and κ statistic to assess inter-rater reliability.

Results: Most studies in our review were judged to have used an accurate reference standard. Conversely, the MRS literature frequently failed to specify the length of time between index and reference tests or that the clinicians were unaware of the index test findings when reporting the reference standard. There was good correlation (ρ = 0.78) between reviewers in assessment of the overall number of quality criteria met. However, mean agreement for individual QUADAS questions was only fair (κ = 0.22) and ranged from no agreement beyond chance (κ < 0) to moderate agreement (κ = 0.58).

Conclusion: Inter-rater reliability in our study was relatively low. Nevertheless, we believe that QUADAS potentially is a useful tool for highlighting the strengths and weaknesses of existing diagnostic accuracy studies. Low reliability suggests that different reviewers will reach different conclusions if QUADAS is used to exclude "low-quality" articles from meta-analyses. We discuss methods for improving the validity and reliability of QUADAS.
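The agreement measures named in the abstract (percentage of agreement and the κ statistic) can be illustrated with a minimal Python sketch. It assumes unweighted Cohen's κ for two raters, which the abstract does not confirm as the variant the authors used. The function names and the example ratings are hypothetical and are not data from the study.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Fraction of items on which the two raters gave the same answer."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa: (p_o - p_e) / (1 - p_e).

    p_o is the observed agreement; p_e is the agreement expected by
    chance from each rater's marginal category frequencies.
    """
    n = len(rater_a)
    p_o = percent_agreement(rater_a, rater_b)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    p_e = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if p_e == 1.0:
        # Both raters used one identical category throughout:
        # kappa is undefined in this case.
        return float("nan")
    return (p_o - p_e) / (1 - p_e)


# Hypothetical answers from two reviewers to a single QUADAS question
# across ten studies (illustrative only, not taken from the paper).
reviewer_1 = ["yes", "yes", "no", "yes", "unclear",
              "no", "yes", "yes", "no", "yes"]
reviewer_2 = ["yes", "no", "no", "yes", "yes",
              "no", "yes", "unclear", "no", "yes"]

print(f"agreement = {percent_agreement(reviewer_1, reviewer_2):.2f}")
print(f"kappa     = {cohen_kappa(reviewer_1, reviewer_2):.2f}")
```

Because κ subtracts the agreement expected by chance, it can be much lower than the raw percentage of agreement, and it falls below zero when raters agree less often than chance would predict. This is the "no agreement beyond chance (κ < 0)" case mentioned in the Results.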

Original language	English (US)
Pages (from-to)	803-810
Number of pages	8
Journal	Academic Radiology
Volume	13
Issue number	7
DOIs	https://doi.org/10.1016/j.acra.2006.03.008
State	Published - Jul 2006

Keywords

Sensitivity and specificity
evidence-based medicine
methods
radiology
review, systematic

ASJC Scopus subject areas

Radiology, Nuclear Medicine and Imaging

Cite this

@article{7755d868b50b4a158f78d08fb198b0fe,
  title = "Interrater Reliability in Assessing Quality of Diagnostic Accuracy Studies Using the {QUADAS} Tool. A Preliminary Assessment",
abstract = "Rationale and Objectives: Quality Assessment of Diagnostic Accuracy Studies (QUADAS) is a new tool to measure the methodological quality of diagnostic accuracy studies in systematic reviews. We used data from a systematic review of magnetic resonance spectroscopy (MRS) in the characterization of suspected brain tumors to provide a preliminary evaluation of the inter-rater reliability of QUADAS. Materials and Methods: A structured literature search identified 19 diagnostic accuracy studies. These publications were distributed randomly to primary and secondary reviewers for dual independent assessment. Reviewers recorded methodological quality by using QUADAS on a custom-designed spreadsheet. We calculated correlation, percentage of agreement, and κ statistic to assess inter-rater reliability. Results: Most studies in our review were judged to have used an accurate reference standard. Conversely, the MRS literature frequently failed to specify the length of time between index and reference tests or that the clinicians were unaware of the index test findings when reporting the reference standard. There was good correlation (ρ = 0.78) between reviewers in assessment of the overall number of quality criteria met. However, mean agreement for individual QUADAS questions was only fair (κ = 0.22) and ranged from no agreement beyond chance (κ < 0) to moderate agreement (κ = 0.58). Conclusion: Inter-rater reliability in our study was relatively low. Nevertheless, we believe that QUADAS potentially is a useful tool for highlighting the strengths and weaknesses of existing diagnostic accuracy studies. Low reliability suggests that different reviewers will reach different conclusions if QUADAS is used to exclude {"}low-quality{"} articles from meta-analyses. We discuss methods for improving the validity and reliability of QUADAS.",
  keywords = "Sensitivity and specificity, evidence-based medicine, methods, radiology, review, systematic",
  author = "Hollingworth, William and Medina, L. Santiago and Lenkinski, Robert E. and Shibata, Dean K. and Bernal, Byron and Zurakowski, David and Comstock, Bryan and Jarvik, Jeffrey G.",
  note = "Funding Information: The systematic review was funded by a grant from the Neuroradiology Education and Research Foundation. The sponsor played no role in the interpretation of data, writing of the manuscript, or the decision to submit for publication. The views expressed in this paper are those of the authors and not necessarily of the sponsor.",
  year = "2006",
  month = jul,
  doi = "10.1016/j.acra.2006.03.008",
  language = "English (US)",
  volume = "13",
  pages = "803--810",
  journal = "Academic Radiology",
  issn = "1076-6332",
  publisher = "Elsevier USA",
  number = "7",
}

TY - JOUR
T1 - Interrater Reliability in Assessing Quality of Diagnostic Accuracy Studies Using the QUADAS Tool. A Preliminary Assessment
AU - Hollingworth, William
AU - Medina, L. Santiago
AU - Lenkinski, Robert E.
AU - Shibata, Dean K.
AU - Bernal, Byron
AU - Zurakowski, David
AU - Comstock, Bryan
AU - Jarvik, Jeffrey G.
N1 - Funding Information: The systematic review was funded by a grant from the Neuroradiology Education and Research Foundation. The sponsor played no role in the interpretation of data, writing of the manuscript, or the decision to submit for publication. The views expressed in this paper are those of the authors and not necessarily of the sponsor.
PY - 2006/7
Y1 - 2006/7
AB - Rationale and Objectives: Quality Assessment of Diagnostic Accuracy Studies (QUADAS) is a new tool to measure the methodological quality of diagnostic accuracy studies in systematic reviews. We used data from a systematic review of magnetic resonance spectroscopy (MRS) in the characterization of suspected brain tumors to provide a preliminary evaluation of the inter-rater reliability of QUADAS. Materials and Methods: A structured literature search identified 19 diagnostic accuracy studies. These publications were distributed randomly to primary and secondary reviewers for dual independent assessment. Reviewers recorded methodological quality by using QUADAS on a custom-designed spreadsheet. We calculated correlation, percentage of agreement, and κ statistic to assess inter-rater reliability. Results: Most studies in our review were judged to have used an accurate reference standard. Conversely, the MRS literature frequently failed to specify the length of time between index and reference tests or that the clinicians were unaware of the index test findings when reporting the reference standard. There was good correlation (ρ = 0.78) between reviewers in assessment of the overall number of quality criteria met. However, mean agreement for individual QUADAS questions was only fair (κ = 0.22) and ranged from no agreement beyond chance (κ < 0) to moderate agreement (κ = 0.58). Conclusion: Inter-rater reliability in our study was relatively low. Nevertheless, we believe that QUADAS potentially is a useful tool for highlighting the strengths and weaknesses of existing diagnostic accuracy studies. Low reliability suggests that different reviewers will reach different conclusions if QUADAS is used to exclude "low-quality" articles from meta-analyses. We discuss methods for improving the validity and reliability of QUADAS.
KW - Sensitivity and specificity
KW - evidence-based medicine
KW - methods
KW - radiology
KW - review, systematic
UR - http://www.scopus.com/inward/record.url?scp=33744999823&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33744999823&partnerID=8YFLogxK
U2 - 10.1016/j.acra.2006.03.008
DO - 10.1016/j.acra.2006.03.008
M3 - Article
C2 - 16777553
AN - SCOPUS:33744999823
SN - 1076-6332
VL - 13
SP - 803
EP - 810
JO - Academic Radiology
JF - Academic Radiology
IS - 7
ER -
