An evaluation of methods for the stratified analysis of clustered binary data in community intervention trials

James X. Song; Chul W. Ahn

doi:10.1002/sim.1390

An evaluation of methods for the stratified analysis of clustered binary data in community intervention trials

James X. Song, Chul W. Ahn

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

A simulation study is conducted in a community intervention setting. Several methods of stratified analysis of clustered binary data are compared in terms of empirical significance and empirical power levels. They are the Mantel-Haenszel test statistic (χ_MH²), the adjusted Mantel-Haenszel test statistic of Donald-Donner (χ_DD²), Rao-Scott (χ_RSN² and χ_RSP²), and Zhang-Boos (χ_ZBN² and χ_ZBP²), Wald (χ_W²), robust Wald (χ_RW²), score (χ_S²), robust score (χ_RS²), and the test statistic based on generalized linear mixed model (GLMM) (χ_GLMM². When ρ≠0, χ_MH² has inflated type I error, and it should not be used when observations are correlated. The results also warn of the use of χ_RSN² and χ_RW² due to their poor performance in terms of empirical significance level. χ_ZBP² and χ_GLMM² have better empirical significance levels as compared to other statistics; however, χ_ZBP² tends to have lower empirical powers than other statistics when the number of clusters (N) is less than 24. χ_RSP² provides the highest empirical powers when ρ≥0.1 and N ≤ 12. When ρ≤0.01, we recommend the use of χ_RS² and χ_GLMM² since they have better overall performance in terms of empirical significance levels and empirical power levels.

Original language	English (US)
Pages (from-to)	2205-2216
Number of pages	12
Journal	Statistics in Medicine
Volume	22
Issue number	13
DOIs	https://doi.org/10.1002/sim.1390
State	Published - Jul 15 2003

Keywords

Community intervention
Correlated binary data
Simulation
Stratified analysis

ASJC Scopus subject areas

Epidemiology
Statistics and Probability

Access to Document

10.1002/sim.1390

Cite this

@article{17db93df7d32443fa448f6f862d162dd,

title = "An evaluation of methods for the stratified analysis of clustered binary data in community intervention trials",

abstract = "A simulation study is conducted in a community intervention setting. Several methods of stratified analysis of clustered binary data are compared in terms of empirical significance and empirical power levels. They are the Mantel-Haenszel test statistic (χMH2), the adjusted Mantel-Haenszel test statistic of Donald-Donner (χDD2), Rao-Scott (χRSN2 and χRSP2), and Zhang-Boos (χZBN2 and χZBP2), Wald (χW2), robust Wald (χRW2), score (χS2), robust score (χRS2), and the test statistic based on generalized linear mixed model (GLMM) (χGLMM2. When ρ≠0, χMH2 has inflated type I error, and it should not be used when observations are correlated. The results also warn of the use of χRSN2 and χRW2 due to their poor performance in terms of empirical significance level. χZBP2 and χGLMM2 have better empirical significance levels as compared to other statistics; however, χZBP2 tends to have lower empirical powers than other statistics when the number of clusters (N) is less than 24. χRSP2 provides the highest empirical powers when ρ≥0.1 and N ≤ 12. When ρ≤0.01, we recommend the use of χRS2 and χGLMM2 since they have better overall performance in terms of empirical significance levels and empirical power levels.",

keywords = "Community intervention, Correlated binary data, Simulation, Stratified analysis",

author = "Song, {James X.} and Ahn, {Chul W.}",

year = "2003",

month = jul,

day = "15",

doi = "10.1002/sim.1390",

language = "English (US)",

volume = "22",

pages = "2205--2216",

journal = "Statistics in Medicine",

issn = "0277-6715",

publisher = "John Wiley and Sons Ltd",

number = "13",

}

TY - JOUR

T1 - An evaluation of methods for the stratified analysis of clustered binary data in community intervention trials

AU - Song, James X.

AU - Ahn, Chul W.

PY - 2003/7/15

Y1 - 2003/7/15

N2 - A simulation study is conducted in a community intervention setting. Several methods of stratified analysis of clustered binary data are compared in terms of empirical significance and empirical power levels. They are the Mantel-Haenszel test statistic (χMH2), the adjusted Mantel-Haenszel test statistic of Donald-Donner (χDD2), Rao-Scott (χRSN2 and χRSP2), and Zhang-Boos (χZBN2 and χZBP2), Wald (χW2), robust Wald (χRW2), score (χS2), robust score (χRS2), and the test statistic based on generalized linear mixed model (GLMM) (χGLMM2. When ρ≠0, χMH2 has inflated type I error, and it should not be used when observations are correlated. The results also warn of the use of χRSN2 and χRW2 due to their poor performance in terms of empirical significance level. χZBP2 and χGLMM2 have better empirical significance levels as compared to other statistics; however, χZBP2 tends to have lower empirical powers than other statistics when the number of clusters (N) is less than 24. χRSP2 provides the highest empirical powers when ρ≥0.1 and N ≤ 12. When ρ≤0.01, we recommend the use of χRS2 and χGLMM2 since they have better overall performance in terms of empirical significance levels and empirical power levels.

AB - A simulation study is conducted in a community intervention setting. Several methods of stratified analysis of clustered binary data are compared in terms of empirical significance and empirical power levels. They are the Mantel-Haenszel test statistic (χMH2), the adjusted Mantel-Haenszel test statistic of Donald-Donner (χDD2), Rao-Scott (χRSN2 and χRSP2), and Zhang-Boos (χZBN2 and χZBP2), Wald (χW2), robust Wald (χRW2), score (χS2), robust score (χRS2), and the test statistic based on generalized linear mixed model (GLMM) (χGLMM2. When ρ≠0, χMH2 has inflated type I error, and it should not be used when observations are correlated. The results also warn of the use of χRSN2 and χRW2 due to their poor performance in terms of empirical significance level. χZBP2 and χGLMM2 have better empirical significance levels as compared to other statistics; however, χZBP2 tends to have lower empirical powers than other statistics when the number of clusters (N) is less than 24. χRSP2 provides the highest empirical powers when ρ≥0.1 and N ≤ 12. When ρ≤0.01, we recommend the use of χRS2 and χGLMM2 since they have better overall performance in terms of empirical significance levels and empirical power levels.

KW - Community intervention

KW - Correlated binary data

KW - Simulation

KW - Stratified analysis

UR - http://www.scopus.com/inward/record.url?scp=0037772413&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0037772413&partnerID=8YFLogxK

U2 - 10.1002/sim.1390

DO - 10.1002/sim.1390

M3 - Article

C2 - 12820284

AN - SCOPUS:0037772413

SN - 0277-6715

VL - 22

SP - 2205

EP - 2216

JO - Statistics in Medicine

JF - Statistics in Medicine

IS - 13

ER -

An evaluation of methods for the stratified analysis of clustered binary data in community intervention trials

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this