Increasing the power: A practical approach to goodness-of-fit test for logistic regression models with continuous predictors

Xian Jin Xie; Jane Pendergast; William Clarke

doi:10.1016/j.csda.2007.09.027

Research output: Contribution to journal › Article › peer-review

27 Scopus citations

Abstract

When continuous predictors are present, classical Pearson and deviance goodness-of-fit tests to assess logistic model fit break down. The Hosmer-Lemeshow test can be used in these situations. While simple to perform and widely used, it does not have desirable power in many cases and provides no further information on the source of any detectable lack of fit. Tsiatis proposed a score statistic to test for covariate regional effects. While conceptually elegant, its lack of a general rule for how to partition the covariate space has, to a certain degree, limited its popularity. We propose a new method for goodness-of-fit testing that uses a very general partitioning strategy (clustering) in the covariate space and either a Pearson statistic or a score statistic. Properties of the proposed statistics are discussed, and a simulation study demonstrates increased power to detect model misspecification in a variety of settings. An application of these different methods on data from a clinical trial illustrates their use. Discussions on further improvement of the proposed tests and extending this new method to other data situations, such as ordinal response regression models, are also included.

Original language	English (US)
Pages (from-to)	2703-2713
Number of pages	11
Journal	Computational Statistics and Data Analysis
Volume	52
Issue number	5
DOIs	https://doi.org/10.1016/j.csda.2007.09.027
State	Published - Jan 20 2008

Keywords

Cluster analysis
Continuous covariates
Generalized linear model
Goodness-of-fit test
Logistic regression

ASJC Scopus subject areas

Statistics and Probability
Computational Mathematics
Computational Theory and Mathematics
Applied Mathematics

Cite this

@article{ef8cf8dad37f4c3284f54ad23d0a2b13,
title = "Increasing the power: A practical approach to goodness-of-fit test for logistic regression models with continuous predictors",
abstract = "When continuous predictors are present, classical Pearson and deviance goodness-of-fit tests to assess logistic model fit break down. The Hosmer-Lemeshow test can be used in these situations. While simple to perform and widely used, it does not have desirable power in many cases and provides no further information on the source of any detectable lack of fit. Tsiatis proposed a score statistic to test for covariate regional effects. While conceptually elegant, its lack of a general rule for how to partition the covariate space has, to a certain degree, limited its popularity. We propose a new method for goodness-of-fit testing that uses a very general partitioning strategy (clustering) in the covariate space and either a Pearson statistic or a score statistic. Properties of the proposed statistics are discussed, and a simulation study demonstrates increased power to detect model misspecification in a variety of settings. An application of these different methods on data from a clinical trial illustrates their use. Discussions on further improvement of the proposed tests and extending this new method to other data situations, such as ordinal response regression models, are also included.",
keywords = "Cluster analysis, Continuous covariates, Generalized linear model, Goodness-of-fit test, Logistic regression",
author = "Xie, {Xian Jin} and Jane Pendergast and William Clarke",
year = "2008",
month = jan,
day = "20",
doi = "10.1016/j.csda.2007.09.027",
language = "English (US)",
volume = "52",
pages = "2703--2713",
journal = "Computational Statistics and Data Analysis",
issn = "0167-9473",
publisher = "Elsevier",
number = "5",
}

TY  - JOUR
T1  - Increasing the power
T2  - A practical approach to goodness-of-fit test for logistic regression models with continuous predictors
AU  - Xie, Xian Jin
AU  - Pendergast, Jane
AU  - Clarke, William
PY  - 2008/1/20
Y1  - 2008/1/20
N2  - When continuous predictors are present, classical Pearson and deviance goodness-of-fit tests to assess logistic model fit break down. The Hosmer-Lemeshow test can be used in these situations. While simple to perform and widely used, it does not have desirable power in many cases and provides no further information on the source of any detectable lack of fit. Tsiatis proposed a score statistic to test for covariate regional effects. While conceptually elegant, its lack of a general rule for how to partition the covariate space has, to a certain degree, limited its popularity. We propose a new method for goodness-of-fit testing that uses a very general partitioning strategy (clustering) in the covariate space and either a Pearson statistic or a score statistic. Properties of the proposed statistics are discussed, and a simulation study demonstrates increased power to detect model misspecification in a variety of settings. An application of these different methods on data from a clinical trial illustrates their use. Discussions on further improvement of the proposed tests and extending this new method to other data situations, such as ordinal response regression models, are also included.
AB  - When continuous predictors are present, classical Pearson and deviance goodness-of-fit tests to assess logistic model fit break down. The Hosmer-Lemeshow test can be used in these situations. While simple to perform and widely used, it does not have desirable power in many cases and provides no further information on the source of any detectable lack of fit. Tsiatis proposed a score statistic to test for covariate regional effects. While conceptually elegant, its lack of a general rule for how to partition the covariate space has, to a certain degree, limited its popularity. We propose a new method for goodness-of-fit testing that uses a very general partitioning strategy (clustering) in the covariate space and either a Pearson statistic or a score statistic. Properties of the proposed statistics are discussed, and a simulation study demonstrates increased power to detect model misspecification in a variety of settings. An application of these different methods on data from a clinical trial illustrates their use. Discussions on further improvement of the proposed tests and extending this new method to other data situations, such as ordinal response regression models, are also included.
KW  - Cluster analysis
KW  - Continuous covariates
KW  - Generalized linear model
KW  - Goodness-of-fit test
KW  - Logistic regression
UR  - http://www.scopus.com/inward/record.url?scp=38149094864&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=38149094864&partnerID=8YFLogxK
U2  - 10.1016/j.csda.2007.09.027
DO  - 10.1016/j.csda.2007.09.027
M3  - Article
AN  - SCOPUS:38149094864
SN  - 0167-9473
VL  - 52
SP  - 2703
EP  - 2713
JO  - Computational Statistics and Data Analysis
JF  - Computational Statistics and Data Analysis
IS  - 5
ER  - 
