A family-based likelihood ratio test for general pedigree structures that allows for genotyping error and missing data

Yang Yang, Carol A. Wise, Derek Gordon, Stephen J. Finch

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

The purpose of this work is the development of a family-based association test that allows for random genotyping errors and missing data and makes use of information on affected and unaffected pedigree members. We derive the conditional likelihood functions of the general nuclear family for the following scenarios: complete parental genotype data and no genotyping errors; only one genotyped parent and no genotyping errors; no parental genotype data and no genotyping errors; and no parental genotype data with genotyping errors. We find maximum likelihood estimates of the marker locus parameters, including the penetrances and population genotype frequencies under the null hypothesis that all penetrance values are equal and under the alternative hypothesis. We then compute the likelihood ratio test. We perform simulations to assess the adequacy of the central chi-square distribution approximation when the null hypothesis is true. We also perform simulations to compare the power of the TDT and this likelihood-based method. Finally, we apply our method to 23 SNPs genotyped in nuclear families from a recently published study of idiopathic scoliosis (IS). Our simulations suggest that this likelihood ratio test statistic follows a central chi-square distribution with 1 degree of freedom under the null hypothesis, even in the presence of missing data and genotyping errors. The power comparison shows that this likelihood ratio test is more powerful than the original TDT for the simulations considered. For the IS data, the marker rs7843033 shows the most significant evidence for our method (p = 0.0003), which is consistent with a previous report, which found rs7843033 to be the 2nd most significant TDTae p value among a set of 23 SNPs.

Original languageEnglish (US)
Pages (from-to)99-110
Number of pages12
JournalHuman Heredity
Volume66
Issue number2
DOIs
StatePublished - Mar 2008

Fingerprint

Pedigree
Genotype
Likelihood Functions
Penetrance
Scoliosis
Chi-Square Distribution
Nuclear Family
Single Nucleotide Polymorphism
Population
Power (Psychology)

Keywords

  • Family based association study
  • Genetic association studies
  • Missing data
  • TDT-like tests

ASJC Scopus subject areas

  • Genetics(clinical)

Cite this

A family-based likelihood ratio test for general pedigree structures that allows for genotyping error and missing data. / Yang, Yang; Wise, Carol A.; Gordon, Derek; Finch, Stephen J.

In: Human Heredity, Vol. 66, No. 2, 03.2008, p. 99-110.

Research output: Contribution to journalArticle

@article{45cbbd6e7c5540c9ac77118d519e8ac2,
title = "A family-based likelihood ratio test for general pedigree structures that allows for genotyping error and missing data",
abstract = "The purpose of this work is the development of a family-based association test that allows for random genotyping errors and missing data and makes use of information on affected and unaffected pedigree members. We derive the conditional likelihood functions of the general nuclear family for the following scenarios: complete parental genotype data and no genotyping errors; only one genotyped parent and no genotyping errors; no parental genotype data and no genotyping errors; and no parental genotype data with genotyping errors. We find maximum likelihood estimates of the marker locus parameters, including the penetrances and population genotype frequencies under the null hypothesis that all penetrance values are equal and under the alternative hypothesis. We then compute the likelihood ratio test. We perform simulations to assess the adequacy of the central chi-square distribution approximation when the null hypothesis is true. We also perform simulations to compare the power of the TDT and this likelihood-based method. Finally, we apply our method to 23 SNPs genotyped in nuclear families from a recently published study of idiopathic scoliosis (IS). Our simulations suggest that this likelihood ratio test statistic follows a central chi-square distribution with 1 degree of freedom under the null hypothesis, even in the presence of missing data and genotyping errors. The power comparison shows that this likelihood ratio test is more powerful than the original TDT for the simulations considered. For the IS data, the marker rs7843033 shows the most significant evidence for our method (p = 0.0003), which is consistent with a previous report, which found rs7843033 to be the 2nd most significant TDTae p value among a set of 23 SNPs.",
keywords = "Family based association study, Genetic association studies, Missing data, TDT-like tests",
author = "Yang Yang and Wise, {Carol A.} and Derek Gordon and Finch, {Stephen J.}",
year = "2008",
month = "3",
doi = "10.1159/000119109",
language = "English (US)",
volume = "66",
pages = "99--110",
journal = "Human Heredity",
issn = "0001-5652",
publisher = "S. Karger AG",
number = "2",

}

TY - JOUR

T1 - A family-based likelihood ratio test for general pedigree structures that allows for genotyping error and missing data

AU - Yang, Yang

AU - Wise, Carol A.

AU - Gordon, Derek

AU - Finch, Stephen J.

PY - 2008/3

Y1 - 2008/3

N2 - The purpose of this work is the development of a family-based association test that allows for random genotyping errors and missing data and makes use of information on affected and unaffected pedigree members. We derive the conditional likelihood functions of the general nuclear family for the following scenarios: complete parental genotype data and no genotyping errors; only one genotyped parent and no genotyping errors; no parental genotype data and no genotyping errors; and no parental genotype data with genotyping errors. We find maximum likelihood estimates of the marker locus parameters, including the penetrances and population genotype frequencies under the null hypothesis that all penetrance values are equal and under the alternative hypothesis. We then compute the likelihood ratio test. We perform simulations to assess the adequacy of the central chi-square distribution approximation when the null hypothesis is true. We also perform simulations to compare the power of the TDT and this likelihood-based method. Finally, we apply our method to 23 SNPs genotyped in nuclear families from a recently published study of idiopathic scoliosis (IS). Our simulations suggest that this likelihood ratio test statistic follows a central chi-square distribution with 1 degree of freedom under the null hypothesis, even in the presence of missing data and genotyping errors. The power comparison shows that this likelihood ratio test is more powerful than the original TDT for the simulations considered. For the IS data, the marker rs7843033 shows the most significant evidence for our method (p = 0.0003), which is consistent with a previous report, which found rs7843033 to be the 2nd most significant TDTae p value among a set of 23 SNPs.

AB - The purpose of this work is the development of a family-based association test that allows for random genotyping errors and missing data and makes use of information on affected and unaffected pedigree members. We derive the conditional likelihood functions of the general nuclear family for the following scenarios: complete parental genotype data and no genotyping errors; only one genotyped parent and no genotyping errors; no parental genotype data and no genotyping errors; and no parental genotype data with genotyping errors. We find maximum likelihood estimates of the marker locus parameters, including the penetrances and population genotype frequencies under the null hypothesis that all penetrance values are equal and under the alternative hypothesis. We then compute the likelihood ratio test. We perform simulations to assess the adequacy of the central chi-square distribution approximation when the null hypothesis is true. We also perform simulations to compare the power of the TDT and this likelihood-based method. Finally, we apply our method to 23 SNPs genotyped in nuclear families from a recently published study of idiopathic scoliosis (IS). Our simulations suggest that this likelihood ratio test statistic follows a central chi-square distribution with 1 degree of freedom under the null hypothesis, even in the presence of missing data and genotyping errors. The power comparison shows that this likelihood ratio test is more powerful than the original TDT for the simulations considered. For the IS data, the marker rs7843033 shows the most significant evidence for our method (p = 0.0003), which is consistent with a previous report, which found rs7843033 to be the 2nd most significant TDTae p value among a set of 23 SNPs.

KW - Family based association study

KW - Genetic association studies

KW - Missing data

KW - TDT-like tests

UR - http://www.scopus.com/inward/record.url?scp=41649115381&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=41649115381&partnerID=8YFLogxK

U2 - 10.1159/000119109

DO - 10.1159/000119109

M3 - Article

C2 - 18382089

AN - SCOPUS:41649115381

VL - 66

SP - 99

EP - 110

JO - Human Heredity

JF - Human Heredity

SN - 0001-5652

IS - 2

ER -