Home > Research > Publications & Outputs > CLUMPHAP

Links

Text available via DOI:

View graph of relations

CLUMPHAP: a simple tool for performing haplotype-based association analysis

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

CLUMPHAP: a simple tool for performing haplotype-based association analysis. / Knight, Jo; Curtis, David; Sham, Pak C.
In: Genetic Epidemiology, Vol. 32, No. 6, 09.2008, p. 539-545.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Knight, J, Curtis, D & Sham, PC 2008, 'CLUMPHAP: a simple tool for performing haplotype-based association analysis', Genetic Epidemiology, vol. 32, no. 6, pp. 539-545. https://doi.org/10.1002/gepi.20327

APA

Vancouver

Knight J, Curtis D, Sham PC. CLUMPHAP: a simple tool for performing haplotype-based association analysis. Genetic Epidemiology. 2008 Sept;32(6):539-545. Epub 2008 Apr 7. doi: 10.1002/gepi.20327

Author

Knight, Jo ; Curtis, David ; Sham, Pak C. / CLUMPHAP : a simple tool for performing haplotype-based association analysis. In: Genetic Epidemiology. 2008 ; Vol. 32, No. 6. pp. 539-545.

Bibtex

@article{14d044c268d948ab89cdb96a86456a05,
title = "CLUMPHAP: a simple tool for performing haplotype-based association analysis",
abstract = "The completion of the HapMap Project and the development of high-throughput single nucleotide polymorphism genotyping technologies have greatly enhanced the prospects of identifying and characterizing the genetic variants that influence complex traits. In principle, association analysis of haplotypes rather than single nucleotide polymorphisms may better capture an underlying causal variant, but the multiple haplotypes can lead to reduced statistical power due to the testing of (and need to correct for) a large number of haplotypes. This paper presents a novel method based on clustering similar haplotypes to address this issue. The method, implemented in the CLUMPHAP program, is an extension of the CLUMP program designed for the analysis of multi-allelic markers (Sham and Curtis [1995] Ann. Hum. Genet. 59(Pt1):97-105). CLUMPHAP performs a hierarchical clustering of the haplotypes and then computes the chi(2) statistic between each haplotype cluster and disease; the statistical significance of the largest of the chi(2) statistics is obtained by permutation testing. A significant result suggests that the presence of a disease-causing variant in the haplotype cluster is over-represented in cases. Using simulation studies, we have compared CLUMPHAP and more widely used approaches in terms of their statistical power to identify an untyped susceptibility locus. Our results show that CLUMPHAP tends to have greater power than the omnibus haplotype test and is comparable in power to multiple regression locus-coding approaches.",
keywords = "Cluster Analysis, Computer Simulation, Gene Frequency, Genetic Markers, Genetic Predisposition to Disease, Haplotypes, Humans, Logistic Models, Models, Genetic, Models, Statistical, Polymorphism, Single Nucleotide, Software",
author = "Jo Knight and David Curtis and Sham, {Pak C.}",
note = "(c) 2008 Wiley-Liss, Inc.",
year = "2008",
month = sep,
doi = "10.1002/gepi.20327",
language = "English",
volume = "32",
pages = "539--545",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "6",

}

RIS

TY - JOUR

T1 - CLUMPHAP

T2 - a simple tool for performing haplotype-based association analysis

AU - Knight, Jo

AU - Curtis, David

AU - Sham, Pak C.

N1 - (c) 2008 Wiley-Liss, Inc.

PY - 2008/9

Y1 - 2008/9

N2 - The completion of the HapMap Project and the development of high-throughput single nucleotide polymorphism genotyping technologies have greatly enhanced the prospects of identifying and characterizing the genetic variants that influence complex traits. In principle, association analysis of haplotypes rather than single nucleotide polymorphisms may better capture an underlying causal variant, but the multiple haplotypes can lead to reduced statistical power due to the testing of (and need to correct for) a large number of haplotypes. This paper presents a novel method based on clustering similar haplotypes to address this issue. The method, implemented in the CLUMPHAP program, is an extension of the CLUMP program designed for the analysis of multi-allelic markers (Sham and Curtis [1995] Ann. Hum. Genet. 59(Pt1):97-105). CLUMPHAP performs a hierarchical clustering of the haplotypes and then computes the chi(2) statistic between each haplotype cluster and disease; the statistical significance of the largest of the chi(2) statistics is obtained by permutation testing. A significant result suggests that the presence of a disease-causing variant in the haplotype cluster is over-represented in cases. Using simulation studies, we have compared CLUMPHAP and more widely used approaches in terms of their statistical power to identify an untyped susceptibility locus. Our results show that CLUMPHAP tends to have greater power than the omnibus haplotype test and is comparable in power to multiple regression locus-coding approaches.

AB - The completion of the HapMap Project and the development of high-throughput single nucleotide polymorphism genotyping technologies have greatly enhanced the prospects of identifying and characterizing the genetic variants that influence complex traits. In principle, association analysis of haplotypes rather than single nucleotide polymorphisms may better capture an underlying causal variant, but the multiple haplotypes can lead to reduced statistical power due to the testing of (and need to correct for) a large number of haplotypes. This paper presents a novel method based on clustering similar haplotypes to address this issue. The method, implemented in the CLUMPHAP program, is an extension of the CLUMP program designed for the analysis of multi-allelic markers (Sham and Curtis [1995] Ann. Hum. Genet. 59(Pt1):97-105). CLUMPHAP performs a hierarchical clustering of the haplotypes and then computes the chi(2) statistic between each haplotype cluster and disease; the statistical significance of the largest of the chi(2) statistics is obtained by permutation testing. A significant result suggests that the presence of a disease-causing variant in the haplotype cluster is over-represented in cases. Using simulation studies, we have compared CLUMPHAP and more widely used approaches in terms of their statistical power to identify an untyped susceptibility locus. Our results show that CLUMPHAP tends to have greater power than the omnibus haplotype test and is comparable in power to multiple regression locus-coding approaches.

KW - Cluster Analysis

KW - Computer Simulation

KW - Gene Frequency

KW - Genetic Markers

KW - Genetic Predisposition to Disease

KW - Haplotypes

KW - Humans

KW - Logistic Models

KW - Models, Genetic

KW - Models, Statistical

KW - Polymorphism, Single Nucleotide

KW - Software

U2 - 10.1002/gepi.20327

DO - 10.1002/gepi.20327

M3 - Journal article

C2 - 18395815

VL - 32

SP - 539

EP - 545

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - 6

ER -