A pragmatic suggestion for dealing with results for candidate genes obtained from genome wide association studies

Data Science Institute

Electronic data

art:10.1186/1471-2156-8-20
Rights statement: © 2007 Curtis et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Final published version, 168 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.1186/1471-2156-8-20
Final published version

Keywords

Bayes Theorem, False Positive Reactions, Genetic Markers, Genetic Predisposition to Disease, Genome, Human, Humans, Likelihood Functions

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

David Curtis
Anna E. Vine
Jo Knight

Article number	20
Journal publication date	10/05/2007
Journal	BMC Genetics
Volume	8
Number of pages	6
Publication Status	Published
Original language	English

Abstract

BACKGROUND: Researchers may embark on a genome-wide association study before fully investigating candidate regions which have been reported to produce evidence suggesting that they harbour susceptibility loci. If the genome-wide study had not been carried out, then results from candidate regions which demonstrated only modest statistical significance would be judged to be of interest and would stimulate further investigation. However, if hundreds of thousands of markers are typed, then inevitably very large numbers of such results will occur by chance, and those from candidate regions may attract no special attention.

RESULTS: An approach is proposed in which differential treatment is afforded to markers from candidate regions and from those that are routinely typed in the context of a genome wide scan. Different prior probabilities are assigned to the two types of marker. A likelihood ratio is derived from the reported p value for each marker, calculated as LR = e^(chiinv(1,p)/2), and the posterior odds in favour of a true positive association are obtained. These odds can be used to rank the markers with a view to suggesting the regions in which further genotyping is indicated. We suggest that prior probabilities be specified such that a candidate marker significant at p = 0.01 and a routine marker significant at p = 0.00001 will yield similar values for the posterior odds. We show that this can be achieved by setting a value for prior probability of association to 0.1 for candidate markers and to 0.00018 for routine markers.
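
The conversion from p value to posterior odds described in the RESULTS paragraph can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' software. It assumes that the likelihood ratio is read as e raised to the power chiinv(1,p)/2, that chiinv(1,p) denotes the chi-squared value with 1 degree of freedom whose upper-tail probability is p, and that posterior odds are prior odds multiplied by the likelihood ratio. The function names are hypothetical.

```python
import math
from statistics import NormalDist


def chi2_1df_upper_quantile(p):
    """Chi-squared value (1 df) whose upper-tail probability is p.

    Assumption: this is what the abstract's chiinv(1,p) denotes. With 1 df
    the statistic is a squared standard normal, so the quantile is the
    square of the two-sided normal quantile.
    """
    z = NormalDist().inv_cdf(p / 2.0)  # lower-tail quantile, negative
    return z * z


def likelihood_ratio(p):
    """LR = exp(chiinv(1,p) / 2), under the reading stated above."""
    return math.exp(chi2_1df_upper_quantile(p) / 2.0)


def posterior_odds(p, prior_prob):
    """Posterior odds of true association = prior odds * LR (assumed form)."""
    prior_odds = prior_prob / (1.0 - prior_prob)
    return prior_odds * likelihood_ratio(p)


# Priors and p values taken from the abstract.
candidate = posterior_odds(0.01, 0.1)         # candidate marker
routine = posterior_odds(0.00001, 0.00018)    # routine genome-wide marker

print(f"candidate marker posterior odds: {candidate:.2f}")
print(f"routine marker posterior odds:   {routine:.2f}")
```

Under these assumptions the two posterior odds come out close to each other, at roughly 3, which is consistent with the abstract's statement that the suggested priors yield similar values. Ranking a set of markers then amounts to sorting them by the value returned from `posterior_odds`.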

CONCLUSION: It is essential that formal procedures be adopted in order to avoid modestly positive results from candidate regions being swamped by the huge number of nominally significant results which will be obtained when very many markers are genotyped. Software to carry out the conversion from p values to posterior odds is available from http://www.mds.qmul.ac.uk/statgen/grpsoft.html.
