Assessing models for genetic prediction of complex traits - Research Portal

Home > Research > Publications & Outputs > Assessing models for genetic prediction of comp...

Data Science Institute

Associated organisational unit

DSI - Health

Electronic data

art%3A10.1186%2Fs12864-015-1616-z
Rights statement: © 2015 Gagliano et al.; licensee BioMed Central. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Final published version, 807 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.1186/s12864-015-1616-z
Final published version

Keywords

Algorithms, Area Under Curve, Computer Simulation, Databases, Genetic, Genome, Human, Humans, Models, Genetic, Quantitative Trait Loci, ROC Curve, Risk

View graph of relations

Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Sarah A. Gagliano
Andrew D. Paterson
Michael E. Weale
Jo Knight

More...

Article number	405
<mark>Journal publication date</mark>	22/05/2015
<mark>Journal</mark>	BMC Genomics
Volume	16
Number of pages	11
Publication Status	Published
<mark>Original language</mark>	English

Abstract

BACKGROUND: In silico models have recently been created in order to predict which genetic variants are more likely to contribute to the risk of a complex trait given their functional characteristics. However, there has been no comprehensive review as to which type of predictive accuracy measures and data visualization techniques are most useful for assessing these models.

METHODS: We assessed the performance of the models for predicting risk using various methodologies, some of which include: receiver operating characteristic (ROC) curves, histograms of classification probability, and the novel use of the quantile-quantile plot. These measures have variable interpretability depending on factors such as whether the dataset is balanced in terms of numbers of genetic variants classified as risk variants versus those that are not.

RESULTS: We conclude that the area under the curve (AUC) is a suitable starting place, and for models with similar AUCs, violin plots are particularly useful for examining the distribution of the risk scores.

Bibliographic note

© 2015 Gagliano et al.; licensee BioMed Central. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Research

Associated organisational unit

Electronic data

Links

Text available via DOI:

Keywords

Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods

Abstract

Bibliographic note

Quick Links

Connect With Us

Faculties & Depts

Contact Us