Home > Research > Publications & Outputs > Integrated species distribution models

Links

Text available via DOI:

View graph of relations

Integrated species distribution models: A comparison of approaches under different data quality scenarios

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Integrated species distribution models: A comparison of approaches under different data quality scenarios. / Suhaimi, Siti Sarah Ahmad; Blair, Gordon; Jarvis, S.G.
In: Diversity and Distributions, Vol. 27, No. 6, 30.06.2021, p. 1066-1075.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Suhaimi SSA, Blair G, Jarvis SG. Integrated species distribution models: A comparison of approaches under different data quality scenarios. Diversity and Distributions. 2021 Jun 30;27(6):1066-1075. Epub 2021 Mar 1. doi: 10.1111/ddi.13255

Author

Suhaimi, Siti Sarah Ahmad ; Blair, Gordon ; Jarvis, S.G. / Integrated species distribution models : A comparison of approaches under different data quality scenarios. In: Diversity and Distributions. 2021 ; Vol. 27, No. 6. pp. 1066-1075.

Bibtex

@article{0c901d68c349430b9058ade5c2feb333,
title = "Integrated species distribution models: A comparison of approaches under different data quality scenarios",
abstract = "AimIntegrated species distribution modelling has emerged as a useful tool for ecologists to exploit the range of information available on where species occur. In particular, the ability to combine large numbers of ad hoc or presence‐only (PO) records with more structured presence–absence (PA) data can allow ecologists to account for biases in PO data which often confound modelling efforts. A range of modelling techniques have been suggested to implement integrated species distribution models (IDMs) including joint likelihood models, including one dataset as a covariate or informative prior, and fitting a correlation structure between datasets. We aim to investigate the performance of different types of integrated models under realistic ecological data scenarios.InnovationWe use a virtual ecologist approach to investigate which integrated model is most advantageous under varying levels of spatial bias in PO data, sample size of PA data and spatial overlap between datasets.Main conclusionsJoint likelihood models were the best performing models when spatial bias in PO data was low, or could be modelled, but gave poor estimates when there were unknown biases in the data. Correlation models provided good model estimates even when there were unknown biases and when good quality PA data were spatially limited. Including PO data via an informative prior provided little improvement over modelling PA data alone and was inferior to using either the joint likelihood or correlation approach. Our results suggest that correlation models provide a robust alternative to joint likelihood models when covariates related to effort or detection in PO data are not available. Ecologists should be aware of the limitations of each approach and consider how well biases in the data can be modelled when deciding which type of IDM to use.",
keywords = "citizen science, distribution, informative prior, integrated model, joint likelihood, presence–absence, presence‐only, simulation",
author = "Suhaimi, {Siti Sarah Ahmad} and Gordon Blair and S.G. Jarvis",
year = "2021",
month = jun,
day = "30",
doi = "10.1111/ddi.13255",
language = "English",
volume = "27",
pages = "1066--1075",
journal = "Diversity and Distributions",
issn = "1366-9516",
publisher = "Blackwell Publishing Ltd",
number = "6",

}

RIS

TY - JOUR

T1 - Integrated species distribution models

T2 - A comparison of approaches under different data quality scenarios

AU - Suhaimi, Siti Sarah Ahmad

AU - Blair, Gordon

AU - Jarvis, S.G.

PY - 2021/6/30

Y1 - 2021/6/30

N2 - AimIntegrated species distribution modelling has emerged as a useful tool for ecologists to exploit the range of information available on where species occur. In particular, the ability to combine large numbers of ad hoc or presence‐only (PO) records with more structured presence–absence (PA) data can allow ecologists to account for biases in PO data which often confound modelling efforts. A range of modelling techniques have been suggested to implement integrated species distribution models (IDMs) including joint likelihood models, including one dataset as a covariate or informative prior, and fitting a correlation structure between datasets. We aim to investigate the performance of different types of integrated models under realistic ecological data scenarios.InnovationWe use a virtual ecologist approach to investigate which integrated model is most advantageous under varying levels of spatial bias in PO data, sample size of PA data and spatial overlap between datasets.Main conclusionsJoint likelihood models were the best performing models when spatial bias in PO data was low, or could be modelled, but gave poor estimates when there were unknown biases in the data. Correlation models provided good model estimates even when there were unknown biases and when good quality PA data were spatially limited. Including PO data via an informative prior provided little improvement over modelling PA data alone and was inferior to using either the joint likelihood or correlation approach. Our results suggest that correlation models provide a robust alternative to joint likelihood models when covariates related to effort or detection in PO data are not available. Ecologists should be aware of the limitations of each approach and consider how well biases in the data can be modelled when deciding which type of IDM to use.

AB - AimIntegrated species distribution modelling has emerged as a useful tool for ecologists to exploit the range of information available on where species occur. In particular, the ability to combine large numbers of ad hoc or presence‐only (PO) records with more structured presence–absence (PA) data can allow ecologists to account for biases in PO data which often confound modelling efforts. A range of modelling techniques have been suggested to implement integrated species distribution models (IDMs) including joint likelihood models, including one dataset as a covariate or informative prior, and fitting a correlation structure between datasets. We aim to investigate the performance of different types of integrated models under realistic ecological data scenarios.InnovationWe use a virtual ecologist approach to investigate which integrated model is most advantageous under varying levels of spatial bias in PO data, sample size of PA data and spatial overlap between datasets.Main conclusionsJoint likelihood models were the best performing models when spatial bias in PO data was low, or could be modelled, but gave poor estimates when there were unknown biases in the data. Correlation models provided good model estimates even when there were unknown biases and when good quality PA data were spatially limited. Including PO data via an informative prior provided little improvement over modelling PA data alone and was inferior to using either the joint likelihood or correlation approach. Our results suggest that correlation models provide a robust alternative to joint likelihood models when covariates related to effort or detection in PO data are not available. Ecologists should be aware of the limitations of each approach and consider how well biases in the data can be modelled when deciding which type of IDM to use.

KW - citizen science

KW - distribution

KW - informative prior

KW - integrated model

KW - joint likelihood

KW - presence–absence

KW - presence‐only

KW - simulation

U2 - 10.1111/ddi.13255

DO - 10.1111/ddi.13255

M3 - Journal article

VL - 27

SP - 1066

EP - 1075

JO - Diversity and Distributions

JF - Diversity and Distributions

SN - 1366-9516

IS - 6

ER -