Home > Research > Publications & Outputs > The Forgotten Semantics of Regression Modeling ...

Links

Text available via DOI:

View graph of relations

The Forgotten Semantics of Regression Modeling in Geography

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

The Forgotten Semantics of Regression Modeling in Geography. / Comber, A.J.; Harris, P.; Lü, Y. et al.
In: Geographical Analysis, Vol. 53, No. 1, 01.01.2021, p. 113-134.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Comber, AJ, Harris, P, Lü, Y, Wu, L & Atkinson, PM 2021, 'The Forgotten Semantics of Regression Modeling in Geography', Geographical Analysis, vol. 53, no. 1, pp. 113-134. https://doi.org/10.1111/gean.12199

APA

Comber, A. J., Harris, P., Lü, Y., Wu, L., & Atkinson, P. M. (2021). The Forgotten Semantics of Regression Modeling in Geography. Geographical Analysis, 53(1), 113-134. https://doi.org/10.1111/gean.12199

Vancouver

Comber AJ, Harris P, Lü Y, Wu L, Atkinson PM. The Forgotten Semantics of Regression Modeling in Geography. Geographical Analysis. 2021 Jan 1;53(1):113-134. Epub 2019 May 29. doi: 10.1111/gean.12199

Author

Comber, A.J. ; Harris, P. ; Lü, Y. et al. / The Forgotten Semantics of Regression Modeling in Geography. In: Geographical Analysis. 2021 ; Vol. 53, No. 1. pp. 113-134.

Bibtex

@article{95b11f77ef2c4a9c969a28b075f83032,
title = "The Forgotten Semantics of Regression Modeling in Geography",
abstract = "This article is concerned with the semantics associated with the statistical analysis of spatial data. It takes the simplest case of the prediction of variable y as a function of covariate(s) x, in which predicted y is always an approximation of y and only ever a function of x, thus, inheriting many of the spatial characteristics of x, and illustrates several core issues using “synthetic” remote sensing and “real” soils case studies. The outputs of regression models and, therefore, the meaning of predicted y, are shown to vary due to (1) choices about data: the specification of x (which covariates to include), the support of x (measurement scales and granularity), the measurement of x and the error of x, and (2) choices about the model including its functional form and the method of model identification. Some of these issues are more widely recognized than others. Thus, the study provides definition to the multiple ways in which regression prediction and inference are affected by data and model choices. The article invites researchers to pause and consider the semantic meaning of predicted y, which is often nothing more than a scaled version of covariate(s) x, and argues that it is na{\"i}ve to ignore this.",
author = "A.J. Comber and P. Harris and Y. L{\"u} and L. Wu and P.M. Atkinson",
year = "2021",
month = jan,
day = "1",
doi = "10.1111/gean.12199",
language = "English",
volume = "53",
pages = "113--134",
journal = "Geographical Analysis",
issn = "0016-7363",
publisher = "WILEY-BLACKWELL PUBLISHING, INC",
number = "1",

}

RIS

TY - JOUR

T1 - The Forgotten Semantics of Regression Modeling in Geography

AU - Comber, A.J.

AU - Harris, P.

AU - Lü, Y.

AU - Wu, L.

AU - Atkinson, P.M.

PY - 2021/1/1

Y1 - 2021/1/1

N2 - This article is concerned with the semantics associated with the statistical analysis of spatial data. It takes the simplest case of the prediction of variable y as a function of covariate(s) x, in which predicted y is always an approximation of y and only ever a function of x, thus, inheriting many of the spatial characteristics of x, and illustrates several core issues using “synthetic” remote sensing and “real” soils case studies. The outputs of regression models and, therefore, the meaning of predicted y, are shown to vary due to (1) choices about data: the specification of x (which covariates to include), the support of x (measurement scales and granularity), the measurement of x and the error of x, and (2) choices about the model including its functional form and the method of model identification. Some of these issues are more widely recognized than others. Thus, the study provides definition to the multiple ways in which regression prediction and inference are affected by data and model choices. The article invites researchers to pause and consider the semantic meaning of predicted y, which is often nothing more than a scaled version of covariate(s) x, and argues that it is naïve to ignore this.

AB - This article is concerned with the semantics associated with the statistical analysis of spatial data. It takes the simplest case of the prediction of variable y as a function of covariate(s) x, in which predicted y is always an approximation of y and only ever a function of x, thus, inheriting many of the spatial characteristics of x, and illustrates several core issues using “synthetic” remote sensing and “real” soils case studies. The outputs of regression models and, therefore, the meaning of predicted y, are shown to vary due to (1) choices about data: the specification of x (which covariates to include), the support of x (measurement scales and granularity), the measurement of x and the error of x, and (2) choices about the model including its functional form and the method of model identification. Some of these issues are more widely recognized than others. Thus, the study provides definition to the multiple ways in which regression prediction and inference are affected by data and model choices. The article invites researchers to pause and consider the semantic meaning of predicted y, which is often nothing more than a scaled version of covariate(s) x, and argues that it is naïve to ignore this.

U2 - 10.1111/gean.12199

DO - 10.1111/gean.12199

M3 - Journal article

VL - 53

SP - 113

EP - 134

JO - Geographical Analysis

JF - Geographical Analysis

SN - 0016-7363

IS - 1

ER -