Home > Research > Publications & Outputs > Spatial Analysis Made Easy with Linear Regressi...

Electronic data

  • 1902.08679

    Accepted author manuscript, 1.58 MB, PDF document

    Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Links

Text available via DOI:

View graph of relations

Spatial Analysis Made Easy with Linear Regression and Kernels

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
Close
Article number100362
<mark>Journal publication date</mark>31/12/2019
<mark>Journal</mark>Epidemics
Volume29
Number of pages12
Publication StatusPublished
Early online date21/08/19
<mark>Original language</mark>English

Abstract

Kernel methods are a popular technique for extending linear models to handle non-linear spatial problems via a mapping to an implicit, high-dimensional feature space. While kernel methods are computationally cheaper than an explicit feature mapping, they are still subject to cubic cost on the number of points. Given only a few thousand locations, this computational cost rapidly outstrips the currently available computational power. This paper aims to provide an overview of kernel methods from first-principals (with a focus on ridge regression) and progress to a review of random Fourier features (RFF), a method that enables the scaling of kernel methods to big datasets. We show how the RFF method is capable of approximating the full kernel matrix, providing a significant computational speed-up for a negligible cost to accuracy and can be incorporated into many existing spatial methods using only a few lines of code. We give an example of the implementation of RFFs on a simulated spatial data set to illustrate these properties. Lastly, we summarise the main issues with RFFs and highlight some of the advanced techniques aimed at alleviating them. At each stage, the associated R code is provided.