Final published version, 1.28 MB, PDF document
Available under license: CC BY
Research output: Contribution to Journal/Magazine › Journal article › peer-review
TY - JOUR
T1 - Visualizing patterns in spatially ambiguous point data
AU - Huck, Jonathan
AU - Whyatt, Duncan
AU - Coulton, Paul
PY - 2015/4
Y1 - 2015/4
AB - As technologies permitting both the creation and retrieval of data containing spatial information continue to develop, so does the number of visualisations using such data. This spatial information will often comprise a place-name that may be ‘geocoded’ into coordinates and displayed on a map, frequently using a ‘heatmap-style’ visualisation to reveal patterns in the data. Across a dataset, however, there is often ambiguity in the geographic scale to which a place-name refers (country, county, town, street etc.), and attempts to simultaneously map data at a multitude of different scales will result in the formation of ‘false hotspots’ within the map. These form at the centres of administrative areas (countries, counties, towns etc.) and introduce erroneous patterns into the dataset whilst obscuring real ones, resulting in misleading visualisations of the patterns in the dataset. This paper therefore proposes a new algorithm to intelligently redistribute data that would otherwise contribute to these ‘false hotspots’, moving them to locations that likely reflect real-world patterns at a homogeneous scale, and so allowing more representative visualisations to be created without the negative effects of ‘false hotspots’ resulting from multi-scale data. This technique is demonstrated on a sample dataset taken from Twitter, and validated against the ‘geotagged’ portion of the same dataset.
U2 - 10.5311/JOSIS.2015.10.211
DO - 10.5311/JOSIS.2015.10.211
M3 - Journal article
SP - 47
EP - 66
JO - Journal of Spatial Information Science
JF - Journal of Spatial Information Science
IS - 10
ER -