Home > Research > Publications & Outputs > GeoMatch

Electronic data

  • GeoMatch_IEEE_BigData_preprint

    Rights statement: © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Accepted author manuscript, 400 KB, PDF document

Links

Text available via DOI:

View graph of relations

GeoMatch: Efficient Large-Scale Map Matching on Apache Spark

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published
Close
Publication date24/01/2019
Host publication2018 IEEE International Conference on Big Data (Big Data)
PublisherIEEE
Pages384-391
Number of pages8
ISBN (electronic)9781538650356
<mark>Original language</mark>English

Abstract

We contribute by developing GeoMatch as a novel, scalable, and efficient big-data pipeline for large-scale map matching on Apache Spark. GeoMatch improves existing spatial big data solutions by utilizing a novel spatial partitioning scheme inspired by Hilbert space-filling curves. Thanks to the partitioning scheme, GeoMatch can effectively balance operations across different processing units and achieve significant performance gains. We demonstrate the effectiveness of GeoMatch through rigorous and extensive benchmarks that consider data sets containing large-scale urban spatial data sets ranging from 166, 253 to 3.78 billion location measurements. Our results show over 17-fold performance improvements compared to previous works while achieving better processing accuracy than current solutions (97.48%).