Home > Research > Publications & Outputs > Mining open datasets for transparency in taxi t...

Electronic data

  • s13688-015-0060-2

    Rights statement: © 2015 Noulas et al. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

    Final published version, 3.6 MB, PDF document

    Available under license: CC BY: Creative Commons Attribution 4.0 International License

Links

Text available via DOI:

View graph of relations

Mining open datasets for transparency in taxi transport in metropolitan environments

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Mining open datasets for transparency in taxi transport in metropolitan environments. / Noulas, Anastasios; Salnikov, Vsevolod; Lambiotte, Renaud et al.
In: EPJ Data Science, Vol. 4, 23, 10.12.2015.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Noulas A, Salnikov V, Lambiotte R, Mascolo C. Mining open datasets for transparency in taxi transport in metropolitan environments. EPJ Data Science. 2015 Dec 10;4:23. doi: 10.1140/epjds/s13688-015-0060-2

Author

Noulas, Anastasios ; Salnikov, Vsevolod ; Lambiotte, Renaud et al. / Mining open datasets for transparency in taxi transport in metropolitan environments. In: EPJ Data Science. 2015 ; Vol. 4.

Bibtex

@article{df63f1a730bf4d5da942fd84a809f60c,
title = "Mining open datasets for transparency in taxi transport in metropolitan environments",
abstract = "Uber has recently been introducing novel practices in urban taxi transport. Journey prices can change dynamically in almost real time and also vary geographically from one area to another in a city, a strategy known as surge pricing. In this paper, we explore the power of the new generation of open datasets towards understanding the impact of the new disruption technologies that emerge in the area of public transport. With our primary goal being a more transparent economic landscape for urban commuters, we provide a direct price comparison between Uber and the Yellow Cab company in New York. We discover that Uber, despite its lower standard pricing rates, effectively charges higher fares on average, especially during short in length, but frequent in occurrence, taxi journeys. Building on this insight, we develop a smartphone application, OpenStreetCab, that offers a personalized consultation to mobile users on which taxi provider is cheaper for their journey. Almost five months after its launch, the app has attracted more than three thousand users in a single city. Their journey queries have provided additional insights on the potential savings similar technologies can have for urban commuters, with a highlight being that on average, a user in New York saves 6 U.S. Dollars per taxi journey if they pick the cheapest taxi provider. We run extensive experiments to show how Uber{\textquoteright}s surge pricing is the driving factor of higher journey prices and therefore higher potential savings for our application{\textquoteright}s users. Finally, motivated by the observation that Uber{\textquoteright}s surge pricing is occurring more frequently that intuitively expected, we formulate a prediction task where the aim becomes to predict a geographic area{\textquoteright}s tendency to surge. Using exogenous to Uber data, in particular Yellow Cab and Foursquare data, we show how it is possible to estimate customer demand within an area, and by extension surge pricing, with high accuracy.",
author = "Anastasios Noulas and Vsevolod Salnikov and Renaud Lambiotte and Cecilia Mascolo",
note = "{\textcopyright} 2015 Noulas et al. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.",
year = "2015",
month = dec,
day = "10",
doi = "10.1140/epjds/s13688-015-0060-2",
language = "English",
volume = "4",
journal = "EPJ Data Science",
publisher = "Springer Science + Business Media",

}

RIS

TY - JOUR

T1 - Mining open datasets for transparency in taxi transport in metropolitan environments

AU - Noulas, Anastasios

AU - Salnikov, Vsevolod

AU - Lambiotte, Renaud

AU - Mascolo, Cecilia

N1 - © 2015 Noulas et al. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

PY - 2015/12/10

Y1 - 2015/12/10

N2 - Uber has recently been introducing novel practices in urban taxi transport. Journey prices can change dynamically in almost real time and also vary geographically from one area to another in a city, a strategy known as surge pricing. In this paper, we explore the power of the new generation of open datasets towards understanding the impact of the new disruption technologies that emerge in the area of public transport. With our primary goal being a more transparent economic landscape for urban commuters, we provide a direct price comparison between Uber and the Yellow Cab company in New York. We discover that Uber, despite its lower standard pricing rates, effectively charges higher fares on average, especially during short in length, but frequent in occurrence, taxi journeys. Building on this insight, we develop a smartphone application, OpenStreetCab, that offers a personalized consultation to mobile users on which taxi provider is cheaper for their journey. Almost five months after its launch, the app has attracted more than three thousand users in a single city. Their journey queries have provided additional insights on the potential savings similar technologies can have for urban commuters, with a highlight being that on average, a user in New York saves 6 U.S. Dollars per taxi journey if they pick the cheapest taxi provider. We run extensive experiments to show how Uber’s surge pricing is the driving factor of higher journey prices and therefore higher potential savings for our application’s users. Finally, motivated by the observation that Uber’s surge pricing is occurring more frequently that intuitively expected, we formulate a prediction task where the aim becomes to predict a geographic area’s tendency to surge. Using exogenous to Uber data, in particular Yellow Cab and Foursquare data, we show how it is possible to estimate customer demand within an area, and by extension surge pricing, with high accuracy.

AB - Uber has recently been introducing novel practices in urban taxi transport. Journey prices can change dynamically in almost real time and also vary geographically from one area to another in a city, a strategy known as surge pricing. In this paper, we explore the power of the new generation of open datasets towards understanding the impact of the new disruption technologies that emerge in the area of public transport. With our primary goal being a more transparent economic landscape for urban commuters, we provide a direct price comparison between Uber and the Yellow Cab company in New York. We discover that Uber, despite its lower standard pricing rates, effectively charges higher fares on average, especially during short in length, but frequent in occurrence, taxi journeys. Building on this insight, we develop a smartphone application, OpenStreetCab, that offers a personalized consultation to mobile users on which taxi provider is cheaper for their journey. Almost five months after its launch, the app has attracted more than three thousand users in a single city. Their journey queries have provided additional insights on the potential savings similar technologies can have for urban commuters, with a highlight being that on average, a user in New York saves 6 U.S. Dollars per taxi journey if they pick the cheapest taxi provider. We run extensive experiments to show how Uber’s surge pricing is the driving factor of higher journey prices and therefore higher potential savings for our application’s users. Finally, motivated by the observation that Uber’s surge pricing is occurring more frequently that intuitively expected, we formulate a prediction task where the aim becomes to predict a geographic area’s tendency to surge. Using exogenous to Uber data, in particular Yellow Cab and Foursquare data, we show how it is possible to estimate customer demand within an area, and by extension surge pricing, with high accuracy.

U2 - 10.1140/epjds/s13688-015-0060-2

DO - 10.1140/epjds/s13688-015-0060-2

M3 - Journal article

VL - 4

JO - EPJ Data Science

JF - EPJ Data Science

M1 - 23

ER -