Malte Bonart
October 16, 2019
This work and the underlying source code is available on GitHub.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Day Pass/ Three Day Pass | Annual Membership | |
---|---|---|
Customer (11%) | Subscriber (89%) | |
12.00$ / day | 169.00$ / year | |
max 30 min | max 45 min | |
4.00$ / 15 min | 2.50$ / 15 min |
Trips with a duration > 2 hours and < 20 seconds have been removed from the analysis (~0.3%). Customer ride on average 1441, Subscribers 721 seconds.
6% of all trips from customers have an age value of 49.
0:Sunday - 6:Saturday.
The baseline was constructed by classifying all trips with unknown
gender as customers
. Nearest neighbour classification is based on time
, start location
, end location
and tripduration
.
features | dimensions | logistic regression (f-score) |
---|---|---|
tripduration | 1 | 0.15 |
+ gender | 3 | 0.70 |
+ age | 5 | 0.71 |
+ time | 45 | 0.71 |
+ area | 173 | 0.72 |
The training is based on a random sample of n=5000000 trips, due to resource and time constraints.
biking vs. | driving | driving (traffic) | transit |
---|---|---|---|
Average differences | -107 | -105 | 209 |
Median differences | 2 | -9 | 263 |
Biking faster | 50% | 48% | 83% |
Based on a n=2000 random sample of trips, collected with the GoogleMaps Directions API. Wilcoxon signed-rank test and t-test for pairs are both significant. biking faster | biking slower
driver inattenion
failure to yield right of way
confusion of bicyclist
traffic control disregarded
passing or lane usage improber
Top reasons for NYC motor vehicle collisions where at least one biker was injured.
Source: http://tiny.cc/af3kez