Uber Pickups In New York City
Trip data for over 20 million Uber (and other for-hire vehicle) trips in NYC
@kaggle.fivethirtyeight_uber_pickups_in_new_york_city
Trip data for over 20 million Uber (and other for-hire vehicle) trips in NYC
@kaggle.fivethirtyeight_uber_pickups_in_new_york_city
This directory contains data on over 4.5 million Uber pickups in New York City from April to September 2014, and 14.3 million more Uber pickups from January to June 2015. Trip-level data on 10 other for-hire vehicle (FHV) companies, as well as aggregated data for 329 FHV companies, is also included. All the files are as they were received on August 3, Sept. 15 and Sept. 22, 2015.
FiveThirtyEight obtained the data from the NYC Taxi & Limousine Commission (TLC) by submitting a Freedom of Information Law request on July 20, 2015. The TLC has sent us the data in batches as it continues to review trip data Uber and other HFV companies have submitted to it. The TLC's correspondence with FiveThirtyEight is included in the files TLC_letter.pdf, TLC_letter2.pdf and TLC_letter3.pdf. TLC records requests can be made here.
This data was used for four FiveThirtyEight stories: Uber Is Serving New York’s Outer Boroughs More Than Taxis Are, Public Transit Should Be Uber’s New Best Friend, Uber Is Taking Millions Of Manhattan Rides Away From Taxis, and Is Uber Making NYC Rush-Hour Traffic Worse?.
The dataset contains, roughly, four groups of files:
There are six files of raw data on Uber pickups in New York City from April to September 2014. The files are separated by month and each has the following columns:
Date/Time : The date and time of the Uber pickupLat : The latitude of the Uber pickupLon : The longitude of the Uber pickupBase : The TLC base company code affiliated with the Uber pickupThese files are named:
uber-raw-data-apr14.csvuber-raw-data-aug14.csvuber-raw-data-jul14.csvuber-raw-data-jun14.csvuber-raw-data-may14.csvuber-raw-data-sep14.csvAlso included is the file uber-raw-data-janjune-15.csv This file has the following columns:
Dispatching_base_num : The TLC base company code of the base that dispatched the UberPickup_date : The date and time of the Uber pickupAffiliated_base_num : The TLC base company code affiliated with the Uber pickuplocationID : The pickup location ID affiliated with the Uber pickupThe Base codes are for the following Uber bases:
B02512 : Unter
B02598 : Hinter
B02617 : Weiter
B02682 : Schmecken
B02764 : Danach-NY
B02765 : Grun
B02835 : Dreist
B02836 : Drinnen
For coarse-grained location information from these pickups, the file taxi-zone-lookup.csv shows the taxi Zone (essentially, neighborhood) and Borough for each locationID.
The dataset also contains 10 files of raw data on pickups from 10 for-hire vehicle (FHV) companies. The trip information varies by company, but can include day of trip, time of trip, pickup location, driver's for-hire license number, and vehicle's for-hire license number.
These files are named:
American_B01362.csvDiplo_B01196.csvHighclass_B01717.csvSkyline_B00111.csvCarmel_B00256.csvFederal_02216.csvLyft_B02510.csvDial7_B00887.csvFirstclass_B01536.csvPrestige_B01338.csvThere is also a file other-FHV-data-jan-aug-2015.csv containing daily pickup data for 329 FHV companies from January 2015 through August 2015.
The file Uber-Jan-Feb-FOIL.csv contains aggregated daily Uber trip statistics in January and February 2015.
CREATE TABLE other_american_b01362 (
"date" TIMESTAMP,
"time" VARCHAR,
"pick_up_address" VARCHAR,
"unnamed_3" VARCHAR -- Unnamed: 3,
"unnamed_4" VARCHAR -- Unnamed: 4,
"unnamed_5" VARCHAR -- Unnamed: 5
);CREATE TABLE other_dial7_b00887 (
"date" TIMESTAMP,
"time" VARCHAR,
"state" VARCHAR,
"pufrom" VARCHAR,
"address" VARCHAR,
"street" VARCHAR
);CREATE TABLE other_federal_02216 (
"date" TIMESTAMP,
"time" VARCHAR,
"pu_address" VARCHAR,
"do_address" VARCHAR,
"routing_details" VARCHAR,
"pu_address_1" VARCHAR,
"status" VARCHAR
);CREATE TABLE other_fhv_services_jan_aug_2015 (
"base_number" VARCHAR,
"base_name" VARCHAR,
"pick_up_date" TIMESTAMP,
"number_of_trips" BIGINT,
"number_of_vehicles" VARCHAR
);CREATE TABLE other_lyft_b02510 (
"time_of_trip" TIMESTAMP,
"start_lat" DOUBLE,
"start_lng" DOUBLE,
"unnamed_3" VARCHAR -- Unnamed: 3
);CREATE TABLE other_skyline_b00111 (
"date" TIMESTAMP,
"time" VARCHAR,
"n__street_address" VARCHAR -- Street Address,
"n__city_state" VARCHAR -- City State,
"unnamed_4" VARCHAR -- Unnamed: 4,
"unnamed_5" VARCHAR -- Unnamed: 5
);CREATE TABLE uber_jan_feb_foil (
"dispatching_base_number" VARCHAR,
"date" TIMESTAMP,
"active_vehicles" BIGINT,
"trips" BIGINT
);CREATE TABLE uber_raw_data_apr14 (
"date_time" TIMESTAMP,
"lat" DOUBLE,
"lon" DOUBLE,
"base" VARCHAR
);CREATE TABLE uber_raw_data_aug14 (
"date_time" TIMESTAMP,
"lat" DOUBLE,
"lon" DOUBLE,
"base" VARCHAR
);CREATE TABLE uber_raw_data_janjune_15 (
"dispatching_base_num" VARCHAR,
"pickup_date" TIMESTAMP,
"affiliated_base_num" VARCHAR,
"locationid" BIGINT
);CREATE TABLE uber_raw_data_jul14 (
"date_time" TIMESTAMP,
"lat" DOUBLE,
"lon" DOUBLE,
"base" VARCHAR
);CREATE TABLE uber_raw_data_jun14 (
"date_time" TIMESTAMP,
"lat" DOUBLE,
"lon" DOUBLE,
"base" VARCHAR
);CREATE TABLE uber_raw_data_may14 (
"date_time" TIMESTAMP,
"lat" DOUBLE,
"lon" DOUBLE,
"base" VARCHAR
);CREATE TABLE uber_raw_data_sep14 (
"date_time" TIMESTAMP,
"lat" DOUBLE,
"lon" DOUBLE,
"base" VARCHAR
);Anyone who has the link will be able to view this.