Baselight

January Flight Delay Prediction

US flight data for January 2019 and January 2020.

@kaggle.divyansh22_flight_delay_prediction


About this Dataset

This is the first part of the flight delay prediction dataset, covering the month of January. For the second part, covering February, see the dataset February Flight Delay Prediction.

This data was collected from the Bureau of Transportation Statistics of the U.S. government and is openly available under U.S. Govt. Works. The dataset contains all flights in January 2019 and January 2020. Each January alone has more than 400,000 flights across the United States (583,985 rows for 2019 and 607,346 rows for 2020 in the tables listed on this page). The features were chosen manually to support a primary time series analysis. Several other features are available on the Bureau's website.

Because the data covers January only, it is best suited to predicting flight delays at the destination airport for the month of January in upcoming years.
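As a rough illustration of what predicting delay at the destination airport could look like, the sketch below fits a per-destination delay rate and thresholds it. This is a simple baseline chosen for illustration, not the dataset author's method. The function names, the 0.5 threshold, and the toy rows are all hypothetical; only the `dest` and `arr_del15` column names come from the schema on this page, and `arr_del15` is assumed to be 1.0 when a flight arrives at least 15 minutes late.

```python
from collections import defaultdict

def fit_dest_delay_rates(rows):
    """Compute the share of delayed arrivals per destination.

    rows: iterable of (dest, arr_del15) pairs; arr_del15 may be None
    (assumed here to happen for cancelled or diverted flights).
    """
    totals = defaultdict(lambda: [0.0, 0])  # dest -> [delayed count, flight count]
    for dest, arr_del15 in rows:
        if arr_del15 is None:
            continue  # skip flights with no arrival flag
        totals[dest][0] += arr_del15
        totals[dest][1] += 1
    return {dest: delayed / count for dest, (delayed, count) in totals.items()}

def predict_delayed(rates, dest, threshold=0.5, default=0.0):
    """Predict a delay when the historical rate for dest reaches the threshold."""
    return rates.get(dest, default) >= threshold

# Toy rows invented for illustration; they are not taken from the dataset.
train = [("LAX", 1.0), ("LAX", 1.0), ("LAX", 0.0), ("SEA", 0.0), ("SEA", None)]
rates = fit_dest_delay_rates(train)
```

In practice one might fit the rates on the January 2019 table and check them against January 2020, though a real model would likely use more of the available columns.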

Tables

Jan 2019 Ontime

@kaggle.divyansh22_flight_delay_prediction.jan_2019_ontime
  • 8.28 MB
  • 583985 rows
  • 22 columns

CREATE TABLE jan_2019_ontime (
  "day_of_month" BIGINT,
  "day_of_week" BIGINT,
  "op_unique_carrier" VARCHAR,
  "op_carrier_airline_id" BIGINT,
  "op_carrier" VARCHAR,
  "tail_num" VARCHAR,
  "op_carrier_fl_num" BIGINT,
  "origin_airport_id" BIGINT,
  "origin_airport_seq_id" BIGINT,
  "origin" VARCHAR,
  "dest_airport_id" BIGINT,
  "dest_airport_seq_id" BIGINT,
  "dest" VARCHAR,
  "dep_time" DOUBLE,
  "dep_del15" DOUBLE,
  "dep_time_blk" VARCHAR,
  "arr_time" DOUBLE,
  "arr_del15" DOUBLE,
  "cancelled" DOUBLE,
  "diverted" DOUBLE,
  "distance" DOUBLE,
  "unnamed_21" VARCHAR
);
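A minimal sketch of how this table might be queried, here computing the share of delayed arrivals per carrier. It uses Python's built-in `sqlite3` with a cut-down copy of the schema and a few invented rows so that it runs on its own. The helper name `delay_rate_by_carrier` and the toy data are hypothetical, and the sketch assumes that `arr_del15` is 1.0 for arrivals at least 15 minutes late and NULL for cancelled flights.

```python
import sqlite3

# Toy rows invented for illustration; they are not taken from the dataset.
# Columns: op_unique_carrier, origin, dest, arr_del15, cancelled
TOY_ROWS = [
    ("AA", "JFK", "LAX", 0.0, 0.0),
    ("AA", "JFK", "ORD", 1.0, 0.0),
    ("DL", "ATL", "MIA", 0.0, 0.0),
    ("DL", "ATL", "SEA", None, 1.0),  # cancelled flight, no arrival flag
]

def delay_rate_by_carrier(conn):
    """Return {carrier: share of flights with arr_del15 = 1}, ignoring NULL flags."""
    query = """
        SELECT op_unique_carrier, AVG(arr_del15) AS delay_rate
        FROM jan_2019_ontime
        WHERE arr_del15 IS NOT NULL
        GROUP BY op_unique_carrier
        ORDER BY op_unique_carrier
    """
    return dict(conn.execute(query).fetchall())

conn = sqlite3.connect(":memory:")
# Only a subset of the 22 columns is recreated here, with SQLite type names.
conn.execute(
    """CREATE TABLE jan_2019_ontime (
           op_unique_carrier TEXT, origin TEXT, dest TEXT,
           arr_del15 REAL, cancelled REAL)"""
)
conn.executemany("INSERT INTO jan_2019_ontime VALUES (?, ?, ?, ?, ?)", TOY_ROWS)
rates = delay_rate_by_carrier(conn)
```

Because `arr_del15` is stored as a 0/1 value, its average over non-NULL rows is the delayed share directly.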

Jan 2020 Ontime

@kaggle.divyansh22_flight_delay_prediction.jan_2020_ontime
  • 8.66 MB
  • 607346 rows
  • 22 columns

CREATE TABLE jan_2020_ontime (
  "day_of_month" BIGINT,
  "day_of_week" BIGINT,
  "op_unique_carrier" VARCHAR,
  "op_carrier_airline_id" BIGINT,
  "op_carrier" VARCHAR,
  "tail_num" VARCHAR,
  "op_carrier_fl_num" BIGINT,
  "origin_airport_id" BIGINT,
  "origin_airport_seq_id" BIGINT,
  "origin" VARCHAR,
  "dest_airport_id" BIGINT,
  "dest_airport_seq_id" BIGINT,
  "dest" VARCHAR,
  "dep_time" DOUBLE,
  "dep_del15" DOUBLE,
  "dep_time_blk" VARCHAR,
  "arr_time" DOUBLE,
  "arr_del15" DOUBLE,
  "cancelled" DOUBLE,
  "diverted" DOUBLE,
  "distance" DOUBLE,
  "unnamed_21" VARCHAR
);
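Since both tables share the same 22 columns, they can be stacked with a `UNION ALL` and a literal year column for year-over-year comparisons. The sketch below shows the idea with Python's built-in `sqlite3`, a cut-down schema, and invented rows. The `year` column and the toy data are assumptions added for illustration and are not part of the dataset.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Recreate a three-column subset of each table so the example is self-contained.
for table in ("jan_2019_ontime", "jan_2020_ontime"):
    conn.execute(
        f"CREATE TABLE {table} (day_of_month INTEGER, dest TEXT, arr_del15 REAL)"
    )

# Toy rows invented for illustration; they are not taken from the dataset.
conn.executemany(
    "INSERT INTO jan_2019_ontime VALUES (?, ?, ?)",
    [(1, "LAX", 1.0), (2, "LAX", 0.0)],
)
conn.executemany(
    "INSERT INTO jan_2020_ontime VALUES (?, ?, ?)",
    [(1, "LAX", 0.0)],
)

# Stack both months and tag each row with its year.
COMBINED = """
    SELECT 2019 AS year, day_of_month, dest, arr_del15 FROM jan_2019_ontime
    UNION ALL
    SELECT 2020 AS year, day_of_month, dest, arr_del15 FROM jan_2020_ontime
"""

# Flights and delayed arrivals per year over the combined rows.
rows = conn.execute(
    f"SELECT year, COUNT(*), SUM(arr_del15) FROM ({COMBINED}) "
    "GROUP BY year ORDER BY year"
).fetchall()
```

`UNION ALL` is used rather than `UNION` so that identical rows are kept instead of being deduplicated.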
