Baselight

AiportsUSA20112020clean

Commercial airplane flight delays in the United States 2011-2020 after EDA

@kaggle.giancarlomarchesi_quickstartedavisualizations

Loading...
Loading...

About this Dataset

AiportsUSA20112020clean

The contains flight statistics for all airports in the United States from January 2011 to December 2020. Each observation is reported by month, year, airport, and airline. Flights can be categorized as on time, delayed, canceled, or diverted. Flight delays are attributed to five causes: carrier, weather, NAS, security, and late aircraft. The data was downloaded from the Bureau of Transportation Statistics website https://www.transtats.bts.gov/OT_Delay/OT_DelayCause1.asp.

The accompanying notebook explores commercial airplane flight delays in the United States using Python's visualization capabilities in Matplotlib and Seaborn, through the lenses of seasonality, airport traffic, and airline performance.

The clean data set (delays_clean.csv) is analyzed using the following visualizations:

Bar chart
Bar chart subplots
Lollipop chart
Tree maps
Line plot
Histogram
Histogram subplots
Horizontal stacked bar chart
Ranked horizontal bar chart
Box plot
Pareto chart - double axis
Marginal histogram
Pie charts
Scatter plot
Violin plot
Map chart
Linear regression

Tables

Delays Clean

@kaggle.giancarlomarchesi_quickstartedavisualizations.delays_clean
  • 10.92 MB
  • 157906 rows
  • 35 columns
Loading...

CREATE TABLE delays_clean (
  "year" BIGINT,
  "month" BIGINT,
  "carrier" VARCHAR,
  "carrier_name" VARCHAR,
  "airport" VARCHAR,
  "arr_flights" DOUBLE,
  "arr_del15" DOUBLE,
  "carrier_ct" DOUBLE,
  "weather_ct" DOUBLE,
  "nas_ct" DOUBLE,
  "security_ct" DOUBLE,
  "late_aircraft_ct" DOUBLE,
  "arr_cancelled" DOUBLE,
  "arr_diverted" DOUBLE,
  "min_delay" DOUBLE,
  "carrier_delay" DOUBLE,
  "weather_delay" DOUBLE,
  "nas_delay" DOUBLE,
  "security_delay" DOUBLE,
  "late_aircraft_delay" DOUBLE,
  "arr_ontime" DOUBLE,
  "airport_name" VARCHAR,
  "city" VARCHAR,
  "state" VARCHAR,
  "perontime" DOUBLE,
  "perdelay" DOUBLE,
  "percancelled" DOUBLE,
  "perdiverted" DOUBLE,
  "per_car_delay" DOUBLE,
  "per_wea_delay" DOUBLE,
  "per_nas_delay" DOUBLE,
  "per_sec_delay" DOUBLE,
  "per_late_delay" DOUBLE,
  "latitude" DOUBLE,
  "longitude" DOUBLE
);

Share link

Anyone who has the link will be able to view this.