US 2023 Civil Flights, Delays, Meteo And Aircrafts
Perfect dataset to perform multiple analyzes and train a prediction model.
@kaggle.bordanova_2023_us_civil_flights_delay_meteo_and_aircraft
Perfect dataset to perform multiple analyzes and train a prediction model.
@kaggle.bordanova_2023_us_civil_flights_delay_meteo_and_aircraft
I offer you a complete dataset of US civil aviation 2023 containing :
The information on flights comes from the BTS (Bureau of Transportation Statistics) website and I used the Meteostat Python library to scrape the information. I also developed a correspondence table for the airports, their identifiers and geolocation coordinates.
US_flights_2023.csv
Main table containing all flight information except weather. Cancelled or diverted flights have been extracted in a dedicated table to avoid biasing delay statistics and analysis.
weather_meteo_by_airport.csv
Table containing weather data (temperatures, air pressure, snow cover, precipitation, wind strength and direction) for each airport and day of the year. This table can be joined to the main table, using either the departure or arrival airport ID, or both, to study in-flight variations.
Cancelled_Diverted_2023.csv
Table des annulations et vols déviés pour une analyse dédiée. Il est possible de créer une jointure avec la table météo pour identifier les causes probables.
Here's my dashboard associated with the analysis of civil aviation delays in the United States in 2023. You'll find some interesting indicators and an observation on the various correlations between delays. This dashboard was designed on a student version of Power Bi, with pre-processing in Python and Pandas. Thanks for your feedback.
Thank you for your feedback
CREATE TABLE airports_geolocation (
"iata_code" VARCHAR,
"airport" VARCHAR,
"city" VARCHAR,
"state" VARCHAR,
"country" VARCHAR,
"latitude" DOUBLE,
"longitude" DOUBLE
);CREATE TABLE cancelled_diverted_2023 (
"flightdate" TIMESTAMP,
"day_of_week" BIGINT,
"airline" VARCHAR,
"tail_number" VARCHAR,
"cancelled" DOUBLE,
"diverted" DOUBLE,
"dep_airport" VARCHAR,
"dep_cityname" VARCHAR,
"deptime_label" VARCHAR,
"dep_delay" DOUBLE,
"dep_delay_tag" BIGINT,
"dep_delay_type" VARCHAR,
"arr_airport" VARCHAR,
"arr_cityname" VARCHAR,
"arr_delay" DOUBLE,
"arr_delay_type" VARCHAR,
"flight_duration" DOUBLE,
"distance_type" VARCHAR,
"delay_carrier" DOUBLE,
"delay_weather" DOUBLE,
"delay_nas" DOUBLE,
"delay_security" DOUBLE,
"delay_lastaircraft" DOUBLE
);CREATE TABLE maj_us_flight_january_2024 (
"flightdate" TIMESTAMP,
"day_of_week" BIGINT,
"airline" VARCHAR,
"tail_number" VARCHAR,
"dep_airport" VARCHAR,
"dep_cityname" VARCHAR,
"deptime_label" VARCHAR,
"dep_delay" BIGINT,
"dep_delay_tag" BIGINT,
"dep_delay_type" VARCHAR,
"arr_airport" VARCHAR,
"arr_cityname" VARCHAR,
"arr_delay" BIGINT,
"arr_delay_type" VARCHAR,
"flight_duration" BIGINT,
"distance_type" VARCHAR,
"delay_carrier" BIGINT,
"delay_weather" BIGINT,
"delay_nas" BIGINT,
"delay_security" BIGINT,
"delay_lastaircraft" BIGINT,
"manufacturer" VARCHAR,
"model" VARCHAR,
"aicraft_age" BIGINT
);CREATE TABLE us_flights_2023 (
"flightdate" TIMESTAMP,
"day_of_week" BIGINT,
"airline" VARCHAR,
"tail_number" VARCHAR,
"dep_airport" VARCHAR,
"dep_cityname" VARCHAR,
"deptime_label" VARCHAR,
"dep_delay" BIGINT,
"dep_delay_tag" BIGINT,
"dep_delay_type" VARCHAR,
"arr_airport" VARCHAR,
"arr_cityname" VARCHAR,
"arr_delay" BIGINT,
"arr_delay_type" VARCHAR,
"flight_duration" BIGINT,
"distance_type" VARCHAR,
"delay_carrier" BIGINT,
"delay_weather" BIGINT,
"delay_nas" BIGINT,
"delay_security" BIGINT,
"delay_lastaircraft" BIGINT,
"manufacturer" VARCHAR,
"model" VARCHAR,
"aicraft_age" BIGINT
);CREATE TABLE weather_meteo_by_airport (
"time" TIMESTAMP,
"tavg" DOUBLE,
"tmin" DOUBLE,
"tmax" DOUBLE,
"prcp" DOUBLE,
"snow" DOUBLE,
"wdir" DOUBLE,
"wspd" DOUBLE,
"pres" DOUBLE,
"airport_id" VARCHAR
);Anyone who has the link will be able to view this.