Paralympic Games
Web scraped data of historical Paralympic Games, events, and athletes
@kaggle.katiepress_paralympic_games
Web scraped data of historical Paralympic Games, events, and athletes
@kaggle.katiepress_paralympic_games
Data is scraped from the International Paralympic Committee (IPC) website for all available summer and winter Paralympic Games, along with the Wikipedia table of host cities for additional info.
The first file, medal_standings is a combination of the overall medal standings tables for each year and the medal standings by sport for each year.
The second file, medal_athlete, has the overall number of athletes, events, and medals for each individual sport, along with the individual events and athletes/medals associated with those events.
NOTE: An attempt was made to clean up the athlete names in the medal_athlete file so that each athlete name is on a separate row (data is considered tidy/long format). However, the names are not 100% accurate, and the original athlete info column was retained for information and further processing if needed.
Original code used for scraping/cleaning is on my Github.
CREATE TABLE medal_athlete (
"games_code" VARCHAR,
"games_year" BIGINT,
"games_city" VARCHAR,
"games_country" VARCHAR,
"games_continent" VARCHAR,
"games_start" VARCHAR,
"games_end" VARCHAR,
"games_season" VARCHAR,
"sport" VARCHAR,
"sport_code" VARCHAR,
"event_dates" VARCHAR,
"event_venue" VARCHAR,
"events" BIGINT,
"npcs" BIGINT,
"athletes" BIGINT,
"event" VARCHAR,
"medal" VARCHAR,
"npc" VARCHAR,
"npc_new" VARCHAR,
"npc_name" VARCHAR,
"athlete_name" VARCHAR,
"athlete_info_og" VARCHAR
);
CREATE TABLE medal_standings (
"games_code" VARCHAR,
"games_year" BIGINT,
"games_city" VARCHAR,
"games_country" VARCHAR,
"games_continent" VARCHAR,
"games_start" VARCHAR,
"games_end" VARCHAR,
"games_season" VARCHAR,
"npc" VARCHAR,
"npc_new" VARCHAR,
"npc_name" VARCHAR,
"rank_type" VARCHAR,
"npc_rank" DOUBLE,
"npc_gold" BIGINT,
"npc_silver" BIGINT,
"npc_bronze" BIGINT,
"sport_code" VARCHAR,
"sport" VARCHAR
);
Anyone who has the link will be able to view this.