Baselight

Paralympic Games

Web scraped data of historical Paralympic Games, events, and athletes

@kaggle.katiepress_paralympic_games

Loading...
Loading...

About this Dataset

Paralympic Games

Data is scraped from the International Paralympic Committee (IPC) website for all available summer and winter Paralympic Games, along with the Wikipedia table of host cities for additional info.

The first file, medal_standings is a combination of the overall medal standings tables for each year and the medal standings by sport for each year.

The second file, medal_athlete, has the overall number of athletes, events, and medals for each individual sport, along with the individual events and athletes/medals associated with those events.

NOTE: An attempt was made to clean up the athlete names in the medal_athlete file so that each athlete name is on a separate row (data is considered tidy/long format). However, the names are not 100% accurate, and the original athlete info column was retained for information and further processing if needed.

Original code used for scraping/cleaning is on my Github.

Tables

Medal Athlete

@kaggle.katiepress_paralympic_games.medal_athlete
  • 555.57 KB
  • 29170 rows
  • 22 columns
Loading...

CREATE TABLE medal_athlete (
  "games_code" VARCHAR,
  "games_year" BIGINT,
  "games_city" VARCHAR,
  "games_country" VARCHAR,
  "games_continent" VARCHAR,
  "games_start" VARCHAR,
  "games_end" VARCHAR,
  "games_season" VARCHAR,
  "sport" VARCHAR,
  "sport_code" VARCHAR,
  "event_dates" VARCHAR,
  "event_venue" VARCHAR,
  "events" BIGINT,
  "npcs" BIGINT,
  "athletes" BIGINT,
  "event" VARCHAR,
  "medal" VARCHAR,
  "npc" VARCHAR,
  "npc_new" VARCHAR,
  "npc_name" VARCHAR,
  "athlete_name" VARCHAR,
  "athlete_info_og" VARCHAR
);

Medal Standings

@kaggle.katiepress_paralympic_games.medal_standings
  • 30.8 KB
  • 2346 rows
  • 18 columns
Loading...

CREATE TABLE medal_standings (
  "games_code" VARCHAR,
  "games_year" BIGINT,
  "games_city" VARCHAR,
  "games_country" VARCHAR,
  "games_continent" VARCHAR,
  "games_start" VARCHAR,
  "games_end" VARCHAR,
  "games_season" VARCHAR,
  "npc" VARCHAR,
  "npc_new" VARCHAR,
  "npc_name" VARCHAR,
  "rank_type" VARCHAR,
  "npc_rank" DOUBLE,
  "npc_gold" BIGINT,
  "npc_silver" BIGINT,
  "npc_bronze" BIGINT,
  "sport_code" VARCHAR,
  "sport" VARCHAR
);

Share link

Anyone who has the link will be able to view this.