120 Years Of Olympic History: Athletes And Results
basic bio data on athletes and medal results from Athens 1896 to Rio 2016
@kaggle.heesoo37_120_years_of_olympic_history_athletes_and_results
This is a historical dataset on the modern Olympic Games, including all the Games from Athens 1896 to Rio 2016. I scraped this data from www.sports-reference.com in May 2018. The R code I used to scrape and wrangle the data is on GitHub. I recommend checking my kernel before starting your own analysis.
Note that the Winter and Summer Games were held in the same year up until 1992. After that, they were staggered so that the Winter Games occur on a four-year cycle starting in 1994, with the Summer Games following in 1996, the Winter Games in 1998, and so on. A common mistake people make when analyzing this data is to assume that the Summer and Winter Games have always been staggered.
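One way to check the staggering before grouping by year is to list which seasons occur in each year. The sketch below uses Python's built-in sqlite3 module and a handful of hypothetical sample rows in place of the real table; only the "year" and "season" column names come from the schema.

```python
import sqlite3

# Hypothetical sample rows (year, season) standing in for the full table.
SAMPLE = [
    (1992, "Summer"), (1992, "Winter"),
    (1994, "Winter"), (1996, "Summer"), (1998, "Winter"),
]

def seasons_by_year(rows):
    """Return {year: [seasons held that year]} with seasons in sorted order."""
    con = sqlite3.connect(":memory:")
    con.execute('CREATE TABLE athlete_events ("year" INTEGER, "season" TEXT)')
    con.executemany("INSERT INTO athlete_events VALUES (?, ?)", rows)
    query = """
        SELECT DISTINCT "year", "season"
        FROM athlete_events
        ORDER BY "year", "season"
    """
    result = {}
    for year, season in con.execute(query):
        result.setdefault(year, []).append(season)
    con.close()
    return result

by_year = seasons_by_year(SAMPLE)
for year, seasons in by_year.items():
    print(year, seasons)
```

Years that list both seasons need to be split by "season" (or grouped by "games") rather than by "year" alone.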
The file athlete_events.csv contains 271116 rows and 15 columns. Each row corresponds to an individual athlete competing in an individual Olympic event (athlete-events). The columns are listed in the athlete_events table definition below.
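Because each row is an athlete-event rather than an athlete, row counts and athlete counts differ. The following is a minimal sketch of that distinction, assuming the "id" column identifies an athlete across rows; the athlete names and rows are made up for illustration.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute(
    'CREATE TABLE athlete_events ("id" INTEGER, "name" TEXT, "games" TEXT, "event" TEXT)'
)

# Hypothetical rows: athlete 1 enters two events, athlete 2 enters one.
rows = [
    (1, "Athlete A", "2016 Summer", "Swimming Men's 100 metres Freestyle"),
    (1, "Athlete A", "2016 Summer", "Swimming Men's 200 metres Freestyle"),
    (2, "Athlete B", "2016 Summer", "Athletics Women's Marathon"),
]
con.executemany("INSERT INTO athlete_events VALUES (?, ?, ?, ?)", rows)

# COUNT(*) counts athlete-events; COUNT(DISTINCT "id") counts athletes.
n_rows, n_athletes = con.execute(
    'SELECT COUNT(*), COUNT(DISTINCT "id") FROM athlete_events'
).fetchone()
con.close()

print("athlete-events:", n_rows)
print("distinct athletes:", n_athletes)
```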
The Olympic data on www.sports-reference.com is the result of an incredible amount of research by a group of Olympic history enthusiasts and self-proclaimed 'statistorians'. Check out their blog for more information. All I did was consolidate their decades of work into a convenient format for data analysis.
This dataset provides an opportunity to ask questions about how the Olympics have evolved over time, including questions about the participation and performance of women, different nations, and different sports and events.
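As one example of such a question, the share of women among the athletes at each Games can be estimated by counting distinct athletes per year. This is only a sketch: it assumes "sex" is coded as 'F' and 'M', runs on invented sample rows, and groups by "year" alone, which mixes Summer and Winter Games in years where both were held.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute('CREATE TABLE athlete_events ("id" INTEGER, "sex" TEXT, "year" INTEGER)')

# Hypothetical rows; athlete 3 appears twice because she entered two events.
rows = [
    (1, "M", 1900), (2, "M", 1900), (3, "F", 1900), (3, "F", 1900),
    (4, "F", 2016), (5, "M", 2016),
]
con.executemany("INSERT INTO athlete_events VALUES (?, ?, ?)", rows)

# Count distinct athletes so multi-event athletes are not counted repeatedly.
query = """
    SELECT "year",
           COUNT(DISTINCT CASE WHEN "sex" = 'F' THEN "id" END) * 1.0
               / COUNT(DISTINCT "id") AS female_share
    FROM athlete_events
    GROUP BY "year"
    ORDER BY "year"
"""
share = dict(con.execute(query).fetchall())
con.close()

for year, value in share.items():
    print(year, round(value, 3))
```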
CREATE TABLE athlete_events (
  "id" BIGINT,
  "name" VARCHAR,
  "sex" VARCHAR,
  "age" DOUBLE,
  "height" DOUBLE,
  "weight" DOUBLE,
  "team" VARCHAR,
  "noc" VARCHAR,
  "games" VARCHAR,
  "year" BIGINT,
  "season" VARCHAR,
  "city" VARCHAR,
  "sport" VARCHAR,
  "event" VARCHAR,
  "medal" VARCHAR
);

CREATE TABLE noc_regions (
  "noc" VARCHAR,
  "region" VARCHAR,
  "notes" VARCHAR
);
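The two tables share the "noc" column, so region names can be attached to athlete-events with a join. The sketch below assumes that a missing medal is stored as NULL and uses invented sample rows, including a hypothetical code 'XXX' with no matching region; a LEFT JOIN keeps such rows instead of dropping them.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute('CREATE TABLE athlete_events ("noc" TEXT, "medal" TEXT)')
con.execute('CREATE TABLE noc_regions ("noc" TEXT, "region" TEXT, "notes" TEXT)')

# Hypothetical sample data for illustration only.
con.executemany(
    "INSERT INTO athlete_events VALUES (?, ?)",
    [("USA", "Gold"), ("USA", None), ("GER", "Silver"),
     ("GER", "Gold"), ("XXX", "Bronze")],
)
con.executemany(
    "INSERT INTO noc_regions VALUES (?, ?, ?)",
    [("USA", "USA", None), ("GER", "Germany", None)],
)

# Fall back to the raw NOC code when no region row matches.
query = """
    SELECT COALESCE(r."region", a."noc") AS region_name,
           COUNT(*) AS medal_rows
    FROM athlete_events AS a
    LEFT JOIN noc_regions AS r ON a."noc" = r."noc"
    WHERE a."medal" IS NOT NULL
    GROUP BY region_name
    ORDER BY medal_rows DESC, region_name
"""
medal_rows = con.execute(query).fetchall()
con.close()

for region_name, count in medal_rows:
    print(region_name, count)
```

Note that this counts medal-winning athlete-events, not medals: a team event contributes one row per team member, so the totals will exceed official medal tables unless rows are first de-duplicated per event.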