English Premier League Match Events And Results
2001-2002 season onwards
@kaggle.josephvm_english_premier_league_game_events_and_results
2001-2002 season onwards
@kaggle.josephvm_english_premier_league_game_events_and_results
There are two csvs with match information, matches.csv and events.csv, that contain information about each match from the 2001-2002 season through roughly current during the 2021-2022 season. matches.csv contains information such as the teams playing, final score, date, and lineups. events.csv contains events that happened in a game, at what time, and in what game.
There is one csv with table information, all_tables.csv, that contains the tables from the 2001-2002 season through roughly current during the 2021-2022 season.
There is 3 csvs with aggregated stats in the agg_stats folder. They have data from 2002 through present.
I plan on updating this dataset with data approximately weekly while a season is ongoing.
Note: The Year column in matches.csv contains the year that the season started in, not the year that the match took place.
Note: 107 of 380 matches in the 2001-2002 season have no commentary.
Match data was scrapped by me from: https://www.espn.com/soccer/fixtures/_/date/20210413/league/eng.1
Tables were scrapped by me from: https://www.espn.com/soccer/standings/_/league/ENG.1/season/2020
Image
Code used to scrap the data is located here.
CREATE TABLE all_tables (
"place" BIGINT,
"team" VARCHAR,
"gp" BIGINT,
"w" BIGINT,
"d" BIGINT,
"l" BIGINT,
"gf" BIGINT,
"ga" BIGINT,
"gd" BIGINT,
"p" BIGINT,
"year" BIGINT
);
CREATE TABLE events (
"id" BIGINT,
"time" VARCHAR,
"event" VARCHAR
);
CREATE TABLE matches (
"id" BIGINT,
"home" VARCHAR,
"away" VARCHAR,
"date" VARCHAR,
"year" BIGINT,
"time_utc" VARCHAR -- Time (utc),
"attendance" DOUBLE,
"venue" VARCHAR,
"league" VARCHAR,
"part_of_competition" VARCHAR,
"game_status" VARCHAR,
"shootout" BOOLEAN,
"home_score" BIGINT,
"away_score" BIGINT,
"home_possessionpct" VARCHAR,
"away_possessionpct" VARCHAR,
"home_shotssummary" VARCHAR,
"away_shotssummary" VARCHAR,
"home_foulscommitted" DOUBLE,
"away_foulscommitted" DOUBLE,
"home_yellowcards" DOUBLE,
"away_yellowcards" DOUBLE,
"home_redcards" DOUBLE,
"away_redcards" DOUBLE,
"home_offsides" DOUBLE,
"away_offsides" DOUBLE,
"home_woncorners" DOUBLE,
"away_woncorners" DOUBLE,
"home_saves" DOUBLE,
"away_saves" DOUBLE,
"home_goal_minutes" VARCHAR,
"home_goal_scorers" VARCHAR,
"away_goal_minutes" VARCHAR,
"away_goal_scorers" VARCHAR,
"home_starting_1_num" DOUBLE,
"home_starting_1" VARCHAR,
"home_starting_2_num" DOUBLE,
"home_starting_2" VARCHAR,
"home_starting_3_num" DOUBLE,
"home_starting_3" VARCHAR,
"home_starting_4_num" DOUBLE,
"home_starting_4" VARCHAR,
"home_starting_5_num" DOUBLE,
"home_starting_5" VARCHAR,
"home_starting_6_num" DOUBLE,
"home_starting_6" VARCHAR,
"home_starting_7_num" DOUBLE,
"home_starting_7" VARCHAR,
"home_starting_8_num" DOUBLE,
"home_starting_8" VARCHAR,
"home_starting_9_num" DOUBLE,
"home_starting_9" VARCHAR,
"home_starting_10_num" DOUBLE,
"home_starting_10" VARCHAR,
"home_starting_11_num" DOUBLE,
"home_starting_11" VARCHAR,
"home_bench_1_num" DOUBLE,
"home_bench_1" VARCHAR,
"home_bench_1_minute" VARCHAR,
"home_bench_2_num" DOUBLE,
"home_bench_2" VARCHAR,
"home_bench_2_minute" VARCHAR,
"home_bench_3_num" DOUBLE,
"home_bench_3" VARCHAR,
"home_bench_3_minute" VARCHAR,
"home_bench_4_num" DOUBLE,
"home_bench_4" VARCHAR,
"home_bench_4_minute" VARCHAR,
"home_bench_5_num" DOUBLE,
"home_bench_5" VARCHAR,
"home_bench_5_minute" VARCHAR,
"away_starting_1_num" DOUBLE,
"away_starting_1" VARCHAR,
"away_starting_2_num" DOUBLE,
"away_starting_2" VARCHAR,
"away_starting_3_num" DOUBLE,
"away_starting_3" VARCHAR,
"away_starting_4_num" DOUBLE,
"away_starting_4" VARCHAR,
"away_starting_5_num" DOUBLE,
"away_starting_5" VARCHAR,
"away_starting_6_num" DOUBLE,
"away_starting_6" VARCHAR,
"away_starting_7_num" DOUBLE,
"away_starting_7" VARCHAR,
"away_starting_8_num" DOUBLE,
"away_starting_8" VARCHAR,
"away_starting_9_num" DOUBLE,
"away_starting_9" VARCHAR,
"away_starting_10_num" DOUBLE,
"away_starting_10" VARCHAR,
"away_starting_11_num" DOUBLE,
"away_starting_11" VARCHAR,
"away_bench_1_num" DOUBLE,
"away_bench_1" VARCHAR,
"away_bench_1_minute" VARCHAR,
"away_bench_2_num" DOUBLE,
"away_bench_2" VARCHAR,
"away_bench_2_minute" VARCHAR,
"away_bench_3_num" DOUBLE
);
Anyone who has the link will be able to view this.