Baselight

English Premier League

Barclays premier league

@kaggle.saife245_english_premier_league

Loading...
Loading...

About this Dataset

English Premier League

Context

  • Sports betting is a 500 billion dollar market (Sydney Herald)
  • Football is played by 250 million players in over 200 countries (most popular sport globally).
  • The English Premier League is the most popular domestic team in the world.
  • It contains 20 years of EPL matches dataset.

Content

All data is in CSV format, ready for use within standard spreadsheet applications. Please note that some abbreviations are no longer in use (in particular odds from specific bookmakers no longer used) and refer to data collected in earlier seasons. For a current list of what bookmakers are included in the dataset please visit

Key to results data:

Div = League Division
Date = Match Date (dd/mm/yy)
Time = Time of match kick-off
HomeTeam = Home Team
Away team = Away Team
FTHG and HG = Full Time Home Team Goals
FTAG and AG = Full-Time Away Team Goals
FTR and Res = Full-Time Result (H=Home Win, D=Draw, A=Away Win)
HTHG = Half Time Home Team Goals
HTAG = Half Time Away Team Goals
HTR = Half Time Result (H=Home Win, D=Draw, A=Away Win)

Match Statistics (where available)
Attendance = Crowd Attendance
Referee = Match Referee
HS = Home Team Shots
AS = Away Team Shots
HST = Home Team Shots on Target
AST = Away Team Shots on Target
HHW = Home Team Hit Woodwork
AHW = Away Team Hit Woodwork
HC = Home Team Corners
AC = Away Team Corners
HF = Home Team Fouls Committed
AF = Away Team Fouls Committed
HFKC = Home Team Free Kicks Conceded
AFKC = Away Team Free Kicks Conceded
HO = Home Team Offsides
AO = Away Team Offsides
HY = Home Team Yellow Cards
AY = Away Team Yellow Cards
HR = Home Team Red Cards
AR = Away Team Red Cards
HBP = Home Team Bookings Points (10 = yellow, 25 = red)
ABP = Away Team Bookings Points (10 = yellow, 25 = red)

Note that Free Kicks Conceded includes fouls, offsides, and any other offense committed and will always be equal to or higher than the number of fouls. Fouls make up the vast majority of Free Kicks Conceded. Free Kicks Conceded are shown when specific data on Fouls are not available (France 2nd, Belgium 1st, and Greece 1st divisions).

Note also that English and Scottish yellow cards do not include the initial yellow card when a second is shown to a player converting it into a red, but this is included as a yellow (plus red) for European games.

Inspiration

Sports betting is a 500 billion dollar market (Sydney Herald). Football is played by 250 million players in over 200 countries (most popular sport globally). The English Premier League is the most popular domestic team in the world. We use the past
20 years dataset for predictive Analysis.

Tables

N 2020–2021

@kaggle.saife245_english_premier_league.n_2020_2021
  • 138.99 KB
  • 380 rows
  • 106 columns
Loading...

CREATE TABLE n_2020_2021 (
  "div" VARCHAR,
  "date" TIMESTAMP,
  "time" VARCHAR,
  "hometeam" VARCHAR,
  "awayteam" VARCHAR,
  "fthg" BIGINT,
  "ftag" BIGINT,
  "ftr" VARCHAR,
  "hthg" BIGINT,
  "htag" BIGINT,
  "htr" VARCHAR,
  "referee" VARCHAR,
  "hs" BIGINT,
  "as" BIGINT,
  "hst" BIGINT,
  "ast" BIGINT,
  "hf" BIGINT,
  "af" BIGINT,
  "hc" BIGINT,
  "ac" BIGINT,
  "hy" BIGINT,
  "ay" BIGINT,
  "hr" BIGINT,
  "ar" BIGINT,
  "b365h" DOUBLE,
  "b365d" DOUBLE,
  "b365a" DOUBLE,
  "bwh" DOUBLE,
  "bwd" DOUBLE,
  "bwa" DOUBLE,
  "iwh" DOUBLE,
  "iwd" DOUBLE,
  "iwa" DOUBLE,
  "psh" DOUBLE,
  "psd" DOUBLE,
  "psa" DOUBLE,
  "whh" DOUBLE,
  "whd" DOUBLE,
  "wha" DOUBLE,
  "vch" DOUBLE,
  "vcd" DOUBLE,
  "vca" DOUBLE,
  "maxh" DOUBLE,
  "maxd" DOUBLE,
  "maxa" DOUBLE,
  "avgh" DOUBLE,
  "avgd" DOUBLE,
  "avga" DOUBLE,
  "b365_2_5" DOUBLE,
  "b365_2_5_c5d2e2" DOUBLE,
  "p_2_5" DOUBLE,
  "p_2_5_63c972" DOUBLE,
  "max_2_5" DOUBLE,
  "max_2_5_e1a24b" DOUBLE,
  "avg_2_5" DOUBLE,
  "avg_2_5_cdbfa9" DOUBLE,
  "ahh" DOUBLE,
  "b365ahh" DOUBLE,
  "b365aha" DOUBLE,
  "pahh" DOUBLE,
  "paha" DOUBLE,
  "maxahh" DOUBLE,
  "maxaha" DOUBLE,
  "avgahh" DOUBLE,
  "avgaha" DOUBLE,
  "b365ch" DOUBLE,
  "b365cd" DOUBLE,
  "b365ca" DOUBLE,
  "bwch" DOUBLE,
  "bwcd" DOUBLE,
  "bwca" DOUBLE,
  "iwch" DOUBLE,
  "iwcd" DOUBLE,
  "iwca" DOUBLE,
  "psch" DOUBLE,
  "pscd" DOUBLE,
  "psca" DOUBLE,
  "whch" DOUBLE,
  "whcd" DOUBLE,
  "whca" DOUBLE,
  "vcch" DOUBLE,
  "vccd" DOUBLE,
  "vcca" DOUBLE,
  "maxch" DOUBLE,
  "maxcd" DOUBLE,
  "maxca" DOUBLE,
  "avgch" DOUBLE,
  "avgcd" DOUBLE,
  "avgca" DOUBLE,
  "b365c_2_5" DOUBLE,
  "b365c_2_5_b914ed" DOUBLE,
  "pc_2_5" DOUBLE,
  "pc_2_5_bf2f8a" DOUBLE,
  "maxc_2_5" DOUBLE,
  "maxc_2_5_9de744" DOUBLE,
  "avgc_2_5" DOUBLE,
  "avgc_2_5_63f719" DOUBLE,
  "ahch" DOUBLE,
  "b365cahh" DOUBLE,
  "b365caha" DOUBLE
);

N 2021–2022

@kaggle.saife245_english_premier_league.n_2021_2022
  • 139.97 KB
  • 380 rows
  • 106 columns
Loading...

CREATE TABLE n_2021_2022 (
  "div" VARCHAR,
  "date" TIMESTAMP,
  "time" VARCHAR,
  "hometeam" VARCHAR,
  "awayteam" VARCHAR,
  "fthg" BIGINT,
  "ftag" BIGINT,
  "ftr" VARCHAR,
  "hthg" BIGINT,
  "htag" BIGINT,
  "htr" VARCHAR,
  "referee" VARCHAR,
  "hs" BIGINT,
  "as" BIGINT,
  "hst" BIGINT,
  "ast" BIGINT,
  "hf" BIGINT,
  "af" BIGINT,
  "hc" BIGINT,
  "ac" BIGINT,
  "hy" BIGINT,
  "ay" BIGINT,
  "hr" BIGINT,
  "ar" BIGINT,
  "b365h" DOUBLE,
  "b365d" DOUBLE,
  "b365a" DOUBLE,
  "bwh" DOUBLE,
  "bwd" DOUBLE,
  "bwa" DOUBLE,
  "iwh" DOUBLE,
  "iwd" DOUBLE,
  "iwa" DOUBLE,
  "psh" DOUBLE,
  "psd" DOUBLE,
  "psa" DOUBLE,
  "whh" DOUBLE,
  "whd" DOUBLE,
  "wha" DOUBLE,
  "vch" DOUBLE,
  "vcd" DOUBLE,
  "vca" DOUBLE,
  "maxh" DOUBLE,
  "maxd" DOUBLE,
  "maxa" DOUBLE,
  "avgh" DOUBLE,
  "avgd" DOUBLE,
  "avga" DOUBLE,
  "b365_2_5" DOUBLE,
  "b365_2_5_c5d2e2" DOUBLE,
  "p_2_5" DOUBLE,
  "p_2_5_63c972" DOUBLE,
  "max_2_5" DOUBLE,
  "max_2_5_e1a24b" DOUBLE,
  "avg_2_5" DOUBLE,
  "avg_2_5_cdbfa9" DOUBLE,
  "ahh" DOUBLE,
  "b365ahh" DOUBLE,
  "b365aha" DOUBLE,
  "pahh" DOUBLE,
  "paha" DOUBLE,
  "maxahh" DOUBLE,
  "maxaha" DOUBLE,
  "avgahh" DOUBLE,
  "avgaha" DOUBLE,
  "b365ch" DOUBLE,
  "b365cd" DOUBLE,
  "b365ca" DOUBLE,
  "bwch" DOUBLE,
  "bwcd" DOUBLE,
  "bwca" DOUBLE,
  "iwch" DOUBLE,
  "iwcd" DOUBLE,
  "iwca" DOUBLE,
  "psch" DOUBLE,
  "pscd" DOUBLE,
  "psca" DOUBLE,
  "whch" DOUBLE,
  "whcd" DOUBLE,
  "whca" DOUBLE,
  "vcch" DOUBLE,
  "vccd" DOUBLE,
  "vcca" DOUBLE,
  "maxch" DOUBLE,
  "maxcd" DOUBLE,
  "maxca" DOUBLE,
  "avgch" DOUBLE,
  "avgcd" DOUBLE,
  "avgca" DOUBLE,
  "b365c_2_5" DOUBLE,
  "b365c_2_5_b914ed" DOUBLE,
  "pc_2_5" DOUBLE,
  "pc_2_5_bf2f8a" DOUBLE,
  "maxc_2_5" DOUBLE,
  "maxc_2_5_9de744" DOUBLE,
  "avgc_2_5" DOUBLE,
  "avgc_2_5_63f719" DOUBLE,
  "ahch" DOUBLE,
  "b365cahh" DOUBLE,
  "b365caha" DOUBLE
);

Final Dataset

@kaggle.saife245_english_premier_league.final_dataset
  • 244.74 KB
  • 6840 rows
  • 40 columns
Loading...

CREATE TABLE final_dataset (
  "unnamed_0" BIGINT,
  "date" VARCHAR,
  "hometeam" VARCHAR,
  "awayteam" VARCHAR,
  "fthg" BIGINT,
  "ftag" BIGINT,
  "ftr" VARCHAR,
  "htgs" BIGINT,
  "atgs" BIGINT,
  "htgc" BIGINT,
  "atgc" BIGINT,
  "htp" DOUBLE,
  "atp" DOUBLE,
  "hm1" VARCHAR,
  "hm2" VARCHAR,
  "hm3" VARCHAR,
  "hm4" VARCHAR,
  "hm5" VARCHAR,
  "am1" VARCHAR,
  "am2" VARCHAR,
  "am3" VARCHAR,
  "am4" VARCHAR,
  "am5" VARCHAR,
  "mw" DOUBLE,
  "htformptsstr" VARCHAR,
  "atformptsstr" VARCHAR,
  "htformpts" BIGINT,
  "atformpts" BIGINT,
  "htwinstreak3" BIGINT,
  "htwinstreak5" BIGINT,
  "htlossstreak3" BIGINT,
  "htlossstreak5" BIGINT,
  "atwinstreak3" BIGINT,
  "atwinstreak5" BIGINT,
  "atlossstreak3" BIGINT,
  "atlossstreak5" BIGINT,
  "htgd" DOUBLE,
  "atgd" DOUBLE,
  "diffpts" DOUBLE,
  "diffformpts" DOUBLE
);

Share link

Anyone who has the link will be able to view this.