Baselight

EPL Dataset 2022/2023 (Update Every Week)

Weekly Updated 2022/2023 Premier League Dataset

@kaggle.acothaha_epl_dataset_20222023_update_every_week

Loading...
Loading...

About this Dataset

EPL Dataset 2022/2023 (Update Every Week)

Context

The Premier League (legal name: The Football Association Premier League Limited) is the highest level of the men's English football league system. Contested by 20 clubs, it operates on a system of promotion and relegation with the English Football League (EFL). Seasons typically run from August to May with each team playing 38 matches (playing all 19 other teams both home and away). Most games are played on Saturday and Sunday afternoons, with occasional weekday evening fixtures. (Wikipedia)

Arguably, English Premier League is the biggest football league on earth. It is home to some of the most successful and popular Football teams, including Manchester United, Liverpool, Arsenal and Chelsea. The league is known for its high level of competition and attracts top talent from around the globe. In addition, the Premier League has a huge following, with with an astounding broadcast in 212 territories to 643 million homes and a potential TV audience of 4.7 billion

Content

This dataset is collected using webscraping from several websites (mainly premierleague.com) and currently there are 2 files that you can utilize:

EPL_2022_2023_(date).json

This dataset contains the information of every match in EPL 2022/2023 so far

  • match_id: The unique match id for each match
  • event: the events (commentary) while the match is happening
  • matchweek: the matchweek of the match
  • team1_name: The home team name
  • team1_startings: The home starting 11
  • team1_subs: The home team subtitution players
  • team1_stat: The home team statistics (Possession%, Shots on target, Shots, Touches, Passes, Tackles, Clearences, Corners, Offsides, Yellow cards, Foul conceded)
  • team2_name: The away team name
  • team2_startings: The away starting 11
  • team2_subs: The away team subtitution players
  • team2_stat: The away team statistics (Possession%, Shots on target, Shots, Touches, Passes, Tackles, Clearences, Corners, Offsides, Yellow cards, Foul conceded)
player_info.csv

This dataset contains the information of each player in EPL 2022/2023

  • player_name: The name of the player
  • team: The team that the player play in
  • birthday: The date of birth of the player
  • position: The position of the player

Inspiration

This dataset is inspired by the film "Moneyball" which make me want to build a dataset that can be utilized to do a football analytics by looking at several information especially the event (Commentary) and one thing that I can provide to you my fellow data enthusiasts is the weekly update of the dataset which happens (hopefully) every monday.

Please feel free to give feedback on the dataset, what can be improved, what data should be added, etc.

Tables

Player Info

@kaggle.acothaha_epl_dataset_20222023_update_every_week.player_info
  • 23.02 KB
  • 658 rows
  • 5 columns
Loading...

CREATE TABLE player_info (
  "unnamed_0" BIGINT,
  "player_name" VARCHAR,
  "team" VARCHAR,
  "birthday" TIMESTAMP,
  "position" VARCHAR
);

Share link

Anyone who has the link will be able to view this.