Context
The Premier League (legal name: The Football Association Premier League Limited) is the highest level of the men's English football league system. Contested by 20 clubs, it operates on a system of promotion and relegation with the English Football League (EFL). Seasons typically run from August to May with each team playing 38 matches (playing all 19 other teams both home and away). Most games are played on Saturday and Sunday afternoons, with occasional weekday evening fixtures. (Wikipedia)
Arguably, English Premier League is the biggest football league on earth. It is home to some of the most successful and popular Football teams, including Manchester United, Liverpool, Arsenal and Chelsea. The league is known for its high level of competition and attracts top talent from around the globe. In addition, the Premier League has a huge following, with with an astounding broadcast in 212 territories to 643 million homes and a potential TV audience of 4.7 billion
Content
This dataset is collected using webscraping from several websites (mainly premierleague.com) and currently there are 2 files that you can utilize:
EPL_2022_2023_(date).json
This dataset contains the information of every match in EPL 2022/2023 so far
match_id
: The unique match id for each match
event
: the events (commentary) while the match is happening
matchweek
: the matchweek of the match
team1_name
: The home team name
team1_startings
: The home starting 11
team1_subs
: The home team subtitution players
team1_stat
: The home team statistics (Possession%, Shots on target, Shots, Touches, Passes, Tackles, Clearences, Corners, Offsides, Yellow cards, Foul conceded)
team2_name
: The away team name
team2_startings
: The away starting 11
team2_subs
: The away team subtitution players
team2_stat
: The away team statistics (Possession%, Shots on target, Shots, Touches, Passes, Tackles, Clearences, Corners, Offsides, Yellow cards, Foul conceded)
player_info.csv
This dataset contains the information of each player in EPL 2022/2023
player_name
: The name of the player
team
: The team that the player play in
birthday
: The date of birth of the player
position
: The position of the player
Inspiration
This dataset is inspired by the film "Moneyball" which make me want to build a dataset that can be utilized to do a football analytics by looking at several information especially the event (Commentary) and one thing that I can provide to you my fellow data enthusiasts is the weekly update of the dataset which happens (hopefully) every monday.
Please feel free to give feedback on the dataset, what can be improved, what data should be added, etc.