European Soccer Dataset
Season 2017/18, 2018/19 and 2019/20
@kaggle.alessiasimone_european_soccer_dataset_season_20172020
The soccer market is one of the most competitive markets, and coaches often face critical decisions about the winning game strategy for their players and teams.
The dataset contains real-life team and player performances in the five leading European leagues over three seasons. Can you answer the following questions by analyzing the dataset?
• What are the top European leagues across the seasons, and why?
• Which teams contributed the most to these leagues?
• Who are the best goalscorers and goalkeepers of those teams?
• Is it possible to define a winning game strategy?
This dataset was produced in a staging area where an ETL process was applied with Tableau Prep. You can read and download my entire report here or play with my interactive Tableau dashboard here.
From the three original datasets, the columns ending in 'm', which refer to 90 minutes only, were deselected (to save space, as they are easy to recompute during a preprocessing step). The datasets were then unified and the final dataset was cleaned: duplicate columns created by the unification step were removed, typos in players' and teams' names were corrected, "position" and "nationality" abbreviations were expanded for readability, a rollup operation was applied to the "position" column, the numerical binary values for the Champions League were converted into the strings "Yes" or "No", and the "season" was converted into a datetime.
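For readers who prefer code to Tableau Prep, a few of the steps above can be roughly sketched in pandas. This is only an illustration, not the flow actually used: the column names (`player`, `goals`, `goals_m`, `champions_league`, `season`), the `_m` suffix, and the assumption that the season is stored as its starting year are all hypothetical.

```python
import pandas as pd

def unify_seasons(frames, suffix="_m"):
    """Sketch of the cleaning steps; column names and suffix are hypothetical."""
    # Drop the 90-minute columns from each season's table.
    trimmed = [
        df.loc[:, [c for c in df.columns if not c.endswith(suffix)]]
        for df in frames
    ]
    # Unify the seasons into a single table.
    out = pd.concat(trimmed, ignore_index=True)
    # Turn the binary Champions League flag into "Yes"/"No" strings.
    out["champions_league"] = out["champions_league"].map({1: "Yes", 0: "No"})
    # Parse the season (assumed to be stored as its starting year) into a datetime.
    out["season"] = pd.to_datetime(out["season"].astype(str), format="%Y")
    return out

# Two toy one-row "seasons" standing in for the real source tables.
s1 = pd.DataFrame({"player": ["A"], "goals": [10], "goals_m": [0.4],
                   "champions_league": [1], "season": [2017]})
s2 = pd.DataFrame({"player": ["B"], "goals": [3], "goals_m": [0.1],
                   "champions_league": [0], "season": [2018]})

final = unify_seasons([s1, s2])
print(final)
```

The name corrections, abbreviation expansion and "position" rollup are left out because they depend on lookup tables that are not given here.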
The final dataset contains 6824 observations of 46 features.
Please refer to the source for more details about the features.
Enjoy!