[Updated 10/2022]
FIFA concerned: From FIFA17 to FIFA23
Dataset
The dataset contains +17k unique players and more than 60 columns, general information and all KPIs the famous videogame offers. As the esport scene keeps rising espacially on FIFA, I thought it can be useful for the community (kagglers and/or gamers)
Context
The data was retrieved thanks to a crawler that I implemented to retrieve:
- Aggregated data such as name of the players, age, country
- Detailed data such as offensive potential, defense, acceleration
I like football a lot and this dataset is for me the opportunity to bring my contribution for the realization of projects that can go from simple analysis to elaboration of strategies on optimal composition under constraints...
Acknowledgements
We wouldn't be here without the help of others. I would like to thanks @karangadiya who I got inspiration from, check his repo here !
FIFA19 dataset: https://www.kaggle.com/karangadiya/fifa19
FIFA18 dataset: https://www.kaggle.com/thec03u5/fifa-18-demo-player-dataset
More details on the crawler
I used beautifulsoup to scrap https://sofifa.com/. First, I scrap the main page to get all general information and then, I scraped each player's webpage that is associated. I defined a batch size so I can parallelize the retrieving of the data. Then I merge all dataframes and cleaned the merged one. I have only 4 CPU and defined 5 batches:
- Without batch: 5h12
- With batch: 1h39
If you have any question or suggestion, feel free to comment !
Last update
I added concatenation of all dataframes. !!! Disclaimer !!! Id column is no longer primary key. the primary key would be Id + source together