Context
To run statistical analysis on cricket players. I had to find a dataset which contains large amount of player data. And accurate. ESPN CRICINFO has been providing good information on cricket players for a long time. As such the data here are quite accurate. And the final filtering made it clean of any unnecessary or irreverent data.
Content
To acquire the data, since there is no WEB API for cricinfo I had to scrape there website. only player portion though. Various player has various kind of achievement. For current time ODI, Tests and T20s have more appeal than other 2 types there (ListA and First class). After scrapping 56000 website of 56000 players only 41000 returned data. But in those 41000 some were lacking either type of games, or some different values of column. So I chopped them down to same number and types of columns running scripts thoroughly. And checking random result with the website to make sure they are good. Time period for the data is around 1990 to 2017
Acknowledgements
This dataset is made and modified and filtered by me. I am a novice in data science and web scrapping. Hence much data may or may not be accurate even if they seemed to me.
Hope somehow it helps someone.