The data was downloaded in JSON format from cricsheet.org, a comprehensive cricket data website. I used the pandas Python library to transform the match-level data into ball-by-ball data with several relevant fields, so that it can be used to train regression models and similar predictive models.
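As a rough illustration of that transformation, here is a minimal sketch of flattening one match into ball-by-ball rows. It assumes the match JSON follows the nested innings → overs → deliveries layout. The function name `flatten_match`, the output column names, and the sample match (teams and players) are hypothetical and are not necessarily the fields used in this dataset.

```python
import pandas as pd


def flatten_match(match: dict) -> pd.DataFrame:
    """Turn one match dict into one row per delivery (hypothetical helper)."""
    rows = []
    for innings_no, innings in enumerate(match.get("innings", []), start=1):
        for over in innings.get("overs", []):
            deliveries = over.get("deliveries", [])
            for ball_no, delivery in enumerate(deliveries, start=1):
                runs = delivery.get("runs", {})
                rows.append(
                    {
                        "innings": innings_no,
                        "batting_team": innings.get("team"),
                        "over": over.get("over"),
                        "ball": ball_no,
                        "batter": delivery.get("batter"),
                        "bowler": delivery.get("bowler"),
                        "runs_batter": runs.get("batter", 0),
                        "runs_extras": runs.get("extras", 0),
                        "runs_total": runs.get("total", 0),
                        # 1 if at least one wicket fell on this delivery
                        "wicket": int(bool(delivery.get("wickets"))),
                    }
                )
    return pd.DataFrame(rows)


# Made-up sample in the assumed layout; a real file would be read with json.load.
sample_match = {
    "innings": [
        {
            "team": "Team A",
            "overs": [
                {
                    "over": 0,
                    "deliveries": [
                        {"batter": "Batter A", "bowler": "Bowler X",
                         "runs": {"batter": 1, "extras": 0, "total": 1}},
                        {"batter": "Batter B", "bowler": "Bowler X",
                         "runs": {"batter": 0, "extras": 0, "total": 0},
                         "wickets": [{"player_out": "Batter B", "kind": "bowled"}]},
                        {"batter": "Batter C", "bowler": "Bowler X",
                         "runs": {"batter": 4, "extras": 0, "total": 4}},
                    ],
                }
            ],
        }
    ]
}

balls = flatten_match(sample_match)
print(balls)
```

Building a list of plain dicts and constructing the DataFrame once at the end is usually faster than appending to a DataFrame row by row. Per-match frames can then be combined with `pd.concat`.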
This dataset was created as part of a project in which I developed metrics to rank players in T20 Internationals and the Indian Premier League (IPL). All of the project materials can be found at https://github.com/jamiewelsh25/Cricket_Data_Project/
The notebooks below explore predicting second-innings chase success and first-innings scores. I also build a model to evaluate batters, bowlers and all-rounders using a Runs Added Over Average Player metric.