Baselight

Cricket Data From 2000 To 2020 For South Africa

Cricket matches, players, innings, bat, bowl, venues, etc

@kaggle.bizzyvinci_south_africa_cricket_data_from_2000_to_2020

About this Dataset

Cricket Data From 2000 To 2020 For South Africa

I was looking for something challenging and I stumble upon a journal titled 'INCREASED PREDICTION ACCURACY IN THE GAME OF
CRICKET USING MACHINE LEARNING' and decided to implement the project. The first aspect is understanding the journal and cricket 😁.

The next aspect is getting the data. I wrote scripts to scrap espncricinfo. The scripts and journal can be found on github.

The data contains matches played by South Africa from 2000 to 2020 and can be used for prediction as well as exploratory analysis. It consists of 6 tables:

  • Mat: Details of matches played
  • Bat: Details of each appearance of a batter
  • Bowl: Details of each appearance of a bowler
  • Ground: Details about venues
  • Opposition: ID, name and rating of opposing teams
  • Players: Details of each player

Note: Nan values are represented as -99. Also, match is commonly regarded as mat.

Share link

Anyone who has the link will be able to view this.