Context
Kaggle dataset becomes a popular growing place to share datasets. Almost every day there will be new datasets uploaded. I am curious to explore what can be extracted from the information of each dataset.
Content
This dataset consists 2150 datasets information in 15 columns:
-
Title
-
Subtitle
-
Owner
-
Vote
-
Version History
-
Tags
-
Datatype
-
Size
-
License
-
Views
-
Downloads
-
Kernels
-
Topics
-
URL
-
Description
Acknowledgements
All data were taken from Kaggle website. Collected on 26 Feb 2018
Inspiration
With this dataset, we may try to predict the upcoming datasets uploaded, including its topics, number of votes, number of downloads, etc. Data visualization involving clustering may be performed also.