Upvoted Kaggle Datasets
2150 Kaggle datasets with at least two votes
@kaggle.canggih_voted_kaggle_dataset
2150 Kaggle datasets with at least two votes
@kaggle.canggih_voted_kaggle_dataset
Kaggle dataset becomes a popular growing place to share datasets. Almost every day there will be new datasets uploaded. I am curious to explore what can be extracted from the information of each dataset.
This dataset consists 2150 datasets information in 15 columns:
Title
Subtitle
Owner
Vote
Version History
Tags
Datatype
Size
License
Views
Downloads
Kernels
Topics
URL
Description
All data were taken from Kaggle website. Collected on 26 Feb 2018
With this dataset, we may try to predict the upcoming datasets uploaded, including its topics, number of votes, number of downloads, etc. Data visualization involving clustering may be performed also.
CREATE TABLE voted_kaggle_dataset (
"title" VARCHAR,
"subtitle" VARCHAR,
"owner" VARCHAR,
"votes" BIGINT,
"versions" VARCHAR,
"tags" VARCHAR,
"data_type" VARCHAR,
"size" VARCHAR,
"license" VARCHAR,
"views" VARCHAR,
"download" VARCHAR,
"kernels" VARCHAR,
"topics" VARCHAR,
"url" VARCHAR,
"description" VARCHAR
);Anyone who has the link will be able to view this.