Kaggle Dataset Metadata Repository
Comprehensive Metadata for Kaggle Datasets Including Owner, Usage, and Licensing
@kaggle.ijajdatanerd_kaggle_dataset_metadata_repository
This dataset provides comprehensive metadata on various Kaggle datasets, offering detailed information about the dataset owners, creators, usage statistics, licensing, and more. It can help researchers, data scientists, and Kaggle enthusiasts quickly analyze the key attributes of different datasets on Kaggle.
datasetUrl: The URL of the dataset's page on Kaggle.
ownerAvatarUrl: The URL of the dataset owner's profile avatar on Kaggle.
ownerName: The name of the dataset owner. This can be the individual or organization that created and maintains the dataset.
ownerUrl: A link to the Kaggle profile page of the dataset owner.
ownerUserId: The unique user ID of the dataset owner on Kaggle.
ownerTier: The owner's tier (such as "Tier 1" or "Tier 2"), indicating the owner's status or level on Kaggle.
creatorName: The name of the dataset creator, which could be different from the owner.
creatorUrl: A link to the Kaggle profile page of the dataset creator.
creatorUserId: The unique user ID of the dataset creator.
scriptCount: The number of scripts (kernels) associated with this dataset.
scriptsUrl: A link to the scripts (kernels) page for the dataset, where you can explore related code.
forumUrl: The URL to the discussion forum for this dataset, where users can ask questions and share insights.
viewCount: The number of views the dataset page has received on Kaggle.
downloadCount: The number of times the dataset has been downloaded by users.
dateCreated: The date when the dataset was first created and uploaded to Kaggle.
dateUpdated: The date when the dataset was last updated or modified.
voteButton: The metadata for the dataset's vote button, showing how users interact with the dataset's quality ratings.
categories: The categories or tags associated with the dataset, helping users filter datasets based on topics of interest (e.g., "Healthcare," "Finance").
licenseName: The name of the license under which the dataset is shared (e.g., "CC0," "MIT License").
licenseShortName: A short form or abbreviation of the dataset's license name (e.g., "CC0" for Creative Commons Zero).
datasetSize: The size of the dataset in terms of storage, typically measured in MB or GB.
commonFileTypes: A list of common file types included in the dataset (e.g., .csv, .json, .xlsx).
downloadUrl: A direct link to download the dataset files.
newKernelNotebookUrl: A link to a new kernel or notebook related to this dataset, for those who wish to explore it programmatically.
newKernelScriptUrl: A link to a new script for running computations or processing data related to the dataset.
usabilityRating: A rating or score representing how usable the dataset is, based on user feedback.
firestorePath: A reference to the path in Firestore where this dataset's metadata is stored.
datasetSlug: A URL-friendly version of the dataset name, typically used for URLs.
rank: The dataset's rank based on certain metrics (e.g., downloads, votes, views).
datasource: The source or origin of the dataset (e.g., government data, private organizations).
medalUrl: A URL pointing to the dataset's medal or badge, indicating the dataset's quality or relevance.
hasHashLink: Indicates whether the dataset has a hash link for verifying data integrity.
ownerOrganizationId: The unique organization ID of the dataset's owner if the owner is an organization rather than an individual.
totalVotes: The total number of votes the dataset has received from users, reflecting its popularity or quality.
category_names: A comma-separated string of category names that represent the dataset's classification.
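Because category_names is described as a single comma-separated string, it usually has to be split before categories can be counted or filtered. The following is a minimal sketch, assuming the delimiter is a plain comma and that empty entries can be dropped. The sample rows and the helper name split_categories are invented for illustration and are not taken from the dataset.

```python
from collections import Counter


def split_categories(raw):
    """Split a comma-separated category_names value into a clean list."""
    if not raw:
        return []
    # Trim whitespace around each name and drop empty entries.
    return [name.strip() for name in raw.split(",") if name.strip()]


# Hypothetical rows; real values in the dataset may look different.
rows = [
    {"datasetSlug": "example-a", "category_names": "Healthcare, Finance"},
    {"datasetSlug": "example-b", "category_names": "Finance"},
    {"datasetSlug": "example-c", "category_names": ""},
]

# Count how many rows mention each category.
category_counts = Counter(
    name for row in rows for name in split_categories(row["category_names"])
)
print(category_counts.most_common())
```

If category names in the real data can themselves contain commas, a plain split would break them apart, so the delimiter assumption is worth checking against a few rows first.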
This dataset is a valuable resource for those who want to analyze Kaggle's ecosystem, discover high-quality datasets, and explore metadata in a structured way.
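As one illustration of such an analysis, viewCount and downloadCount can be combined into a downloads-per-view ratio and used to order datasets. This is only a sketch, assuming the metadata is available as CSV text and that both counts parse as integers. The sample rows and the helper name download_rate are hypothetical.

```python
import csv
import io

# Hypothetical CSV excerpt using column names from the field list.
SAMPLE_CSV = """datasetSlug,viewCount,downloadCount,totalVotes
example-a,1000,250,12
example-b,400,200,30
example-c,0,0,0
"""


def download_rate(row):
    """Return downloads per view, or 0.0 when the page has no views."""
    views = int(row["viewCount"])
    downloads = int(row["downloadCount"])
    return downloads / views if views else 0.0


# Parse the CSV text into dictionaries keyed by column name.
rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

# Sort from the highest download rate to the lowest.
ranked = sorted(rows, key=download_rate, reverse=True)

for row in ranked:
    print(row["datasetSlug"], round(download_rate(row), 2))
```

The zero-view guard avoids a division error for pages that have never been viewed; whether such rows exist in the real data is not stated in the description.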