Baselight

Kaggle

Data Source

@kaggle

Kaggle hosts community and competition datasets across machine learning, research, public analytics, benchmarks, notebooks, metadata, and structured data projects.

Datasets

Total public datasets added

8,820

Rows

Total rows contributed

5,590,595,304

Popularity

Total times datasets used in queries

316

Stars

Total stars received

38

Yahoo Answers Topics Dataset

Yahoo Answers Topics Dataset: Questions and Answers for Various Topics

Other
1 year ago
2
525.17 MB
0

Friends TV Show Dialog Sequences

Friends TV Show Dialog Sequences

Media and Entertainment
1 year ago
1
638.07 kB
0

JFLEG: English Grammatical Error Benchmark

English Grammatical Error Correction Dataset

Other
1 year ago
2
290.36 kB
0

ARC: Grade School Science Questions

A Challenge for Advanced Question-Answering Research

Academic Research
1 year ago
6
1.25 MB
0

Nepali Health Q&A Corpus

Investigating Cultural Influences

Healthcare
1 year ago
1
7.69 MB
0

SQL Create Context

Uncovering Implications and Insights

Other
1 year ago
1
6.39 MB
0

OpenAI Summarization Corpus

Training and Validation Data from TL;DR, CNN, and Daily Mail

Other
1 year ago
4
68.93 MB
0

Anthropic Helpfulness-Harmlessness Preference

Iterative Human-in-the-Loop Solutions

Other
1 year ago
2
181.66 MB
0

Android Games

Games released for the Android OS

Other
1 year ago
1
16.74 kB
0

Alpaca Cleaned

Improving Pretrained Language Model Understanding

Other
1 year ago
1
23.83 MB
0

Humans Interaction Choice Rejection

Investigating Responses Through Selection and Rejection

Other
1 year ago
2
134.38 MB
0

UK Social Contact Network

Age, Gender, and Household Characteristics

Demographics and Population Studies
1 year ago
7
66.46 kB
0

European Alps Snow Depth Observations

Spatial and Long-term Trends 1971-2019

Other
1 year ago
20
66.96 MB
0

South Park Scripts Dataset

All the Words, All the Time

Other
1 year ago
20
6.39 MB
0

Yelp Reviews Sentiment Dataset

A Challenge for Natural Language Processing

Ecommerce and Consumer Trends
1 year ago
2
270.69 MB
0

TweetEval (Multi-task Classification Benchmark)

Irony, Hate, Offensive, Stance, Emoji, Emotion, and Sentiment

Ecommerce and Consumer Trends
1 year ago
33
13.83 MB
0

NLI-TR (Turkish NLI Research)

For training Turkish language models

Academic Research
1 year ago
6
76.03 MB
0

Antidepressant Use In Scandinavia

A Study of Population Characteristics and Drug Utilization Rates

Healthcare
1 year ago
3
91.29 kB
0

Predicting Pain Reliever Misuse/Abuse

An Exploration of Demographics, Medication Use and Illicit Drug Use

Healthcare
1 year ago
1
355.63 kB
0

Afeitadoras Hombres En Amazon

Prices, Ratings, and Customer Reviews

Ecommerce and Consumer Trends
1 year ago
1
42.06 kB
0

Esports Performance Rankings And Results

Performance Rankings and Results from Multiple Esports Platforms

Sports
1 year ago
78
399.23 kB
0

Predicting The Energy Market's Day Ahead Prices

Analyses of Energy Systems and Prices

Finance and Economics
1 year ago
1
11.65 MB
0

Amazon Customer Reviews With Sentiment

Extracting Insights from Product Ratings

Finance and Economics
1 year ago
1
6.73 MB
0

Analyzing Success Factors For Influencers

Influencers: Customer-to-Customer E-Commerce

Ecommerce and Consumer Trends
1 year ago
2
125.7 kB
0

UK International Migration

Trends and Patterns within the UK

Demographics and Population Studies
1 year ago
7
28.56 MB
0

College Completion And Efficiency Measures For US

Rates, Awards, Expenditures, and Outcomes

Demographics and Population Studies
1 year ago
4
12.81 MB
0

The Premier League

Analyzing Trends and Outcomes

Sports
1 year ago
27
2 MB
0

Miss America Titleholders

Miss America over the years

Other
1 year ago
2
29.16 kB
0

Uber And Lyft Drivers Carjackings

Dataset from: "Uber And Lyft Drivers Are Being Carjacked at Alarming Rates"

Transportation and Logistics
1 year ago
1
45.05 kB
0

Vehicle Data Collection Industry

Data From: "Who Is Collecting Data from Your Car?"

Transportation and Logistics
1 year ago
1
21.1 kB
0

Games By Ubisoft

Games released by Ubisoft

Other
1 year ago
1
8.82 kB
0

Rocket Launch Sites

Sites used for rocket launches

Other
1 year ago
16
142.57 kB
0

Mammal Species & Taxonomic Changes

Taxonomic Changes & Type Specimen Metadata

Other
1 year ago
4
2.17 MB
0

Urban Ecology Over Time

Exploring Camera Traps, Scans and Surveys

Technology and IT
1 year ago
6
105.16 kB
0

Investment Trends In Indian Startups

An Exploration of Indian Startup Funding Rounds

Finance and Economics
1 year ago
2
351.65 kB
0

Predicting The Financial Health Of Insurance Firms

Model and predict the financial health of any insurance firm

Finance and Economics
1 year ago
25
835.1 kB
0

CoEdIT

Enhancing AI Text Editing Through 69,000 Instances

Technology and IT
1 year ago
2
10.44 MB
0

Evol Codealpaca V1

An Innovative Augmentation Strategy for NLP

Other
1 year ago
1
135.42 MB
0

Web-Harvested Image And Caption Dataset

Web-Harvested Image and Caption Dataset

Other
1 year ago
2
361.36 MB
0

NER Tagged Text Dataset

NER Tagged Text Dataset

Other
1 year ago
3
174.25 MB
0

Assembly Shellcode Dataset

The Largest Collection of Linux Assembly Shellcodes

Politics and Governance
1 year ago
3
106.4 kB
0

Short Jokes Dataset

Humorous Short Jokes

Other
1 year ago
1
15.95 MB
0

United States Baby Names Count

United States Baby Names Dataset

Demographics and Population Studies
1 year ago
3
56.63 MB
0

B Corporations Impact Data

Social and environmental impact data of Certified B Corporations worldwide

Environmental and Climate Sciences
1 year ago
1
5.93 MB
0

IMDb Movie Review Sentiment

Movie Review Sentiment

Ecommerce and Consumer Trends
1 year ago
3
82.87 MB
0

WebGL Model-based QA

WebGL Model-based Questions and Answering

Other
1 year ago
3
70.35 MB
0

Korean Translation Dataset For NLP Models

Translated Instructions and Input-Output Pairs in Korean

Other
1 year ago
1
125.95 MB
0

Cricket Commentary Analysis

Text Classification and Natural Language Processing for Commentary Insights

Technology and IT
1 year ago
3
6.95 MB
0

BSARD: French Belgian Law Dataset For IR

Retrieving Relevant Statutes for Legal Questions

Other
1 year ago
3
4.01 MB
0

LongAlpaca 12K

LongAlpaca - Generating instruct datasets from language models (longform)

Other
1 year ago
1
265.85 MB
0
