Baselight

Kaggle

Data Source

@kaggle

Kaggle hosts community and competition datasets across machine learning, research, public analytics, benchmarks, notebooks, metadata, and structured data projects.

Datasets

Total public datasets added

8,820

Rows

Total rows contributed

5,590,595,304

Popularity

Total times datasets used in queries

316

Stars

Total stars received

38

File Validation And Training Statistics

Validation, Training, and Testing Statistics for tasksource/leandojo Files

Other
1 year ago
3
27.84 MB
0

AI2 ARC - Advanced Science Question

Promoting research in advanced question-answering

Academic Research
1 year ago
6
1.25 MB
0

AI-Shift Ameba FAQ Search

Queries and difficulty levels for AI-based FAQ search

Technology and IT
1 year ago
3
111.43 kB
0

TruthfulQA: Benchmark For Evaluating Language

Evaluating truthfulness in language models' answers

Other
1 year ago
2
498.05 kB
0

Middletownbooks Joke Training

Jokes for training joke generation

Other
1 year ago
1
2.18 MB
0

Symbolic Correlation Dataset For LLMs

Exploring the Relationship between Knowledge and Language

Other
1 year ago
1
130.15 kB
0

Academic Research Essay Instructions

A Complete Guide for Successful Research Projects

Academic Research
1 year ago
1
12.33 MB
0

Legume-Rhizobium Mutualism Evolution

Multi and Single Strain Responses

Other
1 year ago
2
35.06 kB
0

US Renewable Energy Programs

Generation, States, Programs, Contacts, and More

Environmental and Climate Sciences
1 year ago
37
10.65 MB
0

Free WiFi To Monitor Flow In Hanoian Markets

Demographics, Timing and Frequency of Visitor Behaviour

Demographics and Population Studies
1 year ago
3
12.78 MB
0

Dietary Restriction Transgenerational Fitness

Three-Generational Study in *Caenorhabditis elegans*

Healthcare
1 year ago
5
173.73 kB
0

East Bay Housing Prices: Room Shares Vs Apartments

A Tale of Two Cities

Finance and Economics
1 year ago
1
16.58 kB
0

WikiSQL (Questions And SQL Queries)

80,654 hand-annotated questions and SQL queries on 24,241 Wikipedia tables

Other
1 year ago
3
38.25 MB
0

Avengers Character Appearances, Deaths

Assessing Gender, Status, and Resurrections

Demographics and Population Studies
1 year ago
1
29.08 kB
0

US Jobs On Dice.com

22,000 technology job listings

Technology and IT
1 year ago
1
30.81 MB
0

Real Estate Sales 730 Days

City of Hartford real estate sales for the past 2 years

Finance and Economics
1 year ago
1
299 kB
0

ASLG-PC12 (English-ASL Gloss Parallel Corpus 2012)

Interactions between Corpus and Lexicon LREC

Other
1 year ago
1
7.18 MB
0

SciFact (Scientific Claims)

1.4K Expert-Written Claims with Structured Annotations

Academic Research
1 year ago
4
4.69 MB
0

Salaries And Job Postings By Company In Australia

Uncovering Industry Trends and Analyzing Companies’ Salary Structures

Finance and Economics
1 year ago
1
34.75 MB
0

Italian Negation Constructions - Tweets

Exploring Language Variation Across 10 Cities

Ecommerce and Consumer Trends
1 year ago
1
21.13 kB
0

Remote Jobs In Spain

Analyzing Roles, Technologies, and Salaries in October 2020

Finance and Economics
1 year ago
1
1.31 MB
0

Reddit: /r/worldnews (Submissions & Comments)

Analyzing Post Engagement

Ecommerce and Consumer Trends
1 year ago
1
268.97 kB
0

Industrial Energy End Use In The U.S.

Facility-Level Combustion Energy Data

Environmental and Climate Sciences
1 year ago
2
1.16 MB
0

Landmark Detection For Tsetse Fly

Accurate Morphometric Data

Other
1 year ago
1
5.25 MB
0

NewChic Product Catalog (Customer Segmentation)

Trends, User Tastes, Brand Segmentation, and Online Shopping Opportunities

Finance and Economics
1 year ago
9
20.63 MB
0

US Tennis Courts: Capacity, Amenities, And

Discovering Court Types, Amenities, and Locations Across the US

Sports
1 year ago
1
1.28 MB
0

Airbnb Listings And Reviews In Washington, DC

Exploring Room Availability, Host Profiles, and Pricing Data

Finance and Economics
1 year ago
2
739.8 kB
0

London's Airbnb

Airbnb listings in London

Ecommerce and Consumer Trends
1 year ago
3
13.61 MB
0

Compounds For Studying Environmental Exposures

PubChemLite: Annotation Categories for Translational and Applied Research

Academic Research
1 year ago
1
73.82 MB
0

COCONUT: The COlleCtion Of Open NatUral ProducTs.

Unlocking Molecule Information

Finance and Economics
1 year ago
1
73.58 MB
0

Comprehensive Literary Greats Dataset

50,000+ Books Rated and Awarded Across Language, Genre, and Format

Media and Entertainment
1 year ago
1
42.19 MB
0

SONYC-UST Audio Tag Dataset

Annotated Real-World Urban Sounds for Multi-Label Audio Tag Prediction

Transportation and Logistics
1 year ago
1
822.85 kB
0

Game Boy Advance Games

Games released for Game Boy Advance

Other
1 year ago
3
91.05 kB
0

Game Boy Games

Games released for Game Boy

Other
1 year ago
3
62.08 kB
0

Google's M&A History

How Much, What For, and Where?

Other
1 year ago
12
138.39 kB
0

Submachine Guns

A dataset of known submachine gun models

Other
1 year ago
1
14.32 kB
0

Galaxy Clustering

Iris, Moon, and Circles datasets for Galaxy clustering tutorial

Other
1 year ago
3
20.43 kB
0

Lake Baikal Biomass (Decadal Change)

Investigating Climate Change-Driven Regime Shifts

Environmental and Climate Sciences
1 year ago
2
72.61 kB
0

Insects Flight Dynamics

Drosophila melanogaster, Isoleucinella rotunda, and Calopteron reticulatum

Transportation and Logistics
1 year ago
50
1.25 GB
0

Intel Processors

A Comprehensive Guide

Other
1 year ago
24
184.22 kB
0

The World's Highest Mountains

A Dataset of Peaks with at Least 500m Prominence

Other
1 year ago
3
15.28 kB
0

QASPER: NLP Questions And Evidence

Discovering Answers with Expertise

Other
1 year ago
3
27.9 MB
0

MultiNLI Textual Entailment Corpus

Multi-Genre Natural Language Inference (MultiNLI)

Other
1 year ago
3
215.15 MB
0

Cmrc2018 - Chinese Machine Reading Comprehension

Chinese MRC Dataset with Language Diversities

Other
1 year ago
3
5.48 MB
0

English-Darija Bilingual Text (Moroccan Arabic)

English-Darija Bilingual Corpus for Machine Translation

Other
1 year ago
1
23.28 MB
0

Erotiquant-XL

Enhanced erotica dataset with longer context samples

Other
1 year ago
1
99.99 MB
0

Synthia-v1.3

Synthetic training data for LLM development

Technology and IT
1 year ago
1
128.27 MB
0

Kubernetes Commands

kubectl commands and descriptions for Kubernetes

Other
1 year ago
1
3.65 MB
0

Cricket Commentary Dataset

Performance Validation for Cricket Commentary Model

Sports
1 year ago
3
6.95 MB
0

Text Classification For QA Dataset

Text classification dataset for question answering

Technology and IT
1 year ago
3
13.32 MB
0
