Baselight
Sign In

Datasets

Total public datasets added

8,800

Rows

Total rows contributed

5,589,816,505

Popularity

Total times datasets used in queries

307

Stars

Total stars received

37

Acquiring Pragmalinguistic Competences Through

Investigating Language Acquisition Through Computer-Mediated Communication

Technology and IT
1 year ago
2
189.79 kB
0

Movies Recommendations System Data

A Corpus of Wikipedia Information

Media and Entertainment
1 year ago
3
514.29 MB
0

Global River Obstruction

30549 manually identified human-made structures that obstructing rivers

Other
1 year ago
2
2.52 MB
0

GoldGloveTV Tweets

Measuring Engagement and Popularity

Ecommerce and Consumer Trends
1 year ago
1
10.68 MB
0

Audience Response To Influencer Questions

Analyzing Audience Interactions with Influencer Queries

Other
1 year ago
1
62.88 kB
0

Vu Trong Phung's Audio Novels

Audio files of novels and stories by Vu Trong Phung for Vietnamese TTS training

Other
1 year ago
1
2.92 kB
0

Compositional Freebase Questions

Compositional Freebase Questions dataset for measuring generalization

Other
1 year ago
16
69.53 MB
0

Portuguese Instruction

Enhancing Non-English Language Models with Portuguese Instruction

Other
1 year ago
2
19.93 MB
0

English/ MoroccanTamazight & Taqbaylit Translation

Translation dataset from mozilla's pontoon localization platform

Other
1 year ago
1
712.13 kB
0

California Room Books Collection

Collection of books housed in the California Room of the library

Media and Entertainment
1 year ago
1
276.5 kB
0

Extended Stanford Natural Language Inference

Annotated explanations for entailment relations in SNLI dataset

Other
1 year ago
3
40.71 MB
0

Explanation Dataset For Question Answering Systems

Reddit Q&A Dataset for Question Answering Systems

Ecommerce and Consumer Trends
1 year ago
9
833.46 MB
0

DROP: Benchmarking Comprehension And Reasoning

DROP Dataset: Evaluating Reading Comprehension and Reasoning Skills

Other
1 year ago
2
15.27 MB
0

CoEdIT Text Editing

A curated dataset for training text editing models

Technology and IT
1 year ago
2
10.44 MB
0

Comparisons Of WebGPT And OpenAI Models

A comparison between WebGPT and OpenAI models with metrics and answers provided

Other
1 year ago
1
156.18 MB
0

Vezora/Tested-188k-Python-Alpaca: Functional

188k Functional Python Code Samples

Other
1 year ago
1
20.83 MB
0

Vu Trong Phung Novels Audio Dataset

Vietnamese TTS Dataset: Novels by Vu Trong Phung

Other
1 year ago
1
2.92 kB
0

SNES Games

The games released for the SNES console with technical info

Other
1 year ago
5
93.05 kB
0

Comprehensive Conifer Sampling In North America

Evaluating Geographic and Niche-Based Sampling Strategies

Other
1 year ago
2
6.49 MB
0

Filmaffinity Reviews

An Insight into Movie Popularity

Media and Entertainment
1 year ago
1
1.12 MB
0

Housing Prices By Location

Why Location DOES Matter in Housing Prices

Finance and Economics
1 year ago
1
34 kB
0

XQuAD (Cross-lingual Q&A)

Cross-lingual Question & Answering

Other
1 year ago
12
3.15 MB
0

OpenAI HumanEval (Coding Challenges & Unit-tests)

164 programming problems with a function signature, docstring, body, unittests

Healthcare
1 year ago
1
85.24 kB
0

MathQA (Math Problems)

Learning to solve math problems

Other
1 year ago
3
10.53 MB
0

CommonGen (Generative Commonsense Reasoning)

Constrained text generation task, associated with a benchmark dataset

Other
1 year ago
3
3.35 MB
0

HellaSwag (Commonsense NLI)

Can a Machine Really Finish Your Sentence?

Other
1 year ago
3
35.9 MB
0

James Bond Movies

A Dataset of all the James Bond Movies

Media and Entertainment
1 year ago
6
34.6 kB
0

Temperature Over Time By State (Starts: 1895)

State and County Temperature Changes

Environmental and Climate Sciences
1 year ago
5
3.54 MB
0

Educational Youth Indicators

School Enrollment, Attendance, Achievement, and Engagement

Demographics and Population Studies
1 year ago
1
99.8 kB
0

AslgPc12 (English-ASL Gloss Parallel Corpus 2012)

Synthetic English-ASL Gloss Parallel Corpus 2012

Other
1 year ago
1
7.18 MB
0

LinCE (Linguistic Code-switching Evaluation)

Data for training and evaluating NLP systems on code-switching tasks

Other
1 year ago
30
18.42 MB
0

CoQA (Conversational Question Answering)

127k Questions With Answers, 8k Conversations About Text From Seven Domains.

Other
1 year ago
2
12.58 MB
0

Psychiatric Comorbidity In Galicia

Examining the Prevalence of Dual Diagnosis in Addiction Assistance Units

Healthcare
1 year ago
1
79.89 kB
0

Hemibrain Neuronal Connectome

Olfactory and Thermo/Hygrosensory Processing

Other
1 year ago
9
1.46 MB
0

Young Swiss Men And Substance Use Disorders

Examining Associations with Mental Health and Co-Occurring Addictions

Healthcare
1 year ago
2
838.43 kB
0

Weather Prediction In Argentina

Daily Summaries of Base Stations for the Last 5 Years

Environmental and Climate Sciences
1 year ago
1
263.04 kB
0

Weather In Egypt (Daily Resolution)

Monitoring Weather and Climate Conditions

Environmental and Climate Sciences
1 year ago
1
16.7 kB
0

Prices & Characteristics Of Spanish Homes

Uncovering Market Trends in Spain

Finance and Economics
1 year ago
1
84.29 MB
0

Euro Zone Energy Prices

Household and Industrial Sectors from 2017-2021

Finance and Economics
1 year ago
1
23.11 kB
0

Greek Household Energy Consumption

Socio-Economic, Demographic, and Housing Characteristics, 2004-2020

Finance and Economics
1 year ago
1
6.03 MB
0

Crypto Trading And Technical Indicators

Understanding the Market Dynamics of 600 Popular Cryptocurrencies

Finance and Economics
1 year ago
1
179.31 kB
0

Job Opportunities In Ecuador

Analyzing Employment Characteristics and Trends (2013-2020)

Demographics and Population Studies
1 year ago
1
3.43 MB
0

AI-Based Job Site Matching

Leveraging 400k+ Hours of Resource & Performance Data

Technology and IT
1 year ago
3
2.35 MB
0

Mr-beast Transcribed YouTube Videos

All of Mr-beast's videos: Transcribed using OpenAI's whisper

Media and Entertainment
1 year ago
1
2.07 MB
0

Reddit: /r/DIY

Analyzing User-Generated Content and Interactions

Ecommerce and Consumer Trends
1 year ago
1
662.77 kB
0

Uncovering Popularity And Sentiment Around Books

Exploring User's Preferences and Behaviors

Ecommerce and Consumer Trends
1 year ago
1
868.01 kB
0

Reddit: /r/movies (Submissions & Comments)

Analyzing User Interaction and User Feedback

Ecommerce and Consumer Trends
1 year ago
1
620.04 kB
0

GoT Characters Screen Time

How Long did Characters Spend on Screen?

Other
1 year ago
1
16.35 kB
0

Energy Consumption Of United States Over Time

Building Energy Data Book

Environmental and Climate Sciences
1 year ago
1
420.22 kB
0

Ecological Footprint And National Biocapacity

Exploring Global Carbon, Production and Consumption Data

Finance and Economics
1 year ago
2
3.42 MB
0

Share link

Anyone who has the link will be able to view this.