Baselight

Data Catalog

Explore, analyze, and share quality data.

Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
Showing 58957 Datasets

Reddit: /r/stocks

Analyzing User Engagement to Identify Market Trends

Finance and Economics
10 months ago
1
936.05 kB
0

Reddit: /r/Tinder

Examining User Behaviors and Attitudes

Ecommerce and Consumer Trends
10 months ago
1
251.63 kB
0

Reddit: /r/CryptoCurrency

Posts, Scores, Comment Counts and Creation Timestamps

Crypto and Blockchain
10 months ago
1
502.18 kB
0

Reddit: /r/NotTheOnion

Discriminating Truth and Satire

Ecommerce and Consumer Trends
10 months ago
1
259.59 kB
0

Fishing Trajectories (AIS)

A Pre-Labelled Dataset for Semantic Segmentation

Other
10 months ago
1
86.05 MB
0

Barcelona Airbnb Listings

Understanding Rental Prices and Trends in the City

Finance and Economics
10 months ago
1
30.29 MB
0

San Diego Airbnb Listings

Location, Amenities and Reviews

Ecommerce and Consumer Trends
10 months ago
1
22.14 MB
0

AAPL On Reddit

Discussions about AAPL on reddit

Ecommerce and Consumer Trends
10 months ago
2
69.9 MB
0

Physical Gene Regulatory Networks In C.elegans

239,001 Regulatory Interactions from 289 Wild-type Young Adult Datasets

Healthcare
10 months ago
1
10.03 kB
0

East African News Classification

Classifying Text Content Across East Africa

Media and Entertainment
10 months ago
3
68.02 MB
0

SSSniperWolf's Tweets

An Insight Into Popular Influencer Engagement Patterns

Ecommerce and Consumer Trends
10 months ago
1
4.47 MB
0

Oscar-Winning Movies Gross Revenue

Exploring the Relationship Between Financial Success and Critical Acclaim

Finance and Economics
10 months ago
1
4.47 kB
0

Acquiring Pragmalinguistic Competences Through

Investigating Language Acquisition Through Computer-Mediated Communication

Technology and IT
10 months ago
2
189.79 kB
0

Movies Recommendations System Data

A Corpus of Wikipedia Information

Media and Entertainment
10 months ago
3
514.29 MB
0

Global River Obstruction

30549 manually identified human-made structures that obstructing rivers

Other
10 months ago
2
2.52 MB
0

GoldGloveTV Tweets

Measuring Engagement and Popularity

Ecommerce and Consumer Trends
10 months ago
1
10.68 MB
0

Audience Response To Influencer Questions

Analyzing Audience Interactions with Influencer Queries

Other
10 months ago
1
62.88 kB
0

Vu Trong Phung's Audio Novels

Audio files of novels and stories by Vu Trong Phung for Vietnamese TTS training

Other
10 months ago
1
2.92 kB
0

Compositional Freebase Questions

Compositional Freebase Questions dataset for measuring generalization

Other
10 months ago
16
69.53 MB
0

Portuguese Instruction

Enhancing Non-English Language Models with Portuguese Instruction

Other
10 months ago
2
19.93 MB
0

English/ MoroccanTamazight & Taqbaylit Translation

Translation dataset from mozilla's pontoon localization platform

Other
10 months ago
1
712.13 kB
0

California Room Books Collection

Collection of books housed in the California Room of the library

Media and Entertainment
10 months ago
1
276.5 kB
0

Extended Stanford Natural Language Inference

Annotated explanations for entailment relations in SNLI dataset

Other
10 months ago
3
40.71 MB
0

Explanation Dataset For Question Answering Systems

Reddit Q&A Dataset for Question Answering Systems

Ecommerce and Consumer Trends
10 months ago
9
833.46 MB
0

DROP: Benchmarking Comprehension And Reasoning

DROP Dataset: Evaluating Reading Comprehension and Reasoning Skills

Other
10 months ago
2
15.27 MB
0

CoEdIT Text Editing

A curated dataset for training text editing models

Technology and IT
10 months ago
2
10.44 MB
0

Comparisons Of WebGPT And OpenAI Models

A comparison between WebGPT and OpenAI models with metrics and answers provided

Other
10 months ago
1
156.18 MB
0

Vezora/Tested-188k-Python-Alpaca: Functional

188k Functional Python Code Samples

Other
10 months ago
1
20.83 MB
0

Vu Trong Phung Novels Audio Dataset

Vietnamese TTS Dataset: Novels by Vu Trong Phung

Other
10 months ago
1
2.92 kB
0

SNES Games

The games released for the SNES console with technical info

Other
10 months ago
5
93.05 kB
0

Comprehensive Conifer Sampling In North America

Evaluating Geographic and Niche-Based Sampling Strategies

Other
10 months ago
2
6.49 MB
0

Filmaffinity Reviews

An Insight into Movie Popularity

Media and Entertainment
10 months ago
1
1.12 MB
0

Housing Prices By Location

Why Location DOES Matter in Housing Prices

Finance and Economics
10 months ago
1
34 kB
0

XQuAD (Cross-lingual Q&A)

Cross-lingual Question & Answering

Other
10 months ago
12
3.15 MB
0

OpenAI HumanEval (Coding Challenges & Unit-tests)

164 programming problems with a function signature, docstring, body, unittests

Healthcare
10 months ago
1
85.24 kB
0

MathQA (Math Problems)

Learning to solve math problems

Other
10 months ago
3
10.53 MB
0

CommonGen (Generative Commonsense Reasoning)

Constrained text generation task, associated with a benchmark dataset

Other
10 months ago
3
3.35 MB
0

HellaSwag (Commonsense NLI)

Can a Machine Really Finish Your Sentence?

Other
10 months ago
3
35.9 MB
0

James Bond Movies

A Dataset of all the James Bond Movies

Media and Entertainment
10 months ago
6
34.6 kB
0

Temperature Over Time By State (Starts: 1895)

State and County Temperature Changes

Environmental and Climate Sciences
10 months ago
5
3.54 MB
0

Educational Youth Indicators

School Enrollment, Attendance, Achievement, and Engagement

Demographics and Population Studies
10 months ago
1
99.8 kB
0

AslgPc12 (English-ASL Gloss Parallel Corpus 2012)

Synthetic English-ASL Gloss Parallel Corpus 2012

Other
10 months ago
1
7.18 MB
0

LinCE (Linguistic Code-switching Evaluation)

Data for training and evaluating NLP systems on code-switching tasks

Other
10 months ago
30
18.42 MB
0

CoQA (Conversational Question Answering)

127k Questions With Answers, 8k Conversations About Text From Seven Domains.

Other
10 months ago
2
12.58 MB
0

Psychiatric Comorbidity In Galicia

Examining the Prevalence of Dual Diagnosis in Addiction Assistance Units

Healthcare
10 months ago
1
79.89 kB
0

Hemibrain Neuronal Connectome

Olfactory and Thermo/Hygrosensory Processing

Other
10 months ago
9
1.46 MB
0

Young Swiss Men And Substance Use Disorders

Examining Associations with Mental Health and Co-Occurring Addictions

Healthcare
10 months ago
2
838.43 kB
0

Weather Prediction In Argentina

Daily Summaries of Base Stations for the Last 5 Years

Environmental and Climate Sciences
10 months ago
1
263.04 kB
0

Weather In Egypt (Daily Resolution)

Monitoring Weather and Climate Conditions

Environmental and Climate Sciences
10 months ago
1
16.7 kB
0

Prices & Characteristics Of Spanish Homes

Uncovering Market Trends in Spain

Finance and Economics
10 months ago
1
84.29 MB
0

Share link

Anyone who has the link will be able to view this.