Baselight

Data Catalog

Explore, analyze, and share quality data.

Selected filter: Kaggle
Showing 8660 Datasets

Major League Baseball Game Logs

Historical MLB Game Logs and Player Statistics from 1871-2016

Sports
8 months ago
1
21.42 MB
0

Open Subtitles Multilingual Translation

Train Sequential Neural Networks in Nine Languages

Other
8 months ago
5
641.5 MB
0

Blended Skill Talk

Personality, Empathy, and Knowledge

Other
8 months ago
3
62.47 MB
0

LongAlpaca 16K-Length

Investigating Natural Language Processing Performance

Other
8 months ago
1
125.31 MB
0

Large-Scale Preference Dataset

Training Powerful Reward & Critic Models with Aligned Language Models

Other
8 months ago
1
361.39 MB
0
8 months ago
2
191.59 MB
0

Tamazight-NLP/Pontoon-Translations: Source-Target

Tamazight Translation Dataset: Source-Target Sentences for NLP

Other
8 months ago
1
3.47 MB
0

Yahoo Answers Topics Dataset

Yahoo Answers Topics Dataset: Questions and Answers for Various Topics

Other
8 months ago
2
525.17 MB
0

Friends TV Show Dialog Sequences

Friends TV Show Dialog Sequences

Media and Entertainment
8 months ago
1
638.07 kB
0

JFLEG: English Grammatical Error Benchmark

English Grammatical Error Correction Dataset

Other
8 months ago
2
290.36 kB
0

ARC: Grade School Science Questions

A Challenge for Advanced Question-Answering Research

Academic Research
8 months ago
6
1.25 MB
0

Nepali Health Q&A Corpus

Investigating Cultural Influences

Healthcare
8 months ago
1
7.69 MB
0

SQL Create Context

Uncovering Implications and Insights

Other
8 months ago
1
6.39 MB
0

OpenAI Summarization Corpus

Training and Validation Data from TL;DR, CNN, and Daily Mail

Other
8 months ago
4
68.93 MB
0

Anthropic Helpfulness-Harmlessness Preference

Iterative Human-in-the-Loop Solutions

Other
8 months ago
2
181.66 MB
0

Android Games

Games released for the Android OS

Other
8 months ago
1
16.74 kB
0

Alpaca Cleaned

Improving Pretrained Language Model Understanding

Other
8 months ago
1
23.83 MB
0

Humans Interaction Choice Rejection

Investigating Responses Through Selection and Rejection

Other
8 months ago
2
134.38 MB
0

UK Social Contact Network

Age, Gender, and Household Characteristics

Demographics and Population Studies
8 months ago
7
66.46 kB
0

European Alps Snow Depth Observations

Spatial and Long-term Trends 1971-2019

Other
8 months ago
20
66.96 MB
0

South Park Scripts Dataset

All the Words, All the Time

Other
8 months ago
20
6.39 MB
0

Yelp Reviews Sentiment Dataset

A Challenge for Natural Language Processing

Ecommerce and Consumer Trends
8 months ago
2
270.69 MB
0

TweetEval (Multi-task Classification Benchmark)

Irony, Hate, Offensive, Stance, Emoji, Emotion, and Sentiment

Ecommerce and Consumer Trends
8 months ago
33
13.83 MB
0

NLI-TR (Turkish NLI Research)

For training Turkish language models

Academic Research
8 months ago
6
76.03 MB
0
