Baselight

Data Catalog

Explore, analyze, and share quality data.

Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
1 option selected: Kaggle
Showing 8660 Datasets

Vu Trong Phung's Audio Novels

Audio files of novels and stories by Vu Trong Phung for Vietnamese TTS training

Other
8 months ago
1
2.92 kB
0

Compositional Freebase Questions

Compositional Freebase Questions dataset for measuring generalization

Other
8 months ago
16
69.53 MB
0

Portuguese Instruction

Enhancing Non-English Language Models with Portuguese Instruction

Other
8 months ago
2
19.93 MB
0

English/ MoroccanTamazight & Taqbaylit Translation

Translation dataset from mozilla's pontoon localization platform

Other
8 months ago
1
712.13 kB
0

California Room Books Collection

Collection of books housed in the California Room of the library

Media and Entertainment
8 months ago
1
276.5 kB
0

Extended Stanford Natural Language Inference

Annotated explanations for entailment relations in SNLI dataset

Other
8 months ago
3
40.71 MB
0

Explanation Dataset For Question Answering Systems

Reddit Q&A Dataset for Question Answering Systems

Ecommerce and Consumer Trends
8 months ago
9
833.46 MB
0

DROP: Benchmarking Comprehension And Reasoning

DROP Dataset: Evaluating Reading Comprehension and Reasoning Skills

Other
8 months ago
2
15.27 MB
0

CoEdIT Text Editing

A curated dataset for training text editing models

Technology and IT
8 months ago
2
10.44 MB
0

Comparisons Of WebGPT And OpenAI Models

A comparison between WebGPT and OpenAI models with metrics and answers provided

Other
8 months ago
1
156.18 MB
0

Vezora/Tested-188k-Python-Alpaca: Functional

188k Functional Python Code Samples

Other
8 months ago
1
20.83 MB
0

Vu Trong Phung Novels Audio Dataset

Vietnamese TTS Dataset: Novels by Vu Trong Phung

Other
8 months ago
1
2.92 kB
0

SNES Games

The games released for the SNES console with technical info

Other
8 months ago
5
93.05 kB
0

Comprehensive Conifer Sampling In North America

Evaluating Geographic and Niche-Based Sampling Strategies

Other
8 months ago
2
6.49 MB
0

Filmaffinity Reviews

An Insight into Movie Popularity

Media and Entertainment
8 months ago
1
1.12 MB
0

Housing Prices By Location

Why Location DOES Matter in Housing Prices

Finance and Economics
8 months ago
1
34 kB
0

XQuAD (Cross-lingual Q&A)

Cross-lingual Question & Answering

Other
8 months ago
12
3.15 MB
0

OpenAI HumanEval (Coding Challenges & Unit-tests)

164 programming problems with a function signature, docstring, body, unittests

Healthcare
8 months ago
1
85.24 kB
0

MathQA (Math Problems)

Learning to solve math problems

Other
8 months ago
3
10.53 MB
0

CommonGen (Generative Commonsense Reasoning)

Constrained text generation task, associated with a benchmark dataset

Other
8 months ago
3
3.35 MB
0

HellaSwag (Commonsense NLI)

Can a Machine Really Finish Your Sentence?

Other
8 months ago
3
35.9 MB
0

James Bond Movies

A Dataset of all the James Bond Movies

Media and Entertainment
8 months ago
6
34.6 kB
0

Temperature Over Time By State (Starts: 1895)

State and County Temperature Changes

Environmental and Climate Sciences
8 months ago
5
3.54 MB
0

Educational Youth Indicators

School Enrollment, Attendance, Achievement, and Engagement

Demographics and Population Studies
8 months ago
1
99.8 kB
0

Share link

Anyone who has the link will be able to view this.