Baselight

Data Catalog

Explore, analyze, and share quality data.

Showing 58957 Datasets

Movie Rationales (Rationales For Movie Reviews)

Human annotated rationales for movie reviews

Media and Entertainment
10 months ago
3
5.53 MB
0

International Apple Pricing Strategy

Understanding Apple Product Prices in Relation to Local Salaries Worldwide

Finance and Economics
10 months ago
3
41.14 kB
0

Finnish Basic Education Teacher ICT Skills And

Age, Gender, Self-Efficacy, In-Service Training, and Urbanization Level

Demographics and Population Studies
10 months ago
1
28.7 kB
0

Job Trends In Australia

Insight into Job Types, Salaries and Growth Across States

Finance and Economics
10 months ago
1
34.57 MB
0

Age Prediction Using Genomic Information

Genomic data for age prediction

Demographics and Population Studies
10 months ago
1
17.8 kB
0

Social Media Engagement Levels For YouTube Tweets

Analyzing Likes, Retweets, and Conversations

Ecommerce and Consumer Trends
10 months ago
1
6.45 MB
0

CIFAR-10: Color Images, 10 Classes

CIFAR-10: Color Images, 10 Classes

Other
10 months ago
2
272.69 MB
0

DistillChat V1: Mixture Of Conversations

Conversational Dataset with Diverse Sources

Other
10 months ago
1
224.37 MB
0

Fictional Worlds

Immersive insights into diverse fictional realms

Other
10 months ago
1
16.68 MB
0

Multilingual NER Dataset

Multilingual NER Dataset for Named Entity Recognition

Other
10 months ago
27
113.07 MB
0

WIDER FACE: Face Detection Benchmark

Face Detection Dataset with Image IDs and Number of Faces Detected

Other
10 months ago
3
4.21 MB
0

MedMCQA: Medical MCQ Dataset

Deep Learning & AI for Improving Healthcare

Healthcare
10 months ago
3
82.59 MB
0

Sciphi Textbooks Are All You Need

650,000 Unique Samples from K-12 to Grad School

Demographics and Population Studies
10 months ago
1
1.26 GB
0

AI Research Instructions And Outputs

Driving Innovation in Machine Learning and AI Exploration

Academic Research
10 months ago
1
53.72 MB
0

Airoboros LLMs Math Dataset

Mastering Complex Mathematical Operations in Machine Learning

Technology and IT
10 months ago
1
57.55 MB
0

Laion-Pop Image Classification Dataset

Accurately Predicting and Classifying Images with Alt Texts and NSFW Predictions

Technology and IT
10 months ago
1
298.51 MB
0

Open Assistant

Over 10,000 Annotated Trees in 35 Languages

Other
10 months ago
2
48.77 MB
0

Sigfox And LoRaWAN Localization Tool

Evaluating Fingerprinting Localization Algorithms in Large Outdoor Areas

Other
10 months ago
4
6.13 MB
0

Soil Texture Classes (USDA) By Depth, 250m

A Refined Global Mapping for 1950-2017

Other
10 months ago
1
3.54 kB
0

Hippocampal Gene Expression For Long-Term Memory

Understanding Transcription and Synaptic Regulation

Healthcare
10 months ago
9
145.24 kB
0

Global Health Outcomes Data

Impact on Mortality Rates and Malnutrition in Countries Around the World

Healthcare
10 months ago
1
28.41 kB
0

Mental Illness Disparities In Vets

Comparative Rates of Diagnoses Among Vulnerable Veteran Groups

Other
10 months ago
1
32.77 kB
0

Impact Of Living Standards On Dry Forest

Tribal and Marginalized Households in Central Indian Highlands

Environmental and Climate Sciences
10 months ago
1
90.55 kB
0

Global Hotspots Of Sharks And Longline Fishing

Machine-Learning-Assisted Spatial Distribution of At-Risk Species

Other
10 months ago
12
17.61 MB
0

Hunt Prices For North American Mammals

Investigating Costly Signaling Theory

Finance and Economics
10 months ago
1
15.5 kB
0

Popular Products From NewChic.com E-Commerce

Product, Brand, and User Interaction Analytics

Finance and Economics
10 months ago
9
20.63 MB
0

Women's Football (European Leagues)

Team and Player Performance Statistics

Sports
10 months ago
7
728.82 kB
0

Most Popular GitHub Projects

Popularity Factors and Growth Patterns

Technology and IT
10 months ago
1
491.4 kB
0

California Residents' ZEV Attitudes

Drivers' Preferences, Experiences, and Environmental Concerns

Environmental and Climate Sciences
10 months ago
9
3.62 MB
0

NYC Subway Entrance And Exit

Entrance & Exit locations of the NYC subway

Transportation and Logistics
10 months ago
1
122.18 kB
0

Pokemon Images And Text Descriptions

Pokemon Llava: Images and Text Descriptions

Media and Entertainment
10 months ago
1
692.48 MB
0

Germeval18 - Text Classification Dataset

Text Classification Dataset with Binary and Multi-class Labels

Technology and IT
10 months ago
2
851.69 kB
0

Allegro Articles Summarization Dataset

Allegro Articles Summarization Source-Target Dataset

Other
10 months ago
3
199.86 MB
0

Rag Instruct Benchmark Tester

200 Samples for Enterprise Core Q&A Tasks

Other
10 months ago
1
46.2 kB
0

PubMed Article Summarization Dataset

PubMed Summarization Dataset

Academic Research
10 months ago
3
1.16 GB
0

Alpaca GPT-4

High-Performance NLP for Instruction-Following Reasoning

Other
10 months ago
1
47.83 MB
0

High-Quality Multilingual Translation Data

13 Languages for Machine Learning

Technology and IT
10 months ago
62
208.17 MB
0

Databricks Dolly (15K)

Over 15,000 Language Models and Dialogues for Interactive Chat Applications

Other
10 months ago
1
7.68 MB
0

MetXBioDB Metabolite Biotransformations

Enzyme-Catalyzed Metabolism Insights

Other
10 months ago
1
446.83 kB
0

Occupational Skills And Tasks

Understanding the Role of Skills in Online Job Ads

Demographics and Population Studies
10 months ago
1
26.3 kB
0

Electronic Card Transactions From 2017-2020

Exploring Retail Spending Trends

Finance and Economics
10 months ago
72
6.62 MB
0

Relato Business Graph Database

Visualizing Company Relationships & Market Trends

Finance and Economics
10 months ago
2
10.95 MB
0

Tomato Gene Expression Data

Non-Organic Imprints

Healthcare
10 months ago
1
37.5 MB
0

HTTP Header Fields Dataset

How information is encoded and sent/received on the internet

Technology and IT
10 months ago
5
47.15 kB
0

Wikipedia Molecules Properties Dataset

Molecular Properties Dataset from Wikipedia

Other
10 months ago
1
1.83 MB
0

LAMBADA Word Prediction

Evaluating text understanding through word prediction

Other
10 months ago
3
552.45 MB
0

Question-Answering Training And Testing Data

A dataset for training and testing question-answering models

Other
10 months ago
2
83.38 MB
0

LLM Feedback Collection

Induce fine-grained evaluation capabilities into language models

Technology and IT
10 months ago
1
459.52 MB
0

UltraChat 200K

200K Dialogues of Diverse Topics for NLG Research

Academic Research
10 months ago
4
1.63 GB
0

Orca DPO Dialogue Pairs

Orca style for preference training (Intel's DPO dataset)

Other
10 months ago
1
18.88 MB
0
