Baselight

Data Catalog

Explore, analyze, and share quality data.

Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
Showing 58957 Datasets

US Renewable Energy Programs

Generation, States, Programs, Contacts, and More

Environmental and Climate Sciences
10 months ago
37
10.65 MB
0

Free WiFi To Monitor Flow In Hanoian Markets

Demographics, Timing and Frequency of Visitor Behaviour

Demographics and Population Studies
10 months ago
3
12.78 MB
0

Dietary Restriction Transgenerational Fitness

Three-Generational Study in *Caenorhabditis elegans*

Healthcare
10 months ago
5
173.73 kB
0

East Bay Housing Prices: Room Shares Vs Apartments

A Tale of Two Cities

Finance and Economics
10 months ago
1
16.58 kB
0

WikiSQL (Questions And SQL Queries)

80654 hand-annotated questions and SQL queries on 24241 Wikipedia tables

Other
10 months ago
3
38.25 MB
0

Avengers Character Appearances, Deaths

Assessing Gender, Status, and Resurrections

Demographics and Population Studies
10 months ago
1
29.08 kB
0

US Jobs On Dice.com

22,000 technology job listings

Technology and IT
10 months ago
1
30.81 MB
0

Real Estate Sales 730 Days

City of Hartford real estate sales for the past 2 years

Finance and Economics
10 months ago
1
299 kB
0

ASLG-PC12 (English-ASL Gloss Parallel Corpus 2012)

Interactions between Corpus and Lexicon LREC

Other
10 months ago
1
7.18 MB
0

SciFact (Scientific Claims)

1.4K Expert-Written Claims with Structured Annotations

Academic Research
10 months ago
4
4.69 MB
0

Salaries And Job Postings By Company In Australia

Uncovering Industry Trends and Analyzing Companies’ Salary Structures

Finance and Economics
10 months ago
1
34.75 MB
0

Italian Negation Constructions - Tweets

Exploring Language Variation Across 10 Cities

Ecommerce and Consumer Trends
10 months ago
1
21.13 kB
0

Remote Jobs In Spain

Analyzing Roles, Technologies, and Salaries in October 2020

Finance and Economics
10 months ago
1
1.31 MB
0

Reddit: /r/worldnews (Submissions & Comments)

Analyzing Post Engagement

Ecommerce and Consumer Trends
10 months ago
1
268.97 kB
0

Industrial Energy End Use In The U.S

Facility-Level Combustion Energy Data

Environmental and Climate Sciences
10 months ago
2
1.16 MB
0

Landmark Detection For Tsetse Fly

Accurate Morphometric Data

Other
10 months ago
1
5.25 MB
0

NewChic Product Catalog (Customer Segmentation)

Trends, User Tastes, Brand Segmentation, and Online Shopping Opportunities

Finance and Economics
10 months ago
9
20.63 MB
0

US Tennis Courts: Capacity, Amenities, And

Discovering Court Types, Amenities, and Locations Across the US

Sports
10 months ago
1
1.28 MB
0

Airbnb Listings And Reviews In Washington, DC

Exploring Room Availability, Host Profiles, and Pricing Data

Finance and Economics
10 months ago
2
739.8 kB
0

London's Airbnb

Airbnb listings in London

Ecommerce and Consumer Trends
10 months ago
3
13.61 MB
0

Compounds For Studying Environmental Exposures

PubChemLite: Annotation Categories for Translational and Applied Research

Academic Research
10 months ago
1
73.82 MB
0

COCONUT: The COlleCtion Of Open NatUral ProducTs.

Unlocking Molecule Information

Finance and Economics
10 months ago
1
73.58 MB
0

Comprehensive Literary Greats Dataset

50,000+ Books Rated and Awarded Across Language, Genre, and Format

Media and Entertainment
10 months ago
1
42.19 MB
0

SONYC-UST Audio Tag Dataset

Annotated Real-World Urban Sounds for Multi-Label Audio Tag Prediction

Transportation and Logistics
10 months ago
1
822.85 kB
0

Game Boy Advance Games

Games Released for Game boy advanced

Other
10 months ago
3
91.05 kB
0

Game Boy Games

Games released for game boy

Other
10 months ago
3
62.08 kB
0

Google's M&A History

How Much, What For, and Where?

Other
10 months ago
12
138.39 kB
0

Submachine Guns

A dataset of known submachine gun models

Other
10 months ago
1
14.32 kB
0

Galaxy Clustering

Iris, Moon, and Circles datasets for Galaxy clustering tutorial

Other
10 months ago
3
20.43 kB
0

Lake Baikal Biomass (Decal Change)

Investigating Climate Change-Driven Regime Shifts

Environmental and Climate Sciences
10 months ago
2
72.61 kB
0

Insects Flight Dynamics

Drosophila melanogaster, Isoleucinella rotunda, and Calopteron reticulatum

Transportation and Logistics
10 months ago
50
1.25 GB
0

Intel Processors

A Comprehensive Guide

Other
10 months ago
24
184.22 kB
0

The World's Highest Mountains

A Dataset of Peaks with at Least 500m Prominence

Other
10 months ago
3
15.28 kB
0

QASPER: NLP Questions And Evidence

Discovering Answers with Expertise

Other
10 months ago
3
27.9 MB
0

MultiNLI Textual Entailment Corpus

Multi-Genre Natural Language Inference (MultiNLI)

Other
10 months ago
3
215.15 MB
0

Cmrc2018 - Chinese Machine Reading Comprehension

Chinese MRC Dataset with Language Diversities

Other
10 months ago
3
5.48 MB
0

English-Darija Bilingual Text (Moroccan Arabic)

English-Darija Bilingual Corpus for Machine Translation

Other
10 months ago
1
23.28 MB
0

Erotiquant-XL

Enhanced erotica dataset with longer context samples

Other
10 months ago
1
99.99 MB
0

Synthia-v1.3

Synthetic training data for LLM development

Technology and IT
10 months ago
1
128.27 MB
0

Kubernetes Commands

kubectl commands and descriptions for Kubernetes

Other
10 months ago
1
3.65 MB
0

Cricket Commentary Dataset

Performance Validation for Cricket Commentary Model

Sports
10 months ago
3
6.95 MB
0

Text Classification For QA Dataset

Text classification dataset for question answering

Technology and IT
10 months ago
3
13.32 MB
0

Accurate Medical Translation Data

Accurate Medical Translation Dataset

Healthcare
10 months ago
1
2.45 MB
0

Textual Entailment Dataset

Textual Entailment Dataset with Labelled Text Pairs

Other
10 months ago
3
51.84 MB
0

WinoBias Coreference Dataset

Gender-biased coreference dataset focused on occupation stereotypes in WinoBias

Demographics and Population Studies
10 months ago
8
271.58 kB
0

WikiANN

Multilingual named entity recognition for LLM training

Technology and IT
10 months ago
528
137.22 MB
0

MLQA - Multilingual Question-Answering

Multilingual Question-Answering Dataset

Other
10 months ago
116
259.57 MB
0

HAREM Portuguese NER Corpus

Portuguese NER Corpus with 10 Classes

Other
10 months ago
3
442.56 kB
0

DBpedia Ontology

Text Classification Dataset with 14 Classes

Technology and IT
10 months ago
2
116 MB
0

Mind2Web: Generalist Agents For Web Tasks

Language-guided Generalist Agents for Web Tasks

Other
10 months ago
1
814.5 MB
0

Share link

Anyone who has the link will be able to view this.