Baselight

Data Catalog

Explore, analyze, and share quality data.

Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
Showing 58957 Datasets

Belgian Statutory Article Retrieval Dataset

Legal Q&A Dataset for Law Information Retrieval

Other
10 months ago
3
4.01 MB
0

ViGGO: Video Game Chatbot Dataset

Conversational data-to-text for video game chatbots

Technology and IT
10 months ago
8
1.2 MB
0

Medical Conversation Corpus (100k+)

Generative Language Modeling for Medical Applications

Healthcare
10 months ago
2
75.7 MB
0

AG News (News Articles)

News Articles Text Classification

Technology and IT
10 months ago
2
19.35 MB
0

Conversations On Coding, Debugging, Storytelling

Conversations on Coding, Debugging, Storytelling & Science

Other
10 months ago
1
2.21 MB
0

ProsocialDialog - Problematic Content Dialogue

Teach conversation agents to respond to problematic topics

Other
10 months ago
3
40.71 MB
0

Comprehensive Medical Q&A Dataset

Unlocking Healthcare Data with Natural Language Processing

Healthcare
10 months ago
1
8.76 MB
0

Chinese Medical Dialogue

Deep Learning for Intelligent Healthcare

Healthcare
10 months ago
6
888.12 MB
0

Housing Prices In San Francisco (Craigslist)

Predicting housing prices based on scraped craigslist data

Finance and Economics
10 months ago
2
223.9 kB
0

Global Suicide, Mental Health, Substance Use

Analyzing the Impact Across Countries

Healthcare
10 months ago
2
90.11 kB
0

Hourly European Power Market Prices

Price Comparisons by System, Type and Currency

Finance and Economics
10 months ago
1
11.65 MB
0

Amazon Product Reviews

18 Years of Customer Ratings and Experiences

Finance and Economics
10 months ago
2
1.12 GB
0

OpenBookQA (Multi-step Reasoning)

Multi-step Reasoning, Commonsense Knowledge, and Rich Text Comprehension

Other
10 months ago
6
1.37 MB
0

SciQ (Scientific Question Answering)

Question & Answering on scientific topics

Academic Research
10 months ago
3
4.49 MB
0

Glaive Python Code QA Dataset

Supporting Intelligent Development of Code Assistants

Other
10 months ago
1
102.52 MB
0

Recipes Dataset

Recipes Dataset for NLP

Other
10 months ago
2
632.41 kB
0

Student Engagement

Predicting Engagement and Exam Performance

Demographics and Population Studies
10 months ago
11
2.58 MB
0

Mental Health Support Feature Analysis

Correlating Text Features and Mental Health Indicators

Healthcare
10 months ago
99
1.13 GB
0

Tweet Sentiment's Impact On Stock Returns

862,231 Labeled Instances

Finance and Economics
10 months ago
2
91 MB
0

Global C2C Fashion Store User Behaviour Analysis

Analyzing Buyer and Seller Profiles across Countries

Ecommerce and Consumer Trends
10 months ago
4
2.27 MB
0

The United States National Parks

Discover America's Natural Wonders

Demographics and Population Studies
10 months ago
5
46.01 kB
0

Python Code Instruction

Training Data with Instruction, Input, Output, and Prompt Columns

Other
10 months ago
1
11.13 MB
0

Coding Questions With Solutions

Introductory, Interview and Competition Levels

Other
10 months ago
2
788.73 MB
0

OpenAI HumanEval Code Gen

Handcrafted Python Programming Problems for Accurate Model Evaluation

Technology and IT
10 months ago
1
85.24 kB
0

New York City Airbnb Reviews

A Dataset for Text Analysis and Of NYC Airbnb Reviews

Ecommerce and Consumer Trends
10 months ago
1
3.45 MB
0

Spanish Housing Dataset: Location, Size, Price,

Now with 100% More Fun!

Finance and Economics
10 months ago
26
67.82 MB
0

Timeline Of Historical Pandemics

Tracing the Past to Prevent the Future

Other
10 months ago
9
80.14 kB
0

Cybersecurity Risk (2022 CISA Vulnerability)

Severity, CVSS Score, and National Security Vulnerability Types

Technology and IT
10 months ago
5
532.91 kB
0

Sephora Skincare Products

Skincare products of sephora.com

Finance and Economics
10 months ago
16
519.61 kB
0

How Natural Disasters Impact Region's Labor Market

A Comprehensive Dataset of Economic Disruptions

Finance and Economics
10 months ago
1
62.02 kB
0

Tech Salaries

A Detailed Look into the US and International Salary & Experience Landscape

Finance and Economics
10 months ago
1
105.86 kB
0

H-1B Non-Immigrant Labour Visa

Investigating Impact on Job Market, Salary, & Approval Rate 2011-2018

Finance and Economics
10 months ago
1
82.82 MB
0

Bots On Social Media

Tracking Content Spread

Media and Entertainment
10 months ago
1
287.77 kB
0

SMS Spam Collection (Text Classification)

SMS labeled messages that have been collected for mobile phone spam research

Academic Research
10 months ago
1
328.25 kB
0

US Travel Check-Ins - Analysis

In-Depth Study of Location, Date, Temperature, USIndex, and Crime Rates

Environmental and Climate Sciences
10 months ago
7
2.36 MB
0

SXSW 2019 Schedule Dataset

SXSW 2019 Schedule: Music and Speaker Events

Media and Entertainment
10 months ago
2
3.76 MB
0

Eurovision Festival Voting Dynamics

Country Interactions between 2002-2023

Politics and Governance
10 months ago
5
80.68 kB
0

Suicidality On Reddit

Characterization of Time-variant and Time-invariant Assessment of Suicidality

Ecommerce and Consumer Trends
10 months ago
1
2.16 MB
0

Synthetic Therapy Conversations

Synthetic Therapy Conversations

Other
10 months ago
1
210.39 MB
0

Nintendo Entertainment System Games

Games released for the NES system

Media and Entertainment
10 months ago
5
55.18 kB
0

NBA Players Stats And Rating (Timeseries)

Analyze NBA players performance over time The best players in the NBA, accordin

Sports
10 months ago
58
5.08 MB
0

All Romanian Higher Education Institutions

Aggregated index of all the Romanian Higher Education Institutions

Demographics and Population Studies
10 months ago
1
32.16 MB
0

Google Stadia Games

Games released for google stadia

Other
10 months ago
2
35.11 kB
0

All GPT-4 Conversations

All chat datasets generated by GPT-4 from Huggingface in the same format

Other
10 months ago
27
1.39 GB
0

Weeds In Cultivation Fields

Ecology, Biogeography, and Red List Status

Other
10 months ago
1
96.2 kB
0

Mental Health In Drug Users During COVID-19

Exploring Personality and Risk Profiles

Healthcare
10 months ago
2
7.09 MB
0

MuSe Music Sentiment Analysis

Music Tags, Metadata, & Audio Features

Ecommerce and Consumer Trends
10 months ago
1
7.67 MB
0

Housing Prices In Lagos, Nigeria

Address, Price, and Property Name

Finance and Economics
10 months ago
4
1.54 MB
0

Video Game Prices 2022

Analyzing Digital and Physical Retailer Prices

Finance and Economics
10 months ago
1
88.43 kB
0

PHQ-9 Depression Assessment

14-Days of Ambulatory Mood Dynamics in a General Population

Healthcare
10 months ago
1
500.41 kB
0

Share link

Anyone who has the link will be able to view this.