Baselight
Sign In

Datasets

Total public datasets added

8,718

Rows

Total rows contributed

5,557,227,310

Popularity

Total times datasets used in queries

248

Stars

Total stars received

17

Job Postings In Europe

Exploring Salaries, Job Types and Locations

Finance and Economics
11 months ago
1
37.08 MB
0

Opera Performances

Opera performances and associated data (Composers, Year written, etc)

Other
11 months ago
1
618.08 kB
0

GoodReads Best Books

Ratings, Genres, Awards, and More

Media and Entertainment
11 months ago
1
42.19 MB
0

Evol-Instruct-Code-80k-v1

Instructional code snippets with corresponding outputs

Other
11 months ago
1
53.72 MB
0

DailyDialog (Multi-turn Dialog)

Dialogues that reflect our daily communication way and cover various topics

Other
11 months ago
3
4.13 MB
0

Online Influencer Marketing

Influencer Engagement and Performance

Ecommerce and Consumer Trends
11 months ago
1
62.88 kB
0

Belgian Statutory Article Retrieval Dataset

Legal Q&A Dataset for Law Information Retrieval

Other
11 months ago
3
4.01 MB
0

ViGGO: Video Game Chatbot Dataset

Conversational data-to-text for video game chatbots

Technology and IT
11 months ago
8
1.2 MB
0

Medical Conversation Corpus (100k+)

Generative Language Modeling for Medical Applications

Healthcare
11 months ago
2
75.7 MB
0

AG News (News Articles)

News Articles Text Classification

Technology and IT
11 months ago
2
19.35 MB
0

Conversations On Coding, Debugging, Storytelling

Conversations on Coding, Debugging, Storytelling & Science

Other
11 months ago
1
2.21 MB
0

ProsocialDialog - Problematic Content Dialogue

Teach conversation agents to respond to problematic topics

Other
11 months ago
3
40.71 MB
0

Comprehensive Medical Q&A Dataset

Unlocking Healthcare Data with Natural Language Processing

Healthcare
11 months ago
1
8.76 MB
0

Chinese Medical Dialogue

Deep Learning for Intelligent Healthcare

Healthcare
11 months ago
6
888.12 MB
0

Housing Prices In San Francisco (Craigslist)

Predicting housing prices based on scraped craigslist data

Finance and Economics
11 months ago
2
223.9 kB
0

Global Suicide, Mental Health, Substance Use

Analyzing the Impact Across Countries

Healthcare
11 months ago
2
90.11 kB
0

Hourly European Power Market Prices

Price Comparisons by System, Type and Currency

Finance and Economics
11 months ago
1
11.65 MB
0

Amazon Product Reviews

18 Years of Customer Ratings and Experiences

Finance and Economics
11 months ago
2
1.12 GB
0

OpenBookQA (Multi-step Reasoning)

Multi-step Reasoning, Commonsense Knowledge, and Rich Text Comprehension

Other
11 months ago
6
1.37 MB
0

SciQ (Scientific Question Answering)

Question & Answering on scientific topics

Academic Research
11 months ago
3
4.49 MB
0

Glaive Python Code QA Dataset

Supporting Intelligent Development of Code Assistants

Other
11 months ago
1
102.52 MB
0

Recipes Dataset

Recipes Dataset for NLP

Other
11 months ago
2
632.41 kB
0

Student Engagement

Predicting Engagement and Exam Performance

Demographics and Population Studies
11 months ago
11
2.58 MB
0

Mental Health Support Feature Analysis

Correlating Text Features and Mental Health Indicators

Healthcare
11 months ago
99
1.13 GB
0

Tweet Sentiment's Impact On Stock Returns

862,231 Labeled Instances

Finance and Economics
11 months ago
2
91 MB
0

Global C2C Fashion Store User Behaviour Analysis

Analyzing Buyer and Seller Profiles across Countries

Ecommerce and Consumer Trends
11 months ago
4
2.27 MB
0

The United States National Parks

Discover America's Natural Wonders

Demographics and Population Studies
11 months ago
5
46.01 kB
0

Python Code Instruction

Training Data with Instruction, Input, Output, and Prompt Columns

Other
11 months ago
1
11.13 MB
0

Coding Questions With Solutions

Introductory, Interview and Competition Levels

Other
11 months ago
2
788.73 MB
0

OpenAI HumanEval Code Gen

Handcrafted Python Programming Problems for Accurate Model Evaluation

Technology and IT
11 months ago
1
85.24 kB
0

New York City Airbnb Reviews

A Dataset for Text Analysis and Of NYC Airbnb Reviews

Ecommerce and Consumer Trends
11 months ago
1
3.45 MB
0

Spanish Housing Dataset: Location, Size, Price,

Now with 100% More Fun!

Finance and Economics
11 months ago
26
67.82 MB
0

Timeline Of Historical Pandemics

Tracing the Past to Prevent the Future

Other
11 months ago
9
80.14 kB
0

Cybersecurity Risk (2022 CISA Vulnerability)

Severity, CVSS Score, and National Security Vulnerability Types

Technology and IT
11 months ago
5
532.91 kB
0

Sephora Skincare Products

Skincare products of sephora.com

Finance and Economics
11 months ago
16
519.61 kB
0

How Natural Disasters Impact Region's Labor Market

A Comprehensive Dataset of Economic Disruptions

Finance and Economics
11 months ago
1
62.02 kB
0

Tech Salaries

A Detailed Look into the US and International Salary & Experience Landscape

Finance and Economics
11 months ago
1
105.86 kB
0

H-1B Non-Immigrant Labour Visa

Investigating Impact on Job Market, Salary, & Approval Rate 2011-2018

Finance and Economics
11 months ago
1
82.82 MB
0

Bots On Social Media

Tracking Content Spread

Media and Entertainment
11 months ago
1
287.77 kB
0

SMS Spam Collection (Text Classification)

SMS labeled messages that have been collected for mobile phone spam research

Academic Research
11 months ago
1
328.25 kB
0

US Travel Check-Ins - Analysis

In-Depth Study of Location, Date, Temperature, USIndex, and Crime Rates

Environmental and Climate Sciences
11 months ago
7
2.36 MB
0

SXSW 2019 Schedule Dataset

SXSW 2019 Schedule: Music and Speaker Events

Media and Entertainment
11 months ago
2
3.76 MB
0

Eurovision Festival Voting Dynamics

Country Interactions between 2002-2023

Politics and Governance
11 months ago
5
80.68 kB
0

Suicidality On Reddit

Characterization of Time-variant and Time-invariant Assessment of Suicidality

Ecommerce and Consumer Trends
11 months ago
1
2.16 MB
0

Synthetic Therapy Conversations

Synthetic Therapy Conversations

Other
11 months ago
1
210.39 MB
0

Nintendo Entertainment System Games

Games released for the NES system

Media and Entertainment
11 months ago
5
55.18 kB
0

NBA Players Stats And Rating (Timeseries)

Analyze NBA players performance over time The best players in the NBA, accordin

Sports
11 months ago
58
5.08 MB
0

All Romanian Higher Education Institutions

Aggregated index of all the Romanian Higher Education Institutions

Demographics and Population Studies
11 months ago
1
32.16 MB
0

Google Stadia Games

Games released for google stadia

Other
11 months ago
2
35.11 kB
0

All GPT-4 Conversations

All chat datasets generated by GPT-4 from Huggingface in the same format

Other
11 months ago
27
1.39 GB
0
Load More

Share link

Anyone who has the link will be able to view this.