Dataset
Provider
Updated at
Tables
Size
Stars
HTTP Header Fields Dataset
How information is encoded and sent/received on the internet
Technology and ITQuestion-Answering Training And Testing Data
A dataset for training and testing question-answering models
OtherLLM Feedback Collection
Induce fine-grained evaluation capabilities into language models
Technology and ITQSAR Molecular Descriptor Predictions
Analyzing Activation Energy in Chemical Compounds
Environmental and Climate SciencesPAWS (Paraphrase Word Scrambling)
A dataset for modeling structure, context, and word order information
OtherTinyShakespeare (Shakespeare's Plays)
40,000 lines of Shakespeare from a variety of Shakespeare's plays
OtherReddit: /r/EatCheapAndHealthy
Cost-Effective Nutritional Solutions from the Community
Finance and EconomicsLovoo V3 Dating App User Profiles And Statistics
Revealing popular user traits and behavior
Media and EntertainmentCrypto, Web3 And Blockchain Jobs
Scraped active crypto jobs listed on cryptojobslist.com
Crypto and BlockchainPsychedelic Drug Database
Psychotropic and psychedelics drugs database with molecular descriptors
HealthcareAmod Mental Health Counseling Conversations
A dataset of mental health counseling conversations for training models
HealthcareLogical Reasoning Improvement Dataset
Enhancing LLM Logical Reasoning Skills with Platypus2 Models
Technology and ITAutonomous Transport User Experiences
Rating Vehicle and User Interface Performance in Luxembourg Pilots
Transportation and LogisticsFertilizer Use And Price
1960-2012 data on fertilizer consumption in the United States by plant nutrient
Finance and EconomicsChemistry Problem-Solution
Chemistry Problem-Solution Dataset: 20K pairs across 25 topics and subtopics
OtherOpenerotica/basilisk-v0.2 Conversations Dataset
Annotated Conversations from openerotica and freedom-rp
OtherGPT Roleplay Realm: Enhanced Character
Character Cards and Dialogues for immersive role-playing experiences
OtherRegional Water Temperatures Over Time
Historical Records of Berlin, Brandenburg and Altmark Lakes
Environmental and Climate SciencesPredicting Portuguese Bank Term Deposit
Identifying Likely Customers for Conversion Optimization
Finance and EconomicsSmithsonian Butterfly Dataset
Butterfly images and information from the Smithsonian Institution
OtherGSM8K - Grade School Math 8K Q&A
A Linguistically Diverse Dataset for Multi-Step Reasoning Question Answering
Demographics and Population StudiesGeneral Language Understanding Evaluation (GLUE)
The Famous General Language Understanding Evaluation benchmark
Other