Data Catalog
Explore, analyze, and share quality data.
Explore, analyze, and share quality data.
Text classification dataset for question answering
3 tables
12.7 MB0
Accurate Medical Translation Dataset
1 table
2.34 MB0
Textual Entailment Dataset with Labelled Text Pairs
3 tables
49.44 MB0
Gender-biased coreference dataset focused on occupation stereotypes in WinoBias
8 tables
265.21 KB0
Multilingual named entity recognition for LLM training
528 tables
130.87 MB0
Multilingual Question-Answering Dataset
116 tables
247.54 MB0
Portuguese NER Corpus with 10 Classes
3 tables
432.18 KB0
Text Classification Dataset with 14 Classes
2 tables
110.63 MB0
Language-guided Generalist Agents for Web Tasks
1 table
776.76 MB0
Biology Problem-Solution Pairs for Synthetic Biology
1 table
20.85 MB0
A curated dataset for math instruction tuning models
1 table
93.14 MB0
Generating Alpaca-style code from natural language instructions
1 table
67.48 MB0
Building a Bridge Between Prompts and Knowledge for Large Language Models
1 table
127.1 KB0
Instruct dataset generated from starcoder
4 tables
10.33 MB0
Predicting Binary Preferences with SFT, PPO and DPO
6 tables
614.3 MB0
Conversation, Prompts, and Tags
3 tables
7.13 MB0
Analyzing Consumer Engagement and Content Trends
1 table
220.19 KB0
Identifying Key Associations
2 tables
236.92 KB0
Pre-1489 Techniques, Dimensions and Origins
1 table
610.6 KB0
12,102 questions with one correct answer and four distractor answers
3 tables
1.14 MB0
Comprehensive Collection of Text Classification Datasets
77 tables
61.62 MB0
Resolving Coreferences to Answer Questions
2 tables
9.51 MB0
Predicting Movie Review Sentiment
3 tables
850.85 KB0
A Dataset of mobile phone carriers
3 tables
16.78 KB0
Anyone who has the link will be able to view this.