Other Data | Page 14

German Question-Answer Context Dataset

German Q&A Context Dataset

Other

Kaggle

1 year ago

2

4.88 MB

0

GermanQuAD: High-Quality German QA Dataset

High-quality German Question Answering Dataset

Other

Kaggle

1 year ago

2

4.88 MB

0

Wikipedia Biographies Text Generation Dataset

Wikipedia Biographies: Infobox and First Paragraphs Texts

Other

Kaggle

1 year ago

3

440.17 MB

0

LexGLUE: Legal NLP Benchmark

Legal NLP Benchmark Dataset: LexGLUE

Other

Kaggle

1 year ago

21

562.21 MB

0

LeNER-Br: Portuguese Legal NER

Labeled Portuguese Legal NER

Other

Kaggle

1 year ago

3

1.26 MB

0

Korean Natural Language Inference

Korean NLI Data: Premises, Hypotheses, and Labels

Other

Kaggle

1 year ago

4

75.12 MB

0

HellaSwag: Commonsense NLI

ACL2019 Dataset for Testing Machine's Sentence Completion Abilities

Other

Kaggle

1 year ago

3

35.9 MB

0

English-Thai Translation Quality

English-to-Thai Translation Quality

Other

Kaggle

1 year ago

3

91.52 MB

0

File Validation And Training Statistics

Validation, Training, and Testing Statistics for tasksource/leandojo Files

Other

Kaggle

1 year ago

3

27.84 MB

0

TruthfulQA: Benchmark For Evaluating Language

Evaluating truthfulness in language models' answers

Other

Kaggle

1 year ago

2

498.05 kB

0

Middletownbooks Joke Training

Jokes for training joke generation

Other

Kaggle

1 year ago

1

2.18 MB

0

Symbolic Correlation Dataset For LLMs

Exploring the Relationship between Knowledge and Language

Other

Kaggle

1 year ago

1

130.15 kB

0

Legume-Rhizobium Mutualism Evolution

Multi and Single Strain Responses

Other

Kaggle

1 year ago

2

35.06 kB

0

WikiSQL (Questions And SQL Queries)

80654 hand-annotated questions and SQL queries on 24241 Wikipedia tables

Other

Kaggle

1 year ago

3

38.25 MB

0

ASLG-PC12 (English-ASL Gloss Parallel Corpus 2012)

Interactions between Corpus and Lexicon LREC

Other

Kaggle

1 year ago

1

7.18 MB

0

Landmark Detection For Tsetse Fly

Accurate Morphometric Data

Other

Kaggle

1 year ago

1

5.25 MB

0

Game Boy Advance Games

Games Released for Game boy advanced

Other

Kaggle

1 year ago

3

91.05 kB

0

Game Boy Games

Games released for game boy

Other

Kaggle

1 year ago

3

62.08 kB

0

Google's M&A History

How Much, What For, and Where?

Other

Kaggle

1 year ago

12

138.39 kB

0

Submachine Guns

A dataset of known submachine gun models

Other

Kaggle

1 year ago

1

14.32 kB

0

Galaxy Clustering

Iris, Moon, and Circles datasets for Galaxy clustering tutorial

Other

Kaggle

1 year ago

3

20.43 kB

0

Intel Processors

A Comprehensive Guide

Other

Kaggle

1 year ago

24

184.22 kB

0

The World's Highest Mountains

A Dataset of Peaks with at Least 500m Prominence

Other

Kaggle

1 year ago

3

15.28 kB

0

QASPER: NLP Questions And Evidence

Discovering Answers with Expertise

Other

Kaggle

1 year ago

3

27.9 MB

0

MultiNLI Textual Entailment Corpus

Multi-Genre Natural Language Inference (MultiNLI)

Other

Kaggle

1 year ago

3

215.15 MB

0

Cmrc2018 - Chinese Machine Reading Comprehension

Chinese MRC Dataset with Language Diversities

Other

Kaggle

1 year ago

3

5.48 MB

0

English-Darija Bilingual Text (Moroccan Arabic)

English-Darija Bilingual Corpus for Machine Translation

Other

Kaggle

1 year ago

1

23.28 MB

0

Erotiquant-XL

Enhanced erotica dataset with longer context samples

Other

Kaggle

1 year ago

1

99.99 MB

0

Kubernetes Commands

kubectl commands and descriptions for Kubernetes

Other

Kaggle

1 year ago

1

3.65 MB

0

Textual Entailment Dataset

Textual Entailment Dataset with Labelled Text Pairs

Other

Kaggle

1 year ago

3

51.84 MB

0

MLQA - Multilingual Question-Answering

Multilingual Question-Answering Dataset

Other

Kaggle

1 year ago

116

259.57 MB

0

HAREM Portuguese NER Corpus

Portuguese NER Corpus with 10 Classes

Other

Kaggle

1 year ago

3

442.56 kB

0

Mind2Web: Generalist Agents For Web Tasks

Language-guided Generalist Agents for Web Tasks

Other

Kaggle

1 year ago

1

814.5 MB

0

TokenBender: Alpaca Code Generation Instructions

Generating Alpaca-style code from natural language instructions

Other

Kaggle

1 year ago

1

70.75 MB

0

Knowledge Symbolic Correlation With LLMs

Building a Bridge Between Prompts and Knowledge for Large Language Models

Other

Kaggle

1 year ago

1

130.15 kB

0

Self-instruct Starcoder

Instruct dataset generated from starcoder

Other

Kaggle

1 year ago

4

10.83 MB

0

Ultrafeedback Binarized

Predicting Binary Preferences with SFT, PPO and DPO

Other

Kaggle

1 year ago

6

644.14 MB

0

Empathetic Conversational Model Benchmark

Conversation, Prompts, and Tags

Other

Kaggle

1 year ago

3

7.48 MB

0

Museo Del Prado Artworks

Pre-1489 Techniques, Dimensions and Origins

Other

Kaggle

1 year ago

1

625.25 kB

0

CommonsenseQA (Multiple-Choice Q&A)

12,102 questions with one correct answer and four distractor answers

Other

Kaggle

1 year ago

3

1.19 MB

0

Quoref (Q&A For Coreference Resolution)

Resolving Coreferences to Answer Questions

Other

Kaggle

1 year ago

2

9.97 MB

0

SciTail (Multiple-choice Science Exams)

27,026 Multiple-choice science exams and web sentences

Other

Kaggle

1 year ago

12

12.76 MB

0

EV Driver Trips In London

Charging Bundle Optimization for EV Adoption

Other

Kaggle

1 year ago

3

319.55 kB

0

Geographic Patterns Of NYPD Arrests

Exploring Arrest Locations and Contributing Factors

Other

Kaggle

1 year ago

1

2.54 kB

0

Open Subtitles Multilingual Translation

Train Sequential Neural Networks in Nine Languages

Other

Kaggle

1 year ago

5

641.5 MB

0

Blended Skill Talk

Personality, Empathy, and Knowledge

Other

Kaggle

1 year ago

3

62.47 MB

0

LongAlpaca 16K-Length

Investigating Natural Language Processing Performance

Other

Kaggle

1 year ago

1

125.31 MB

0

Large-Scale Preference Dataset

Training Powerful Reward & Critic Models with Aligned Language Models

Other

Kaggle

1 year ago

1

361.39 MB

0

Helpful-Harmless Assistant Dataset (For RLHF)

17k Train, 9000 Test

Other

Kaggle

1 year ago

2

191.59 MB

0

Tamazight-NLP/Pontoon-Translations: Source-Target

Tamazight Translation Dataset: Source-Target Sentences for NLP

Other

Kaggle

1 year ago

1

3.47 MB

0