Baselight
Sign In
Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
1 option selected: Other
Multi-select dropdown. Use arrow keys to navigate, Enter to select, and Escape to close.
No options selected
1295 results

BoolQ - Question-Answer-Passage Consistency

BoolQ Dataset: Question-Answer-Passage Consistency

Other
11 months ago
2
4.95 MB
0

German Question-Answer Context Dataset

German Q&A Context Dataset

Other
11 months ago
2
4.88 MB
0

GermanQuAD: High-Quality German QA Dataset

High-quality German Question Answering Dataset

Other
11 months ago
2
4.88 MB
0

Wikipedia Biographies Text Generation Dataset

Wikipedia Biographies: Infobox and First Paragraphs Texts

Other
11 months ago
3
440.17 MB
0

LexGLUE: Legal NLP Benchmark

Legal NLP Benchmark Dataset: LexGLUE

Other
11 months ago
21
562.21 MB
0

LeNER-Br: Portuguese Legal NER

Labeled Portuguese Legal NER

Other
11 months ago
3
1.26 MB
0

Korean Natural Language Inference

Korean NLI Data: Premises, Hypotheses, and Labels

Other
11 months ago
4
75.12 MB
0

HellaSwag: Commonsense NLI

ACL2019 Dataset for Testing Machine's Sentence Completion Abilities

Other
11 months ago
3
35.9 MB
0

English-Thai Translation Quality

English-to-Thai Translation Quality

Other
11 months ago
3
91.52 MB
0

File Validation And Training Statistics

Validation, Training, and Testing Statistics for tasksource/leandojo Files

Other
11 months ago
3
27.84 MB
0

TruthfulQA: Benchmark For Evaluating Language

Evaluating truthfulness in language models' answers

Other
11 months ago
2
498.05 kB
0

Middletownbooks Joke Training

Jokes for training joke generation

Other
11 months ago
1
2.18 MB
0

Symbolic Correlation Dataset For LLMs

Exploring the Relationship between Knowledge and Language

Other
11 months ago
1
130.15 kB
0

Legume-Rhizobium Mutualism Evolution

Multi and Single Strain Responses

Other
11 months ago
2
35.06 kB
0

WikiSQL (Questions And SQL Queries)

80654 hand-annotated questions and SQL queries on 24241 Wikipedia tables

Other
11 months ago
3
38.25 MB
0

ASLG-PC12 (English-ASL Gloss Parallel Corpus 2012)

Interactions between Corpus and Lexicon LREC

Other
11 months ago
1
7.18 MB
0

Landmark Detection For Tsetse Fly

Accurate Morphometric Data

Other
11 months ago
1
5.25 MB
0

Game Boy Advance Games

Games Released for Game boy advanced

Other
11 months ago
3
91.05 kB
0

Game Boy Games

Games released for game boy

Other
11 months ago
3
62.08 kB
0

Google's M&A History

How Much, What For, and Where?

Other
11 months ago
12
138.39 kB
0

Submachine Guns

A dataset of known submachine gun models

Other
11 months ago
1
14.32 kB
0

Galaxy Clustering

Iris, Moon, and Circles datasets for Galaxy clustering tutorial

Other
11 months ago
3
20.43 kB
0

Intel Processors

A Comprehensive Guide

Other
11 months ago
24
184.22 kB
0

The World's Highest Mountains

A Dataset of Peaks with at Least 500m Prominence

Other
11 months ago
3
15.28 kB
0

QASPER: NLP Questions And Evidence

Discovering Answers with Expertise

Other
11 months ago
3
27.9 MB
0

MultiNLI Textual Entailment Corpus

Multi-Genre Natural Language Inference (MultiNLI)

Other
11 months ago
3
215.15 MB
0

Cmrc2018 - Chinese Machine Reading Comprehension

Chinese MRC Dataset with Language Diversities

Other
11 months ago
3
5.48 MB
0

English-Darija Bilingual Text (Moroccan Arabic)

English-Darija Bilingual Corpus for Machine Translation

Other
11 months ago
1
23.28 MB
0

Erotiquant-XL

Enhanced erotica dataset with longer context samples

Other
11 months ago
1
99.99 MB
0

Kubernetes Commands

kubectl commands and descriptions for Kubernetes

Other
11 months ago
1
3.65 MB
0

Textual Entailment Dataset

Textual Entailment Dataset with Labelled Text Pairs

Other
11 months ago
3
51.84 MB
0

MLQA - Multilingual Question-Answering

Multilingual Question-Answering Dataset

Other
11 months ago
116
259.57 MB
0

HAREM Portuguese NER Corpus

Portuguese NER Corpus with 10 Classes

Other
11 months ago
3
442.56 kB
0

Mind2Web: Generalist Agents For Web Tasks

Language-guided Generalist Agents for Web Tasks

Other
11 months ago
1
814.5 MB
0

TokenBender: Alpaca Code Generation Instructions

Generating Alpaca-style code from natural language instructions

Other
11 months ago
1
70.75 MB
0

Knowledge Symbolic Correlation With LLMs

Building a Bridge Between Prompts and Knowledge for Large Language Models

Other
11 months ago
1
130.15 kB
0

Self-instruct Starcoder

Instruct dataset generated from starcoder

Other
11 months ago
4
10.83 MB
0

Ultrafeedback Binarized

Predicting Binary Preferences with SFT, PPO and DPO

Other
11 months ago
6
644.14 MB
0

Empathetic Conversational Model Benchmark

Conversation, Prompts, and Tags

Other
11 months ago
3
7.48 MB
0

Museo Del Prado Artworks

Pre-1489 Techniques, Dimensions and Origins

Other
11 months ago
1
625.25 kB
0

CommonsenseQA (Multiple-Choice Q&A)

12,102 questions with one correct answer and four distractor answers

Other
11 months ago
3
1.19 MB
0

Quoref (Q&A For Coreference Resolution)

Resolving Coreferences to Answer Questions

Other
11 months ago
2
9.97 MB
0

SciTail (Multiple-choice Science Exams)

27,026 Multiple-choice science exams and web sentences

Other
11 months ago
12
12.76 MB
0

EV Driver Trips In London

Charging Bundle Optimization for EV Adoption

Other
11 months ago
3
319.55 kB
0

Geographic Patterns Of NYPD Arrests

Exploring Arrest Locations and Contributing Factors

Other
11 months ago
1
2.54 kB
0

Open Subtitles Multilingual Translation

Train Sequential Neural Networks in Nine Languages

Other
11 months ago
5
641.5 MB
0

Blended Skill Talk

Personality, Empathy, and Knowledge

Other
11 months ago
3
62.47 MB
0

LongAlpaca 16K-Length

Investigating Natural Language Processing Performance

Other
11 months ago
1
125.31 MB
0

Large-Scale Preference Dataset

Training Powerful Reward & Critic Models with Aligned Language Models

Other
11 months ago
1
361.39 MB
0
11 months ago
2
191.59 MB
0

Share link

Anyone who has the link will be able to view this.