The HellaSwag dataset is a benchmark for commonsense natural language inference (NLI) framed as sentence completion. It was introduced in "HellaSwag: Can a Machine Really Finish Your Sentence?" (Zellers et al., ACL 2019). Researchers and practitioners use it to train, validate, and evaluate models that must pick the most plausible continuation of a context using commonsense knowledge.
The dataset ships as three files: train.csv, validation.csv, and test.csv. The train.csv file provides the training data: each row pairs a context with an activity label, four candidate sentence completions (endings), a split assignment (train, dev, or test), and a split type recording whether the example is in-domain or zero-shot with respect to the training activities.
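A minimal sketch of loading and inspecting one of these files with pandas. The rows below are invented placeholders that mirror the column layout described above (the real train.csv may name or order columns slightly differently):

```python
import pandas as pd

# Hypothetical rows mirroring the described schema; the values are
# illustrative placeholders, not real HellaSwag data.
rows = [
    {
        "ind": 0,
        "activity_label": "Making a sandwich",
        "ctx_a": "A man lays two slices of bread on a plate.",
        "ctx_b": "he",
        "endings": '["spreads butter on one slice.", "throws the plate away.", '
                   '"starts mowing the lawn.", "sings into the toaster."]',
        "split": "train",
        "split_type": "indomain",
        "label": 0,
    },
]
df = pd.DataFrame(rows)

# With the actual file on disk, this would simply be:
# df = pd.read_csv("train.csv")
print(df.columns.tolist())
```

Building the frame in memory keeps the sketch self-contained; swapping in `pd.read_csv("train.csv")` yields the same structure from the released file.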
The validation.csv file holds data reserved for model selection: it lets researchers check how well a model generalizes to completions it was not trained on before committing to a final evaluation.
The test.csv file supports that final evaluation: given a context and its activity label, a model must choose the appropriate ending, and its accuracy on this held-out split measures how effectively it applies commonsense knowledge.
Each row carries an index identifying the data point, the context fields ctx_a and ctx_b that supply the background a model must complete, and an activity label describing the activity or event the context is drawn from. In the labeled splits, a label column marks which of the candidate endings is correct.
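The per-row structure above can be sketched as follows. The field values are invented placeholders; the point is how the two context fields combine and how the label selects the gold ending:

```python
import ast

# Illustrative row following the schema described above; the text is
# made up, not taken from the real dataset.
row = {
    "ctx_a": "A chef places a pan on the stove.",
    "ctx_b": "he",
    "endings": '["stirs the sauce slowly.", "jumps over the counter.", '
               '"paints the kitchen wall.", "reads a newspaper aloud."]',
    "label": 0,
}

# The full context is ctx_a followed by ctx_b (ctx_b may be empty).
context = f'{row["ctx_a"]} {row["ctx_b"]}'.strip()

# In CSV form the four endings are typically serialized as a list
# literal, so parse them back into a Python list.
endings = ast.literal_eval(row["endings"])

# The label is an index into the endings list.
gold = endings[row["label"]]
print(context)
print(gold)
```

A model is scored by whether the ending it ranks highest matches `gold`, so accuracy over a split is just the fraction of rows where the predicted index equals `label`.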
To support generalization and fairness testing during model development, each row also records its split (train, dev, or test) and its split type: in-domain examples come from activity categories seen during training, while zero-shot examples come from categories held out of training entirely.
In summary, the HellaSwag dataset is a valuable resource for researchers and practitioners in commonsense NLI. By leveraging it, one can train and evaluate machine learning models that excel at selecting plausible sentence completions grounded in common sense.