CommonGen (Generative Commonsense Reasoning)
Constrained text generation task, associated with a benchmark dataset
@kaggle.thedevastator_commongen_a_benchmark_dataset_for_generative_com
Constrained text generation task, associated with a benchmark dataset
@kaggle.thedevastator_commongen_a_benchmark_dataset_for_generative_com
The CommonGen Dataset is a new benchmark dataset for the task of generative commonsense reasoning. This task is challenging because it requires both relational reasoning using background commonsense knowledge and compositional generalization ability to work on unseen concept combinations. The CommonGen Dataset provides a set of training data and validation data for the task of generating commonsense sentences. The dataset contains a set of concept sets and target sentences that the model should generate
The CommonGen Dataset is a new benchmark dataset for the task of generative commonsense reasoning. This task is challenging because it requires both relational reasoning using background commonsense knowledge and compositional generalization ability to work on unseen concept combinations.
This guide provides an overview of the CommonGen Dataset and how to use it for generative commonsense reasoning tasks.
The CommonGen Dataset contains a set of training data, validation data, and test data. The training data consists of pairs of concepts and targets, where the target is a sentence that should be generated based on the concepts. The validation data contains pairs of concepts and targets, where the target is a sentence that should be generated based on the concepts. The test data contains pairs of concepts and targets, where the target is a sentence that should be generated based on the concepts.
To use the CommonGen Dataset for generative commonsense reasoning tasks, you will need to train a model on the training data and then evaluate your model on the validation and test data
- Achieving human-like commonsense reasoning ability in AI models
- Generating natural language descriptions of complex concept combinations
- Developing new methods for efficient data-driven commonsense knowledge acquisition
The CommonGen Dataset is a new benchmark dataset for the task of generative commonsense reasoning. This task is challenging because it requires both relational reasoning using background commonsense knowledge and compositional generalization ability to work on unseen concept combinations.
This dataset was created by the research team at the Allen Institute for Artificial Intelligence (AI2), in collaboration with academics from Carnegie Mellon University, Stanford University, and the University of Washington. We would like to thank these institutions for their support
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: validation.csv
| Column name | Description |
|---|---|
| concepts | A set of concepts that the model should generate a sentence about. (Input) |
| target | The target sentence that the model should generate. (Output) |
File: train.csv
| Column name | Description |
|---|---|
| concepts | A set of concepts that the model should generate a sentence about. (Input) |
| target | The target sentence that the model should generate. (Output) |
File: test.csv
| Column name | Description |
|---|---|
| concepts | A set of concepts that the model should generate a sentence about. (Input) |
| target | The target sentence that the model should generate. (Output) |
CREATE TABLE test (
"concept_set_idx" BIGINT,
"concepts" VARCHAR,
"target" VARCHAR
);CREATE TABLE train (
"concept_set_idx" BIGINT,
"concepts" VARCHAR,
"target" VARCHAR
);CREATE TABLE validation (
"concept_set_idx" BIGINT,
"concepts" VARCHAR,
"target" VARCHAR
);Anyone who has the link will be able to view this.