Name: Quoref (Q&A For Coreference Resolution)
Creator: Kaggle
Published: 2025-02-13T08:24:50.842Z
License: https://creativecommons.org/publicdomain/zero/1.0/

Resolving Coreferences to Answer Questions

Source

Huggingface Hub: link
Original Author: AllenAI

About this dataset

Quoref is a dataset meant to test the coreferential reasoning capability of reading comprehension systems. It contains 24,000 questions over 4,700 paragraphs from Wikipedia pages. To answer a question, a system must resolve complex coreferences before selecting the appropriate span(s) in the paragraph. The data fields are question, context, title, url, and answers. Because the context accompanies each question, systems can not only answer the questions but also point to evidence in the context that supports their answers.
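Since the answers are spans of the context, one simple sanity check is to locate each answer string inside its paragraph. The following is a minimal sketch, not part of the dataset's tooling: the function name `find_answer_spans` and the example sentence are hypothetical, and taking the first occurrence of each answer is an assumption (the listed columns do not include character offsets).

```python
def find_answer_spans(context, answers):
    """Return (start, end) character offsets for each answer, or None if absent.

    Assumption: the first occurrence of the answer string is the intended span.
    """
    spans = []
    for answer in answers:
        start = context.find(answer)
        spans.append((start, start + len(answer)) if start != -1 else None)
    return spans


# Hypothetical example: answering "Who gave Bob a book?" requires
# resolving "She" back to "Alice".
context = "Alice met Bob at the station. She gave him a book."
spans = find_answer_spans(context, ["Alice", "Carol"])
print(spans)  # [(0, 5), None]
```

A `None` entry flags an answer that does not appear verbatim in the context, which is worth inspecting before training or evaluation.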

Research Ideas

Test the coreferential reasoning capability of reading comprehension systems.
Build or evaluate systems that resolve hard coreferences before selecting the appropriate span(s) in the paragraphs to answer questions.

Acknowledgements

Original Author: AllenAI

License

> License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
> No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: validation.csv

Column name	Description
question	The question text. (String)
context	The context paragraph(s) for the question. (String)
title	The title of the Wikipedia page from which the context was extracted. (String)
url	The URL of the Wikipedia page from which the context was extracted. (String)
answers	The answer span(s) for the question. (List of strings)

File: train.csv

Column name	Description
question	The question text. (String)
context	The context paragraph(s) for the question. (String)
title	The title of the Wikipedia page from which the context was extracted. (String)
url	The URL of the Wikipedia page from which the context was extracted. (String)
answers	The answer span(s) for the question. (List of strings)
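The column layout above can be read with Python's standard `csv` module. The sketch below is hedged: how the `answers` column is serialized inside the CSV is not stated here, so `parse_answers` assumes a Python-literal list of strings (for example `['Alice']`) and falls back to treating the cell as a single answer. The helper names and the sample row are hypothetical, and an in-memory sample stands in for `train.csv` so the example runs on its own.

```python
import ast
import csv
import io

COLUMNS = ["question", "context", "title", "url", "answers"]


def parse_answers(raw):
    """Convert a raw `answers` cell into a list of strings.

    Assumption: the cell holds a Python-literal list; otherwise the
    whole cell is treated as one answer string.
    """
    try:
        value = ast.literal_eval(raw)
    except (ValueError, SyntaxError):
        return [raw]
    if isinstance(value, (list, tuple)):
        return [str(v) for v in value]
    return [str(value)]


def read_rows(handle):
    """Read rows from an open CSV handle and parse the `answers` column."""
    reader = csv.DictReader(handle)
    missing = set(COLUMNS) - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")
    rows = []
    for row in reader:
        row["answers"] = parse_answers(row["answers"])
        rows.append(row)
    return rows


# Build a hypothetical one-row sample in memory (not real dataset content).
buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(COLUMNS)
writer.writerow([
    "Who gave Bob a book?",
    "Alice met Bob at the station. She gave him a book.",
    "Example page",
    "https://example.org/wiki/Example",
    "['Alice']",
])
buffer.seek(0)

rows = read_rows(buffer)
print(rows[0]["question"], rows[0]["answers"])
```

For the real files, the same function could be called as `read_rows(open("train.csv", newline="", encoding="utf-8"))`, assuming the file sits in the working directory; check a few parsed rows first, since the actual serialization of `answers` may differ from the assumption made here.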
