Baselight

MLQA - Multilingual Question-Answering

Multilingual Question-Answering Dataset

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset

Loading...
Loading...

About this Dataset

MLQA - Multilingual Question-Answering


MLQA - Multilingual Question-Answering

Multilingual Question-Answering Dataset

By mlqa (From Huggingface) [source]


About this dataset

The dataset consists of several files in CSV format that provide context passages or paragraphs along with corresponding questions and answer options. The context passages serve as the source of information from which the questions are derived, and the answer options are potential answers to these questions.

Each file in the dataset contains different language combinations for evaluation purposes. For example, mlqa.es.zh_test.csv focuses on testing multilingual question-answering models in Spanish and Chinese languages. Similarly, mlqa.hi.de_test.csv provides test data specifically for evaluating Hindi-German language pairs.

In order to facilitate accurate evaluation of models' performance, each file includes multiple columns for context and answers. This allows researchers to assess how well their models can generate correct answers based on the given contexts.

Research Ideas

  • Evaluation of multilingual question-answering models: This dataset can be used to evaluate the performance of different models designed for multilingual question-answering. By providing context, question, and answer pairs in multiple languages, it allows researchers to measure the accuracy and effectiveness of their models across different language pairs.
  • Cross-lingual transfer learning: The MLQA dataset can be utilized to develop cross-lingual transfer learning techniques. Models trained on this dataset can learn to perform question-answering tasks in one language and then transfer that knowledge to answer questions in another language.
  • Language understanding research: Researchers studying natural language processing (NLP) and language understanding can use this dataset to analyze how different languages handle questions and answers within various contexts. They can explore linguistic patterns, variations, and differences across languages by comparing the performance of NLP models trained on this dataset for both similar and dissimilar language pairs

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: mlqa.es.zh_test.csv

Column name Description
context The text passage or paragraph in which a question is being asked. (Text)
answers The possible answers to the question, along with their start and end positions within the context passage. (Text)

File: mlqa.hi.de_test.csv

Column name Description
context The text passage or paragraph in which a question is being asked. (Text)
answers The possible answers to the question, along with their start and end positions within the context passage. (Text)

File: mlqa.zh.de_test.csv

Column name Description
context The text passage or paragraph in which a question is being asked. (Text)
answers The possible answers to the question, along with their start and end positions within the context passage. (Text)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit mlqa (From Huggingface).

Tables

Mlqa Es Hi Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_es_hi_test
  • 815.53 KB
  • 1723 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_es_hi_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Es Hi Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_es_hi_validation
  • 106.2 KB
  • 187 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_es_hi_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Es Vi Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_es_vi_test
  • 1 MB
  • 2018 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_es_vi_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Es Vi Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_es_vi_validation
  • 110.41 KB
  • 189 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_es_vi_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Es Zh Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_es_zh_test
  • 975.57 KB
  • 1947 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_es_zh_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Es Zh Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_es_zh_validation
  • 85.07 KB
  • 161 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_es_zh_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Ar Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_ar_test
  • 1.37 MB
  • 1831 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_ar_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Ar Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_ar_validation
  • 130.65 KB
  • 186 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_ar_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi De Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_de_test
  • 1.08 MB
  • 1430 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_de_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi De Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_de_validation
  • 111.28 KB
  • 163 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_de_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi En Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_en_test
  • 3.71 MB
  • 4918 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_en_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi En Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_en_validation
  • 355.32 KB
  • 507 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_en_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Es Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_es_test
  • 1.29 MB
  • 1723 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_es_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Es Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_es_validation
  • 141.98 KB
  • 187 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_es_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Hi Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_hi_test
  • 3.8 MB
  • 4918 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_hi_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Hi Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_hi_validation
  • 365.09 KB
  • 507 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_hi_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Vi Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_vi_test
  • 1.5 MB
  • 1947 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_vi_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Vi Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_vi_validation
  • 148.16 KB
  • 177 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_vi_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Zh Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_zh_test
  • 1.42 MB
  • 1767 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_zh_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Hi Zh Validation

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_hi_zh_validation
  • 142.49 KB
  • 189 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_hi_zh_validation (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Translate Test Ar Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_translate_test_ar_test
  • 2.76 MB
  • 5335 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_translate_test_ar_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Translate Test De Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_translate_test_de_test
  • 2.08 MB
  • 4517 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_translate_test_de_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Translate Test Es Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_translate_test_es_test
  • 2.26 MB
  • 5253 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_translate_test_es_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Translate Test Hi Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_translate_test_hi_test
  • 2.27 MB
  • 4918 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_translate_test_hi_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Mlqa Translate Test Vi Test

@kaggle.thedevastator_mlqa_multilingual_question_answering_dataset.mlqa_translate_test_vi_test
  • 3.09 MB
  • 5495 rows
  • 4 columns
Loading...

CREATE TABLE mlqa_translate_test_vi_test (
  "context" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR,
  "id" VARCHAR
);

Share link

Anyone who has the link will be able to view this.