WikiSQL (Questions And SQL Queries)
80654 hand-annotated questions and SQL queries on 24241 Wikipedia tables
@kaggle.thedevastator_dataset_for_developing_natural_language_interfac
80654 hand-annotated questions and SQL queries on 24241 Wikipedia tables
@kaggle.thedevastator_dataset_for_developing_natural_language_interfac
By Huggingface Hub [source]
A large crowd-sourced dataset for developing natural language interfaces for relational databases.
WikiSQL is a dataset of 80654 hand-annotated examples of questions and SQL queries distributed across 24241 tables from Wikipedia.
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset can be used to develop natural language interfaces for relational databases. The data fields are the same among all splits, and the file contains information on the phase, question, table, and SQL for each interface
- This dataset can be used to develop natural language interfaces for relational databases.
- This dataset can be used to develop a knowledge base of common SQL queries.
- This dataset can be used to generate a training set for a neural network that translates natural language into SQL queries
If you use this dataset in your research, please credit the original authors.
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: validation.csv
| Column name | Description |
|---|---|
| phase | The phase of the data collection. (String) |
| question | The question asked by the user. (String) |
| table | The table containing the data for the question. (String) |
| sql | The SQL query corresponding to the question. (String) |
File: train.csv
| Column name | Description |
|---|---|
| phase | The phase of the data collection. (String) |
| question | The question asked by the user. (String) |
| table | The table containing the data for the question. (String) |
| sql | The SQL query corresponding to the question. (String) |
File: test.csv
| Column name | Description |
|---|---|
| phase | The phase of the data collection. (String) |
| question | The question asked by the user. (String) |
| table | The table containing the data for the question. (String) |
| sql | The SQL query corresponding to the question. (String) |
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Huggingface Hub.
CREATE TABLE test (
"phase" BIGINT,
"question" VARCHAR,
"table" VARCHAR,
"sql" VARCHAR
);CREATE TABLE train (
"phase" BIGINT,
"question" VARCHAR,
"table" VARCHAR,
"sql" VARCHAR
);CREATE TABLE validation (
"phase" BIGINT,
"question" VARCHAR,
"table" VARCHAR,
"sql" VARCHAR
);Anyone who has the link will be able to view this.