WebQuestions (Questions & Answers)
6,642 questions that are popular ones asked on the web
By Huggingface Hub [source]
About this dataset
This dataset provides deep insight into the potential of crowdsourced web-based question answering. It contains 6,642 questions and answers sourced from the web all answerable using Freebase, the world’s largest knowledge graph. Get to know the opportunities posed by crowd-sourced data! The questions span a vast range of topics from entertainment to science, engaging with individuals named entities as well as complex concepts. Expand your knowledge through this unique dataset and unlock valuable insights by exploring it!
More Datasets
For more datasets, click here.
Featured Notebooks
- 🚨 Your notebook can be here! 🚨!
How to use the dataset
Guide for Using the Unlocking Questions with Web-Based Knowledge Graphs Dataset
Welcome to the Unlocking Questions with Web-Based Knowledge Graphs dataset! This dataset includes 6,642 questions and answers collected from the web that are answerable by Freebase, the world’s largest knowledge graph. In this guide, we’ll explain how to use this dataset in order to answer questions related to individual named entities.
The dataset contains two csv files: train.csv and test.csv. The train.csv file contains a collection of questions and answers related to individual named entities that can be answered using Freebase, as well as a URL for reference if needed. The test.csv file contains unanswered questions collected from the web along with their URLs for reference (providing additional context) should you need it while answering them via Freebase.
To begin using this datastet simply read in both csv files into your favorite analytics or machine learning tool or programming language such as Python or R After loading in both csvs you can iterate through each row of data available within each record set in order to answer any remaining unanswered questions present on the test set by utilizing freebase as well url references associated with each row should you need additional context when formulating an appropriate response/answer
In addition you also have access up 6646 datasets that do contain predetermined answers which may help when training your model particularly if machine learning techniques such as Natural Language Processing techniques such Supervised Learning is required due to its ability utilize features included within pre answered datasets allowing it come up with its own conclusions regarding how best approach answering new data points
Thank you for exploring our Unlocking Questions With Web Based Knowledge Graphs Dataset
Happy computing!
Research Ideas
- Analyzing the effectiveness of using Freebase to answer user questions on different topics.
- Developing predictive models for automatically recognizing and extracting information from web-based knowledge graphs.
- Identifying entities that are most frequently asked about and/or have the most accurate answers provided by Freebase
Acknowledgements
If you use this dataset in your research, please credit the original authors.
Data Source
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Columns
File: train.csv
Column name |
Description |
url |
The URL of the web page from which the question was sourced. (String) |
answers |
The answers to the question, sourced from Freebase. (String) |
File: test.csv
Column name |
Description |
url |
The URL of the web page from which the question was sourced. (String) |
answers |
The answers to the question, sourced from Freebase. (String) |
Acknowledgements
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Huggingface Hub.