Baselight

WebQuestions (Questions & Answers)

6,642 questions that are popular ones asked on the web

@kaggle.thedevastator_unlock_the_power_of_knowledge_graphs_with_questi

Loading...
Loading...

About this Dataset

WebQuestions (Questions & Answers)


WebQuestions (Questions & Answers)

6,642 questions that are popular ones asked on the web

By Huggingface Hub [source]


About this dataset

This dataset provides deep insight into the potential of crowdsourced web-based question answering. It contains 6,642 questions and answers sourced from the web all answerable using Freebase, the world’s largest knowledge graph. Get to know the opportunities posed by crowd-sourced data! The questions span a vast range of topics from entertainment to science, engaging with individuals named entities as well as complex concepts. Expand your knowledge through this unique dataset and unlock valuable insights by exploring it!

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

Guide for Using the Unlocking Questions with Web-Based Knowledge Graphs Dataset

Welcome to the Unlocking Questions with Web-Based Knowledge Graphs dataset! This dataset includes 6,642 questions and answers collected from the web that are answerable by Freebase, the world’s largest knowledge graph. In this guide, we’ll explain how to use this dataset in order to answer questions related to individual named entities.

The dataset contains two csv files: train.csv and test.csv. The train.csv file contains a collection of questions and answers related to individual named entities that can be answered using Freebase, as well as a URL for reference if needed. The test.csv file contains unanswered questions collected from the web along with their URLs for reference (providing additional context) should you need it while answering them via Freebase.

To begin using this datastet simply read in both csv files into your favorite analytics or machine learning tool or programming language such as Python or R After loading in both csvs you can iterate through each row of data available within each record set in order to answer any remaining unanswered questions present on the test set by utilizing freebase as well url references associated with each row should you need additional context when formulating an appropriate response/answer

In addition you also have access up 6646 datasets that do contain predetermined answers which may help when training your model particularly if machine learning techniques such as Natural Language Processing techniques such Supervised Learning is required due to its ability utilize features included within pre answered datasets allowing it come up with its own conclusions regarding how best approach answering new data points

Thank you for exploring our Unlocking Questions With Web Based Knowledge Graphs Dataset

Happy computing!

Research Ideas

  • Analyzing the effectiveness of using Freebase to answer user questions on different topics.
  • Developing predictive models for automatically recognizing and extracting information from web-based knowledge graphs.
  • Identifying entities that are most frequently asked about and/or have the most accurate answers provided by Freebase

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: train.csv

Column name Description
url The URL of the web page from which the question was sourced. (String)
answers The answers to the question, sourced from Freebase. (String)

File: test.csv

Column name Description
url The URL of the web page from which the question was sourced. (String)
answers The answers to the question, sourced from Freebase. (String)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Huggingface Hub.

Tables

Test

@kaggle.thedevastator_unlock_the_power_of_knowledge_graphs_with_questi.test
  • 128.62 KB
  • 2032 rows
  • 3 columns
Loading...

CREATE TABLE test (
  "url" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR
);

Train

@kaggle.thedevastator_unlock_the_power_of_knowledge_graphs_with_questi.train
  • 220.21 KB
  • 3778 rows
  • 3 columns
Loading...

CREATE TABLE train (
  "url" VARCHAR,
  "question" VARCHAR,
  "answers" VARCHAR
);

Share link

Anyone who has the link will be able to view this.