Baselight

BBC News

Self updating dataset - BBC News RSS Feeds

@kaggle.gpreda_bbc_news

About this Dataset

BBC News

Context

Self updating dataset. It collects RSS Feeds from BBC News using a Kernel: https://www.kaggle.com/gpreda/bbc-news-rss-feeds.
The Kernel is run with a fixed frequency and the dataset is updated using the output of the Notebook.

Content

BBC News RSS Feeds. The data contains the following columns:

  • title
  • pubDate
  • guid
  • link
  • description

Collection method

Uses requests_html and BeautifulSoup to collect RSS Feeds from BBC News site.

Acknowledgements

The content is proprietary of BBC

Inspiration

Use the data to analyze the sentiment of news, from title and description

Tables

Bbc News

@kaggle.gpreda_bbc_news.bbc_news
  • 5.88 MB
  • 42115 rows
  • 5 columns
Loading...

CREATE TABLE bbc_news (
  "title" VARCHAR,
  "pubdate" VARCHAR,
  "guid" VARCHAR,
  "link" VARCHAR,
  "description" VARCHAR
);