
Scrapped Chilean News Articles From BioBioChile

Kaggle

@kaggle.sebastiantarebustos_scrapped_chilean_news_articles_fro_dbb038c7


Chilean news articles scraped for a SaaS that summarizes and filters news.

Dataset Description

Scraped news articles from https://www.biobiochile.cl/.

Use ; as the delimiter when loading the data.
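As a minimal sketch of what the delimiter note means in practice, the snippet below parses semicolon-separated text with Python's standard csv module. The column names (title, category, body) and the sample row are made up for illustration; the real file's columns may differ.

```python
import csv
import io

# Hypothetical sample in a semicolon-delimited layout.
# The column names and the row are illustrative, not taken from the dataset.
SAMPLE = 'title;category;body\n"Example headline";national;"First part; second part"\n'


def read_articles(handle):
    """Parse semicolon-delimited rows into a list of dicts keyed by header."""
    return list(csv.DictReader(handle, delimiter=";"))


rows = read_articles(io.StringIO(SAMPLE))
print(rows[0]["category"])
```

For a file on disk, the same function should work with a handle opened as open(path, newline="", encoding="utf-8"), where path is wherever you saved the download. Quoted fields that contain a ; are kept intact, as the sample body shows.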

Here you will find a collection of news articles I scraped from 2023-07-17 06:31:00 to 2023-10-07 19:52:00 (about 82 days).
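The "about 82 days" figure can be checked from the two timestamps above. This is just a quick sanity check using Python's datetime module; the timestamp format string is an assumption based on how the dates are written here.

```python
from datetime import datetime

# Format assumed from the way the timestamps are written in the description.
FMT = "%Y-%m-%d %H:%M:%S"

first = datetime.strptime("2023-07-17 06:31:00", FMT)
last = datetime.strptime("2023-10-07 19:52:00", FMT)

span = last - first
print(span.days)  # whole days between the first and last article
```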

This comes from a personal project I'm building to summarize news articles, so I can avoid clickbait, skip scrolling through ads, and save time reading the news (here is the link for the project).

Most of the articles are from the "national" category, so expect less data for other categories such as "sports".

I also scraped the HTML, so I can come back later to extract more data, such as images and views, and to debug my scraper for some special cases.
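Re-parsing the stored HTML for extra fields could look roughly like the sketch below, which collects image URLs using only the standard library. The sample markup is invented, and the real BioBioChile pages may need different handling (lazy-loaded images, for example), so treat this as a starting point and not as the scraper used to build the dataset.

```python
from html.parser import HTMLParser


class ImageCollector(HTMLParser):
    """Collect the src attribute of every <img> tag that has one."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def image_sources(html):
    """Return the image URLs found in an HTML string, in document order."""
    parser = ImageCollector()
    parser.feed(html)
    parser.close()
    return parser.sources


# Invented markup for illustration only.
SAMPLE_HTML = (
    '<article><h1>Example</h1>'
    '<img src="https://example.com/a.jpg">'
    '<p>Text</p><img alt="no source"></article>'
)
print(image_sources(SAMPLE_HTML))
```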

Some articles don't have much data. The newsroom seems to treat certain articles as works in progress shared among several people, especially "news in development", so these have basically a title and almost no body text.
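If those near-empty articles get in the way, one option is to drop rows whose body is shorter than some cutoff. The sketch below assumes each row is a dict with a "body" key and uses a 200-character threshold; both the column name and the threshold are hypothetical choices, not properties of the dataset.

```python
# Arbitrary cutoff, chosen only for illustration.
MIN_BODY_CHARS = 200


def is_stub(row, min_chars=MIN_BODY_CHARS):
    """Return True when the article body is missing or shorter than min_chars."""
    body = (row.get("body") or "").strip()
    return len(body) < min_chars


# Invented rows; "body" is an assumed column name.
articles = [
    {"title": "Developing story", "body": "More details soon."},
    {"title": "Full report", "body": "x" * 500},
    {"title": "No body at all", "body": None},
]

complete = [row for row in articles if not is_stub(row)]
print([row["title"] for row in complete])
```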

