A Million News Headlines
@kaggle.therohk_million_headlines
This dataset contains news headlines published over a period of nineteen years.
Sourced from the reputable Australian news source ABC (Australian Broadcasting Corporation)
Agency Site: http://www.abc.net.au
Format: CSV ; Single File
Start Date: 2003-02-19 ; End Date: 2021-12-31
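As a rough sketch of how a single CSV file like this could be read, the snippet below parses rows with Python's csv module and reports the first and last dates it sees. The column names `publish_date` (assumed to be in yyyyMMdd form) and `headline_text` are assumptions that are not stated in this description, and the two sample rows are made-up placeholders, so check the actual header before relying on them.

```python
import csv
import io
from datetime import datetime

# Assumed column names and date format; verify against the real CSV header.
DATE_COL = "publish_date"
TEXT_COL = "headline_text"
DATE_FMT = "%Y%m%d"


def read_headlines(handle):
    """Return a list of (date, headline) tuples from an open CSV handle."""
    reader = csv.DictReader(handle)
    return [
        (datetime.strptime(row[DATE_COL], DATE_FMT).date(), row[TEXT_COL])
        for row in reader
    ]


# Made-up sample rows standing in for the real file.
sample = io.StringIO(
    "publish_date,headline_text\n"
    "20030219,example headline one\n"
    "20211231,example headline two\n"
)
rows = read_headlines(sample)
first_day = min(d for d, _ in rows)
last_day = max(d for d, _ in rows)
print(first_day, last_day)
```

To use it on the real file, pass an open file handle (for example from `open(path, newline="", encoding="utf-8")`) instead of the in-memory sample.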
I look at this news dataset as a summarised historical record of noteworthy events around the globe from early 2003 to the end of 2021, with a more granular focus on Australia.
This includes the entire corpus of articles published by the abcnews website in the given date range.
With a volume of about two hundred articles per day and a good focus on international news, we can be fairly certain that every event of significance has been captured here.
Digging into the keywords, one can see the important episodes that shaped the last two decades and how they evolved over time.
Examples: Afghanistan war, financial crisis, multiple elections, ecological disasters, terrorism, famous people, criminal activity, et cetera.
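One way to trace how a keyword evolves over time is to count matching headlines per year. The sketch below is only an illustration: the helper name `keyword_counts_by_year` is hypothetical, the rows are made-up placeholders, and it assumes each row is a (yyyyMMdd date string, headline text) pair.

```python
from collections import Counter


def keyword_counts_by_year(rows, keyword):
    """Count headlines per year whose text contains `keyword` (case-insensitive)."""
    needle = keyword.lower()
    counts = Counter()
    for date_str, text in rows:
        if needle in text.lower():
            counts[date_str[:4]] += 1  # first four characters are the year
    return dict(sorted(counts.items()))


# Made-up placeholder rows as (yyyyMMdd, headline) pairs.
sample_rows = [
    ("20080915", "placeholder headline about a financial crisis"),
    ("20081001", "another financial crisis placeholder"),
    ("20100621", "placeholder headline about an election"),
]
crisis_by_year = keyword_counts_by_year(sample_rows, "financial crisis")
print(crisis_by_year)
```

Plain substring matching is used here for simplicity; it will also match keywords embedded in longer words, so a tokenised comparison may be preferable for short keywords.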
Similar news datasets exploring other attributes, countries and topics can be seen on my profile.
Most kernels can be reused with minimal changes across these news datasets.
Prepared by Rohit Kulkarni