14 Categories of News Articles with Headline and body.
Dataset Description
Context
The data was created for my Academic Project entitled News-Article-Classifier. This dataset can be used to train models to classify news articles into different categories.
Content
It Contains 6877 unique data about News Articles published in HuffPost. The categories include ARTS & CULTURE, BUSINESS, COMEDY, CRIME, EDUCATION, ENTERTAINMENT, ENVIRONMENT, MEDIA, POLITICS, RELIGION, SCIENCE, SPORTS, TECH, WOMEN.
Categories and corresponding article counts are as follows:
ARTS AND CULTURE: 1002BUSINESS: 501COMEDY: 380CRIME: 300EDUCATION: 490ENTERTAINMENT: 501ENVIRONMENT: 501MEDIA: 347POLITICS: 501RELIGION: 501SCIENCE: 350SPORTS: 501TECH: 501WOMEN: 501
Acknowledgements
The data was created with the help of News Category Dataset and scrapped from HuffPost
Inspiration
- Do news articles from different categories have different writing styles?
- What kinds of words contribute to each of the categories in News Articles?
Citation
If you're using this dataset for research purposes, please use the following BibTex for citations:
@dataset{dataset,
author = {Timilsina, Bimal},
year = {2021},
month = {08},
pages = {},
title = {News Article Category Dataset},
}
Related Datasets
-
Burmese News Category Dataset
@kaggle