Comprehensive News Articles Dataset
An All-Encompassing Dataset of News Articles for Multi-Domain Analysis
@kaggle.khushikhushikhushi_comprehensive_news_articles_dataset
An All-Encompassing Dataset of News Articles for Multi-Domain Analysis
@kaggle.khushikhushikhushi_comprehensive_news_articles_dataset
This dataset is a collection of news articles gathered from various sources, spanning multiple categories such as Technology, Sports, Finance, Politics, Education, and Health. It is designed to provide a diverse and extensive set of data for natural language processing (NLP) tasks, sentiment analysis, topic modeling, and other machine learning applications.
The dataset includes articles from the following categories:
Each article is accompanied by the following attributes:
source
: The source from where the article was retrieved.author
: The author of the article.title
: The title of the article.description
: A brief description or summary of the article.url
: The URL of the full article.urlToImage
: The URL to an image associated with the article.publishedAt
: The publication date of the article.content
: The full content of the article.category
: The category to which the article belongs.This dataset can be used for a variety of tasks including:
CREATE TABLE news_dataset (
"source" VARCHAR,
"author" VARCHAR,
"title" VARCHAR,
"description" VARCHAR,
"url" VARCHAR,
"urltoimage" VARCHAR,
"publishedat" VARCHAR,
"content" VARCHAR,
"category" VARCHAR
);
Anyone who has the link will be able to view this.