Persian News Dataset
Ideal for NLP tasks, sentiment analysis, topic modeling, and more.
@kaggle.amirzenoozi_persian_news_dataset
Ideal for NLP tasks, sentiment analysis, topic modeling, and more.
@kaggle.amirzenoozi_persian_news_dataset
By Using This Dataset You Will Have Access To 391,749 News From FarsNews Agency (178,480), MehrNews Agency (87,471), MashreghNews Agency (53,414), ISNA Agency (51,779), and KhabarOnline Agency (20,605). This Dataset Includes Title, Description, Publish Date, Service, Category, and Tags. In The Future, We Will Update This Dataset
CREATE TABLE archive_v2 (
"id" BIGINT,
"title" VARCHAR,
"short_link" VARCHAR,
"service" VARCHAR,
"subgroup" VARCHAR,
"abstract" VARCHAR,
"body" VARCHAR,
"tags" VARCHAR,
"published_datetime" VARCHAR,
"agency_name" VARCHAR
);CREATE TABLE archive_v3 (
"id" VARCHAR,
"title" VARCHAR,
"short_link" VARCHAR,
"service" VARCHAR,
"subgroup" VARCHAR,
"abstract" VARCHAR,
"body" VARCHAR,
"tags" VARCHAR,
"published_datetime" VARCHAR,
"agency_name" VARCHAR
);CREATE TABLE archive_v4 (
"id" VARCHAR,
"title" VARCHAR,
"short_link" VARCHAR,
"service" VARCHAR,
"subgroup" VARCHAR,
"abstract" VARCHAR,
"body" VARCHAR,
"tags" VARCHAR,
"published_datetime" VARCHAR,
"agency_name" VARCHAR
);CREATE TABLE archive_v5 (
"id" VARCHAR,
"title" VARCHAR,
"short_link" VARCHAR,
"service" VARCHAR,
"subgroup" VARCHAR,
"abstract" VARCHAR,
"body" VARCHAR,
"tags" VARCHAR,
"published_datetime" VARCHAR,
"agency_name" VARCHAR
);CREATE TABLE n__output (
"id" BIGINT,
"title" VARCHAR,
"short_link" VARCHAR,
"service" VARCHAR,
"subgroup" VARCHAR,
"abstract" VARCHAR,
"body" VARCHAR,
"tags" VARCHAR,
"published_datetime" TIMESTAMP,
"agency_name" VARCHAR
);Anyone who has the link will be able to view this.