USA News Dataset
Information of news articles and click histories
@kaggle.vinayakshanawad_us_news_dataset
Information of news articles and click histories
@kaggle.vinayakshanawad_us_news_dataset
Construct two types of models -- (A) a deep learning classifier such as LSTM or similar model to predict the category of a news article given its title and abstract, and (B) A recommendation system to recommend posts that a user is most likely to click.
The dataset consists of two files -- (1) user_news_clicks.csv, and (2) news_text.csv.
Model A, the deep learning classifier only requires the news_text.csv dataset. The goal is to predict the ‘category’ label using the ‘title’ and ‘abstract; columns. Model B, the recommendation system only requires user_news_clicks.csv but you can use the news_text.csv in addition if you’d like though it is not necessary for this exercise. The goal is to be able to recommend users news articles that they’re likely to click.
In news_text.csv - each record consists of three attributes and a target variable:
In user_news_clicks.csv - each record consists of two attributes and a target variable:
NOTE: We do not need to use the entire dataset, if resources are limited. Feel free to sample.
CREATE TABLE news_text (
"news_id" VARCHAR,
"title" VARCHAR,
"abstract" VARCHAR,
"category" VARCHAR
);
CREATE TABLE user_news_clicks (
"user_id" VARCHAR,
"item" VARCHAR,
"click" BIGINT
);
Anyone who has the link will be able to view this.