Amazon Product Reviews Dataset
Amazon Product Reviews Dataset - Topic Modelling Problem
@kaggle.yasserh_amazon_product_reviews_dataset
Amazon Product Reviews Dataset - Topic Modelling Problem
@kaggle.yasserh_amazon_product_reviews_dataset
The dataset consists of samples from Amazon Ratings for select products. The reviews are picked randomly and the corpus has nearly 1.6k reviews of different customers.
Amazon aims to understand what are the main topics of these reviews to classify them for easier search.
Can you build a strong model that differentiates the topics based on the reviews corpus?
The dataset is referred from Kaggle.
CREATE TABLE n_7817_1 (
"id" VARCHAR,
"asins" VARCHAR,
"brand" VARCHAR,
"categories" VARCHAR,
"colors" VARCHAR,
"dateadded" VARCHAR,
"dateupdated" VARCHAR,
"dimension" VARCHAR,
"ean" DOUBLE,
"keys" VARCHAR,
"manufacturer" VARCHAR,
"manufacturernumber" VARCHAR,
"name" VARCHAR,
"prices" VARCHAR,
"reviews_date" VARCHAR,
"reviews_dorecommend" VARCHAR,
"reviews_numhelpful" DOUBLE,
"reviews_rating" DOUBLE,
"reviews_sourceurls" VARCHAR,
"reviews_text" VARCHAR,
"reviews_title" VARCHAR,
"reviews_usercity" VARCHAR,
"reviews_userprovince" VARCHAR,
"reviews_username" VARCHAR,
"sizes" VARCHAR,
"upc" DOUBLE,
"weight" VARCHAR
);Anyone who has the link will be able to view this.