Baselight

🛒 Amazon Reviews For Senti-Analysis Binary -N/P+

Contains train.csv + test.csv + readme.csv

@kaggle.yacharki_amazon_reviews_for_sa_binary_negative_positive_csv

About this Dataset

The Amazon reviews polarity dataset is constructed by taking review scores 1 and 2 as negative, and 4 and 5 as positive. Samples with a score of 3 are ignored. In the dataset, class 1 is negative and class 2 is positive. Each class has 1,800,000 training samples and 200,000 testing samples.
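The labeling rule and the resulting dataset sizes can be sketched in a few lines of Python. This is only an illustration of the rule described above, not the code used to build the dataset; the function name `polarity_class` and the constant names are hypothetical.

```python
def polarity_class(score):
    """Map a 1-5 star score to a class index.

    Returns 1 (negative) for scores 1 and 2, 2 (positive) for scores
    4 and 5, and None for score 3, which the dataset ignores.
    """
    if score in (1, 2):
        return 1
    if score in (4, 5):
        return 2
    if score == 3:
        return None
    raise ValueError(f"score must be between 1 and 5, got {score!r}")


# Per-class counts as stated in the description.
TRAIN_PER_CLASS = 1_800_000
TEST_PER_CLASS = 200_000
NUM_CLASSES = 2

# Totals implied by two balanced classes.
total_train = TRAIN_PER_CLASS * NUM_CLASSES
total_test = TEST_PER_CLASS * NUM_CLASSES
```

With two balanced classes, the per-class counts imply 3,600,000 training samples and 400,000 testing samples in total.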

The files train.csv and test.csv contain the training and testing samples, respectively, as comma-separated values. Each file has 3 columns, corresponding to class index (1 or 2), review title and review text. The review title and text are enclosed in double quotes ("), and any internal double quote is escaped as 2 double quotes (""). New lines are escaped as a backslash followed by an "n" character, that is "\n".
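A minimal sketch of reading this format with Python's standard `csv` module follows. It assumes the files have no header row and that every row has exactly three fields; the helper name `read_reviews` and the sample line are made up for illustration and are not taken from the dataset.

```python
import csv
import io


def read_reviews(fileobj):
    """Yield (label, title, text) tuples from a file in the described format."""
    # doublequote=True makes the reader turn "" inside a quoted field into ".
    reader = csv.reader(fileobj, quotechar='"', doublequote=True)
    for label, title, text in reader:
        # Turn the two-character sequence backslash + "n" back into a newline.
        yield int(label), title.replace("\\n", "\n"), text.replace("\\n", "\n")


# Illustrative sample line (not real data): doubled quotes and an escaped newline.
sample = '"2","Great ""value""","Works well.\\nWould buy again."\n'
rows = list(read_reviews(io.StringIO(sample)))
```

To read a real file, open it with `open("train.csv", newline="", encoding="utf-8")` and pass the file object to `read_reviews`; the UTF-8 encoding is an assumption. Note that a review containing a literal backslash followed by "n" cannot be told apart from an escaped newline under this scheme.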
