Baselight

🛒 Amazon Reviews For SA Fine-grained 5 Classes

Containes train.csv + test.csv + readme.csv

@kaggle.yacharki_amazon_reviews_for_sentianalysis_finegrained_csv

About this Dataset

🛒 Amazon Reviews For SA Fine-grained 5 Classes

The Amazon reviews full score dataset is constructed by randomly taking 600,000 training samples and 130,000 testing samples for each review score from 1 to 5. In total there are 3,000,000 trainig samples and 650,000 testing samples.

The files train.csv and test.csv contain all the training samples as comma-sparated values. There are 3 columns in them, corresponding to class index (1 to 5), review title and review text. The review title and text are escaped using double quotes ("), and any internal double quote is escaped by 2 double quotes (""). New lines are escaped by a backslash followed with an "n" character, that is "\n".