Containes train.csv + test.csv + readme.csv

The Amazon reviews polarity dataset is constructed by taking review score 1 and 2 as negative, and 4 and 5 as positive. Samples of score 3 is ignored. In the dataset, class 1 is the negative and class 2 is the positive. Each class has 1,800,000 training samples and 200,000 testing samples.

The files train.csv and test.csv contain all the training samples as comma-sparated values. There are 3 columns in them, corresponding to class index (1 or 2), review title and review text. The review title and text are escaped using double quotes ("), and any internal double quote is escaped by 2 double quotes (""). New lines are escaped by a backslash followed with an "n" character, that is "\n".

Related Datasets

🏪Yelp Reviews For Senti-Analysis Binary -N/P+

@kaggle
Yahoo Finance Historical Prices And Ticker Fundamentals

@yahoo
SFC2014 - REACT EU Overview Allocation Vs Decided

@esifunds
Lookup Comparison Of 2017-13 V 2014-2020 Thematic Categorisation Codes

@esifunds
Lookup Comparison Of 2017-13 V 2014-2020 Thematic Categorisation Codes

@esifunds
TGS SC2 Nasal Positivity

@cdc

🏪Yelp Reviews For Senti-Analysis Binary -N/P+

Yahoo Finance Historical Prices And Ticker Fundamentals

SFC2014 - REACT EU Overview Allocation Vs Decided

Lookup Comparison Of 2017-13 V 2014-2020 Thematic Categorisation Codes

Lookup Comparison Of 2017-13 V 2014-2020 Thematic Categorisation Codes

TGS SC2 Nasal Positivity