Baselight

🏪Yelp Reviews For Senti-Analysis Binary -N/P+

Containes train.csv + test.csv + readme.csv

@kaggle.yacharki_yelp_reviews_for_sentianalysis_binary_np_csv

About this Dataset

🏪Yelp Reviews For Senti-Analysis Binary -N/P+

The Yelp reviews polarity dataset is constructed by considering stars 1 and 2 negative, and 3 and 4 positive. For each polarity 280,000 training samples and 19,000 testing samples are take randomly. In total there are 560,000 trainig samples and 38,000 testing samples. Negative polarity is class 1, and positive class 2.

The files train.csv and test.csv contain all the training samples as comma-sparated values. There are 2 columns in them, corresponding to class index (1 and 2) and review text. The review texts are escaped using double quotes ("), and any internal double quote is escaped by 2 double quotes (""). New lines are escaped by a backslash followed with an "n" character, that is "\n".

Share link

Anyone who has the link will be able to view this.