515K Hotel Reviews Data In Europe
Can you make your trip more cozy by using data science?
@kaggle.jiashenliu_515k_hotel_reviews_data_in_europe
The data was scraped from Booking.com. All data in the file is already publicly available. Please note that the data is originally owned by Booking.com.
This dataset contains 515,000 customer reviews and scores for 1493 luxury hotels across Europe. The geographical location of each hotel is also provided for further analysis.
The CSV file contains 17 fields. The description of each field is as follows:
To keep the text data clean, I removed unicode and punctuation from the text and transformed it to lower case. No other preprocessing was performed.
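For readers who want to apply comparable cleaning to their own review text, here is a minimal Python sketch of that kind of normalization. The exact procedure used for this dataset is not documented, so the details are assumptions: the function name `normalize_review` is hypothetical, "removing unicode" is interpreted as folding accents and dropping any remaining non-ASCII characters, and the whitespace collapsing is an extra step not mentioned in the description.

```python
import string
import unicodedata


def normalize_review(text: str) -> str:
    """Approximate the described cleaning: strip non-ASCII, strip punctuation, lowercase."""
    # Assumption: decompose accented characters so the base letter survives
    # when non-ASCII code points are dropped (e.g. "é" becomes "e").
    decomposed = unicodedata.normalize("NFKD", text)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    # Remove every ASCII punctuation character.
    no_punct = ascii_only.translate(str.maketrans("", "", string.punctuation))
    # Extra step (not in the description): collapse runs of whitespace.
    return " ".join(no_punct.split()).lower()


cleaned = normalize_review("The Café was GREAT, really!")
print(cleaned)
```

Text that has already been cleaned this way, such as the `positive_review` and `negative_review` columns, should pass through the function unchanged.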
The dataset is large and informative, and I believe you can have a lot of fun with it! Let me put some ideas below to further inspire Kagglers!
The ideas are unlimited! Please have a look at the data, generate some ideas, and leave a master kernel here! I am ready to upvote your ideas and kernels! Cheers!
CREATE TABLE hotel_reviews (
"hotel_address" VARCHAR,
"additional_number_of_scoring" BIGINT,
"review_date" TIMESTAMP,
"average_score" DOUBLE,
"hotel_name" VARCHAR,
"reviewer_nationality" VARCHAR,
"negative_review" VARCHAR,
"review_total_negative_word_counts" BIGINT,
"total_number_of_reviews" BIGINT,
"positive_review" VARCHAR,
"review_total_positive_word_counts" BIGINT,
"total_number_of_reviews_reviewer_has_given" BIGINT,
"reviewer_score" DOUBLE,
"tags" VARCHAR,
"days_since_review" VARCHAR,
"lat" DOUBLE,
"lng" DOUBLE
);
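
As a starting point for exploring the table, the sketch below ranks hotels by mean `reviewer_score` using Python's built-in `sqlite3` module. It is only an illustration under stated assumptions: the in-memory table keeps just two of the 17 columns, the three hotels and their scores are made-up placeholders rather than rows from the dataset, and the helper name `top_hotels` and its `min_reviews` parameter are hypothetical.

```python
import sqlite3


def top_hotels(conn, min_reviews=2):
    """Return (hotel_name, n_reviews, mean_reviewer_score), best-rated first."""
    query = """
        SELECT hotel_name,
               COUNT(*) AS n_reviews,
               ROUND(AVG(reviewer_score), 2) AS mean_reviewer_score
        FROM hotel_reviews
        GROUP BY hotel_name
        HAVING COUNT(*) >= ?
        ORDER BY mean_reviewer_score DESC
    """
    return conn.execute(query, (min_reviews,)).fetchall()


# Reduced, in-memory stand-in for the hotel_reviews table (two columns only).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE hotel_reviews ("hotel_name" VARCHAR, "reviewer_score" DOUBLE)'
)

# Placeholder rows for illustration; these are NOT real reviews.
sample_rows = [
    ("Hotel A", 9.0),
    ("Hotel A", 7.0),
    ("Hotel B", 8.5),
    ("Hotel B", 9.5),
    ("Hotel C", 10.0),
]
conn.executemany("INSERT INTO hotel_reviews VALUES (?, ?)", sample_rows)

result = top_hotels(conn)
for hotel_name, n_reviews, mean_score in result:
    print(hotel_name, n_reviews, mean_score)
```

The `HAVING` clause drops hotels with too few reviews so that a single enthusiastic review does not dominate the ranking; against the full table a much higher `min_reviews` threshold would likely be appropriate.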