Baselight

Johnson & Johnson OGX Product Reviews

Reviews of OGX Products Involved in Class Action Lawsuit

@kaggle.winston56_johnson_johnson_ogx_product_reviews

Loading...
Loading...

About this Dataset

Johnson & Johnson OGX Product Reviews

Context

In June 2021 a proposed Class Action Lawsuit was filed against Johnson & Johnson claiming that certain OGX shampoos and conditioners contain and ingredient linked to hair loss and scalp irritation.

The complaint and a description of the lawsuit and products can be found in the above link. After seeing this lawsuit, I wondered if there was any evidence of consumer harm online. I decided to scrape reviews of the OGX product line from two well known websites.

Maybe these reviews can give a glimpse into the effects these hair products have had, and whether injuries could have been identified earlier. This should also be a good test to determine if other consumer harms can be discovered using these methods in the future.

Content

The content in here contains reviews scraped from MakeupAlley and Ulta Beauty. The two datasets are MakeupAlley_Reviews and Ulta_Reviews, and the names reflect where the reviews come from. All of the data has been cleaned of any identifying information such as usernames or locations. Both datasets contain the same four columns:

  • Date - The date the review was made. For the Ulta dataset we have accurate information all the way down to the day the post was made. In the MakeupAlley dataset our information is only accurate to the month and year of the post.

  • Rating - The number of stars (out of 5) a reviewer gives a product (ex. 2/5 stars = 2).

  • Review - The actual review written by the reviewer.

  • Product - The name of the product being reviewed.

Warning:

This dataset is not the cleanest. The Ulta dataset will have duplicate values in the review column. For some reason the website classified some reviews as applying to multiple products. For example, a shampoo and conditioner with the same name will have the same 1000 or so reviews when you load their review pages. I wanted to accurately reflect the reviews on the website so I simply created the datasets from the data that I scraped, even though some products share the same reviews.

The only problem is now we might not know exactly which product the reviewer is referring to. So this is an interesting problem for you to figure out. Good luck!

Tables

Makeupalley Reviews

@kaggle.winston56_johnson_johnson_ogx_product_reviews.makeupalley_reviews
  • 766.47 KB
  • 2194 rows
  • 4 columns
Loading...

CREATE TABLE makeupalley_reviews (
  "date" TIMESTAMP,
  "rating" DOUBLE,
  "review" VARCHAR,
  "product" VARCHAR
);

Ulta Reviews

@kaggle.winston56_johnson_johnson_ogx_product_reviews.ulta_reviews
  • 1.75 MB
  • 20201 rows
  • 4 columns
Loading...

CREATE TABLE ulta_reviews (
  "date" TIMESTAMP,
  "rating" BIGINT,
  "review" VARCHAR,
  "product" VARCHAR
);

Share link

Anyone who has the link will be able to view this.