Baselight

Cosmetics Datasets

Cosmetics ingredients, rank, price and effects on skin

@kaggle.kingabzpro_cosmetics_datasets


About this Dataset

Cosmetics Datasets

Context

Whenever I want to try a new cosmetic item, it's so difficult to choose. It's actually more than difficult. It's sometimes scary because new items that I've never tried end up giving me skin trouble. We know the information we need is on the back of each product, but it's really hard to interpret those ingredient lists unless you're a chemist. You may be able to relate to this situation.

Content

We are going to create a content-based recommendation system where the 'content' is the chemical components of cosmetics. Specifically, we will process the ingredient lists of 1472 cosmetics sold on Sephora using word embedding, then visualize ingredient similarity with t-SNE, a machine learning method for dimensionality reduction, and Bokeh, an interactive visualization library. Let's inspect our data first.
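To make the idea of ingredient-based similarity concrete, here is a minimal sketch in plain Python. It is not the pipeline described above: it swaps the word embedding for a simple binary "bag of ingredients" vector per product and compares products with cosine similarity, leaving out t-SNE and Bokeh entirely. The product names, ingredient lists, and the helper names `tokenize`, `encode`, `cosine`, and `most_similar` are all hypothetical.

```python
import math

# Hypothetical sample rows; names and ingredient lists are made up for illustration.
products = {
    "Cream A": "Water, Glycerin, Dimethicone, Squalane",
    "Cream B": "Water, Glycerin, Squalane, Tocopherol",
    "Gel C": "Alcohol, Salicylic Acid, Niacinamide",
}

def tokenize(ingredients):
    """Split a comma-separated ingredient string into lowercase tokens.

    Assumption: a comma always separates two ingredients. Real lists may
    contain names with embedded commas, which this sketch does not handle.
    """
    return [tok.strip().lower() for tok in ingredients.split(",") if tok.strip()]

# Build the vocabulary: one column index per distinct ingredient.
vocab = {}
for text in products.values():
    for tok in tokenize(text):
        vocab.setdefault(tok, len(vocab))

def encode(ingredients):
    """Binary vector: 1 where the product contains the ingredient, else 0."""
    vec = [0] * len(vocab)
    for tok in tokenize(ingredients):
        vec[vocab[tok]] = 1
    return vec

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

vectors = {name: encode(text) for name, text in products.items()}

def most_similar(name):
    """Return the other product whose ingredient vector is closest to `name`."""
    scored = [(cosine(vectors[name], vec), other)
              for other, vec in vectors.items() if other != name]
    return max(scored)[1]

print(most_similar("Cream A"))
```

On the real table, the same encoding would be applied to the `ingredients` column, and the resulting vectors could then be passed to a dimensionality-reduction step such as t-SNE for plotting.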

Acknowledgements

DataCamp

Tables

Cosmetics

@kaggle.kingabzpro_cosmetics_datasets.cosmetics
  • 439.5 KB
  • 1472 rows
  • 11 columns

CREATE TABLE cosmetics (
  "label" VARCHAR,
  "brand" VARCHAR,
  "name" VARCHAR,
  "price" BIGINT,
  "rank" DOUBLE,
  "ingredients" VARCHAR,
  "combination" BIGINT,
  "dry" BIGINT,
  "normal" BIGINT,
  "oily" BIGINT,
  "sensitive" BIGINT
);
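As an illustration of how this schema might be queried, the sketch below loads it into an in-memory SQLite database using Python's standard `sqlite3` module and selects products flagged for dry skin. SQLite is only a stand-in for whatever engine hosts the table. The inserted rows are hypothetical, and the sketch assumes that the skin-type columns are 0/1 flags and that `label` holds a product category such as "Moisturizer"; neither is stated on this page.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
CREATE TABLE cosmetics (
  "label" VARCHAR, "brand" VARCHAR, "name" VARCHAR,
  "price" BIGINT, "rank" DOUBLE, "ingredients" VARCHAR,
  "combination" BIGINT, "dry" BIGINT, "normal" BIGINT,
  "oily" BIGINT, "sensitive" BIGINT
)""")

# Hypothetical rows, not taken from the dataset.
rows = [
    ("Moisturizer", "Brand X", "Cream A", 40, 4.5, "Water, Glycerin", 1, 1, 1, 0, 1),
    ("Moisturizer", "Brand Y", "Cream B", 25, 4.1, "Water, Squalane", 1, 1, 1, 1, 0),
    ("Cleanser", "Brand X", "Gel C", 18, 4.7, "Water, Salicylic Acid", 1, 0, 1, 1, 0),
]
conn.executemany("INSERT INTO cosmetics VALUES (?,?,?,?,?,?,?,?,?,?,?)", rows)

# Products in one category flagged for dry skin, best-ranked first,
# with the cheaper product first when ranks tie.
query = """
SELECT "name", "price", "rank"
FROM cosmetics
WHERE "label" = ? AND "dry" = 1
ORDER BY "rank" DESC, "price" ASC
"""
dry_moisturizers = conn.execute(query, ("Moisturizer",)).fetchall()
print(dry_moisturizers)
```

Quoting the column names, as the schema does, avoids clashes with SQL function names such as `rank`.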
