Coffee Quality Database From CQI
A database scrapped from Coffee Quality Institute
@kaggle.volpatto_coffee_quality_database_from_cqi
A database scrapped from Coffee Quality Institute
@kaggle.volpatto_coffee_quality_database_from_cqi
These datasets are gathered from Coffee Quality Institute (CQI) in January, 2018.
I'm not the Owner of the Datasets, nor scrapping was performed by me. It was done in this GitHub's repo (kudos for the author), see there for further details.
Three CSV files are provided:
An Arabica coffee pre-cleaned dataset;
A Robusta coffee pre-cleaned dataset;
A dataset constructed through a merging of the datasets.
The file names indicates the above datasets clearly.
As explained in the repo, the datasets have reviews from specialized reviewers for both coffees: arabica and robusta. The below information is provided in each dataset.
There is one related dataset here in Kaggle, please check here. It's pretty much similar to the datasets presented here, but without Robusta coffee data.
CREATE TABLE arabica_data_cleaned (
"unnamed_0" BIGINT -- Unnamed: 0,
"species" VARCHAR,
"owner" VARCHAR,
"country_of_origin" VARCHAR,
"farm_name" VARCHAR,
"lot_number" VARCHAR,
"mill" VARCHAR,
"ico_number" VARCHAR,
"company" VARCHAR,
"altitude" VARCHAR,
"region" VARCHAR,
"producer" VARCHAR,
"number_of_bags" BIGINT,
"bag_weight" VARCHAR,
"in_country_partner" VARCHAR,
"harvest_year" VARCHAR,
"grading_date" VARCHAR,
"owner_1" VARCHAR,
"variety" VARCHAR,
"processing_method" VARCHAR,
"aroma" DOUBLE,
"flavor" DOUBLE,
"aftertaste" DOUBLE,
"acidity" DOUBLE,
"body" DOUBLE,
"balance" DOUBLE,
"uniformity" DOUBLE,
"clean_cup" DOUBLE,
"sweetness" DOUBLE,
"cupper_points" DOUBLE,
"total_cup_points" DOUBLE,
"moisture" DOUBLE,
"category_one_defects" BIGINT,
"quakers" DOUBLE,
"color" VARCHAR,
"category_two_defects" BIGINT,
"expiration" VARCHAR,
"certification_body" VARCHAR,
"certification_address" VARCHAR,
"certification_contact" VARCHAR,
"unit_of_measurement" VARCHAR,
"altitude_low_meters" DOUBLE,
"altitude_high_meters" DOUBLE,
"altitude_mean_meters" DOUBLE
);CREATE TABLE merged_data_cleaned (
"unnamed_0" BIGINT -- Unnamed: 0,
"species" VARCHAR,
"owner" VARCHAR,
"country_of_origin" VARCHAR,
"farm_name" VARCHAR,
"lot_number" VARCHAR,
"mill" VARCHAR,
"ico_number" VARCHAR,
"company" VARCHAR,
"altitude" VARCHAR,
"region" VARCHAR,
"producer" VARCHAR,
"number_of_bags" BIGINT,
"bag_weight" VARCHAR,
"in_country_partner" VARCHAR,
"harvest_year" VARCHAR,
"grading_date" VARCHAR,
"owner_1" VARCHAR,
"variety" VARCHAR,
"processing_method" VARCHAR,
"aroma" DOUBLE,
"flavor" DOUBLE,
"aftertaste" DOUBLE,
"acidity" DOUBLE,
"body" DOUBLE,
"balance" DOUBLE,
"uniformity" DOUBLE,
"clean_cup" DOUBLE,
"sweetness" DOUBLE,
"cupper_points" DOUBLE,
"total_cup_points" DOUBLE,
"moisture" DOUBLE,
"category_one_defects" BIGINT,
"quakers" DOUBLE,
"color" VARCHAR,
"category_two_defects" BIGINT,
"expiration" VARCHAR,
"certification_body" VARCHAR,
"certification_address" VARCHAR,
"certification_contact" VARCHAR,
"unit_of_measurement" VARCHAR,
"altitude_low_meters" DOUBLE,
"altitude_high_meters" DOUBLE,
"altitude_mean_meters" DOUBLE
);CREATE TABLE robusta_data_cleaned (
"unnamed_0" BIGINT -- Unnamed: 0,
"species" VARCHAR,
"owner" VARCHAR,
"country_of_origin" VARCHAR,
"farm_name" VARCHAR,
"lot_number" VARCHAR,
"mill" VARCHAR,
"ico_number" VARCHAR,
"company" VARCHAR,
"altitude" VARCHAR,
"region" VARCHAR,
"producer" VARCHAR,
"number_of_bags" BIGINT,
"bag_weight" VARCHAR,
"in_country_partner" VARCHAR,
"harvest_year" BIGINT,
"grading_date" VARCHAR,
"owner_1" VARCHAR,
"variety" VARCHAR,
"processing_method" VARCHAR,
"fragrance_aroma" DOUBLE -- Fragrance...Aroma,
"flavor" DOUBLE,
"aftertaste" DOUBLE,
"salt_acid" DOUBLE -- Salt...Acid,
"bitter_sweet" DOUBLE -- Bitter...Sweet,
"mouthfeel" DOUBLE,
"uniform_cup" DOUBLE,
"clean_cup" DOUBLE,
"balance" DOUBLE,
"cupper_points" DOUBLE,
"total_cup_points" DOUBLE,
"moisture" DOUBLE,
"category_one_defects" BIGINT,
"quakers" BIGINT,
"color" VARCHAR,
"category_two_defects" BIGINT,
"expiration" VARCHAR,
"certification_body" VARCHAR,
"certification_address" VARCHAR,
"certification_contact" VARCHAR,
"unit_of_measurement" VARCHAR,
"altitude_low_meters" DOUBLE,
"altitude_high_meters" DOUBLE,
"altitude_mean_meters" DOUBLE
);Anyone who has the link will be able to view this.