Anime Recommendation Database 2020
Recommendation data from 320.0000 users and 16.000 animes at myanimelist.net
@kaggle.hernan4444_anime_recommendation_database_2020
Recommendation data from 320.0000 users and 16.000 animes at myanimelist.net
@kaggle.hernan4444_anime_recommendation_database_2020
Recommendation data from 320.0000 users and 16.000 animes at myanimelist.net
This dataset contains information about 17.562 anime and the preference from 325.772 different users. In particular, this dataset contain:
Also, the code used to collect the data is available at github: https://github.com/Hernan4444/MyAnimeList-Database.
The data was scrapped between February 26th and March 20th.
I uploaded 2 files as example to don't increase the size of this dataset. All HTML files are in this link: https://drive.google.com/drive/folders/12ghJk-sWyXXORoLBUpPirK4YdtIaZPV_?usp=sharing
animelist.csv have the list of all animes register by the user with the respective score, watching status and numbers of episodes watched. This dataset contains 109 Million row, 17.562 different animes and 325.772 different users. The file have the following columns:watching_status.csv describe every possible status of the column: "watching_status" in animelist.csv.
rating_complete.csv is a subset of animelist.csv. This dataset only considers animes that the user has watched completely (watching_status==2) and gave it a score (score!=0). This dataset contains 57 Million ratings applied to 16.872 animes by 310.059 users. This file have the following columns:
anime.csv contain general information of every anime (17.562 different anime) like genre, stats, studio, etc. This file have the following columns:Thanks to:
Have an HTML files to experience the scraping exercise without the delay of each requests.
Experiment with different types of recommended. For instance, collaborative filtering or based on context like stats, genre, seiyus, reviews, synopsis, etc.
Use this information to build a better anime recommended system.
Identifying which feature allows us to build the best anime recommended system.
CREATE TABLE anime (
"mal_id" BIGINT,
"name" VARCHAR,
"score" VARCHAR,
"genres" VARCHAR,
"english_name" VARCHAR,
"japanese_name" VARCHAR,
"type" VARCHAR,
"episodes" VARCHAR,
"aired" VARCHAR,
"premiered" VARCHAR,
"producers" VARCHAR,
"licensors" VARCHAR,
"studios" VARCHAR,
"source" VARCHAR,
"duration" VARCHAR,
"rating" VARCHAR,
"ranked" VARCHAR,
"popularity" BIGINT,
"members" BIGINT,
"favorites" BIGINT,
"watching" BIGINT,
"completed" BIGINT,
"on_hold" BIGINT,
"dropped" BIGINT,
"plan_to_watch" BIGINT,
"score_10" VARCHAR,
"score_9" VARCHAR,
"score_8" VARCHAR,
"score_7" VARCHAR,
"score_6" VARCHAR,
"score_5" VARCHAR,
"score_4" VARCHAR,
"score_3" VARCHAR,
"score_2" VARCHAR,
"score_1" VARCHAR
);CREATE TABLE animelist (
"user_id" BIGINT,
"anime_id" BIGINT,
"rating" BIGINT,
"watching_status" BIGINT,
"watched_episodes" BIGINT
);CREATE TABLE anime_with_synopsis (
"mal_id" BIGINT,
"name" VARCHAR,
"score" VARCHAR,
"genres" VARCHAR,
"sypnopsis" VARCHAR
);CREATE TABLE rating_complete (
"user_id" BIGINT,
"anime_id" BIGINT,
"rating" BIGINT
);CREATE TABLE watching_status (
"status" BIGINT,
"n__description" VARCHAR -- Description
);Anyone who has the link will be able to view this.