Baselight

Anime-Planet Character Recommendation

Waifus and husbando dataset

@kaggle.hernan4444_animeplanet_character_recommendation

About this Dataset

Anime-Planet Character Recommendation

Anime-Planet Character Recommendation

Waifus and husbando dataset

This dataset contains information about 132.028 characters and the preference from 72.629 different users of characters scrapped from anime-planet. In particular, this dataset contain:

  • Information about the character like Tags, description, genre, etc.
  • HTML with character information to do data scrapping. These files contain information such as name, alias, hate rank, tags, etc.
  • The character list per user. Include characters that love and hate.

Warning: this dataset includes information on adult anime characters.

Content

The anime data was scrapped between June 29th and August 14th.

  • The "html" folder contain 1 html per character (132.028 different characters). I uploaded 2 files as example to don't increase the size of this dataset. All HTML files are in this link: https://drive.google.com/drive/folders/1Kg0OZ6dEsQuJZVqj1CcTGwDnwp4sNOnW?usp=sharing

  • user_characters.csv have the list of all character register by the user with the respective love boolean (means if the user love or hate the character). This dataset contains 12 Million row, 72.629 different animes and 132.028 different characters. The file have the following columns:

  1. user_id: non identifiable randomly generated user id.
  2. character_id: non identifiable randomly generated character id.
  3. loved: True if the user loves the character, False if he hates it.
  • characters_metadata.csv contain general information of every character (132.028 different character) like Tags, alias, name, gender, etc. This file have the following columns:
  1. ID: non identifiable randomly generated character id.
  2. Name: full name of this character.
  3. Alias: another way to call the character.
  4. Gender: gender of the this character.
  5. Hair Color: hair color of this character.
  6. Love Rank: love rank based in users preference.
  7. Hate Rank: hate rank based in users preference.
  8. Eye color: eye color of the character.
  9. Birthday: date of his birthday.
  10. Blood Type: blood type of the character.
  11. Tags: comma separated list of tags for this character.
  12. Love Count: how many users love this character.
  13. Hate Count: how many users hate this character.
  14. Description: short text with description if this character.
  15. url: url to the main page of character in Anime Planet.

Acknowledgements

Thanks to:

  1. Anime Planet for providing anime data.

Inspiration

  1. Experiment with different types of recommended. For instance, collaborative filtering or based on context like Tags, description, etc.

  2. Use this information to build a character recommended system.

  3. Build another dataset with anime topic.

  4. Try to Improve Anime Recommendation Database 2020 with more data of characters from the anime. This need to extract the anime id from every html.

Share link

Anyone who has the link will be able to view this.