Name: Banned Book Dataset
Creator: Kaggle
License: http://opendatacommons.org/licenses/dbcl/1.0/

Dataset of ~5k Banned Books + 7.5k non banned books

Book bans limit access to information ad restrict freedom of expression. There has been no comprehensive data for training ML models on if a book will be censored (challenged/banned) or not. This dataset aims to address that. The title and author of banned books are obtained through non-profits like the ALA and Pen America while metadata like description and genre for them are obtained through webscraping Goodreads.
The books that are labeled as uncensored are obtained through kaggle then filtered.

Related Datasets

Banned Prison Books Dataset

@kaggle
Fur Banning

@owid
Ethnic Power Relations Dataset (ETH, 2021)

@owid
Social Media Ban For Minors: A Computational Analysis Of Media Coverage In Europe And Beyond, Dataset

@ecjrc
Nuclear Weapons Proliferation

@owid
Wars On Territory

@owid

Banned Prison Books Dataset

Fur Banning

Ethnic Power Relations Dataset (ETH, 2021)

Social Media Ban For Minors: A Computational Analysis Of Media Coverage In Europe And Beyond, Dataset

Nuclear Weapons Proliferation

Wars On Territory