40,000 lines of Shakespeare from a variety of Shakespeare's plays
Dataset Description
TinyShakespeare (Shakespeare's Plays)
40,000 lines of Shakespeare from a variety of Shakespeare's plays
Source
Huggingface Hub: link
About this dataset
40,000 lines of Shakespeare from a variety of Shakespeare's plays. Featured in Andrej Karpathy's blog post 'The Unreasonable Effectiveness of Recurrent Neural Networks': http://karpathy.github.io/2015/05/21/rnn-effectiveness/.
How to use the dataset
Research Ideas
- Developing a model to generate new works in the style of William Shakespeare
- Using the characters in the plays to create new, original works
- Using the dataset to study the character development of Shakespeare's characters over the course of his career
Acknowledgements
License
> License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
> No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Columns
File: validation.csv
| Column name | Description |
|---|---|
| text | The text of the play. (String) |
File: train.csv
| Column name | Description |
|---|---|
| text | The text of the play. (String) |
File: test.csv
| Column name | Description |
|---|---|
| text | The text of the play. (String) |
Related Datasets
-
Shakespeare Play's Dialogues
@kaggle
-
Dummy Monster
@owid
-
Wars On Territory
@owid