The Office Quotes
Quotes from the popular sitcom The Office
@kaggle.thedevastator_the_office_a_great_dataset_for_nlp_research
Quotes from the popular sitcom The Office
@kaggle.thedevastator_the_office_a_great_dataset_for_nlp_research
By [source]
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
How to Use this Dataset
- Download the dataset from Kaggle (https://www.kaggle.com/the-office-quotes/data).
- Extract the zip file into a new folder.
- Open the file
training.csvin a spreadsheet application such as Microsoft Excel or Google Sheets.- The first column contains the character who said the quote, and the second column contains the quote itself.
- To use this dataset for natural language processing research, you can treat each quote as a separate document and use standard text mining techniques to analyze them
-Develop a model that can identify the character speaking given a quote from the show
-Build a bot that can generate new quotes from The Office in the style of the show
-Identify patterns in the dialogue of different characters on the show
If you use this dataset in your research, please credit the original authors.
Data SourceLicense
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: talking_head.csv
| Column name | Description |
|---|---|
| quote | The quote said by the character. (Text) |
File: test.csv
| Column name | Description |
|---|---|
| quote | The quote said by the character. (Text) |
| character | The character who said the quote. (Text) |
File: training.csv
| Column name | Description |
|---|---|
| quote | The quote said by the character. (Text) |
| character | The character who said the quote. (Text) |
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit .
CREATE TABLE talking_head (
"quote_id" BIGINT,
"quote" VARCHAR,
"character" VARCHAR
);CREATE TABLE test (
"quote_id" BIGINT,
"quote" VARCHAR,
"character" VARCHAR
);CREATE TABLE training (
"quote_id" BIGINT,
"quote" VARCHAR,
"character" VARCHAR
);Anyone who has the link will be able to view this.