Baselight

The Office Quotes

Quotes from the popular sitcom The Office

@kaggle.thedevastator_the_office_a_great_dataset_for_nlp_research


About this Dataset

How to Use this Dataset

  1. Download the dataset from Kaggle (https://www.kaggle.com/the-office-quotes/data).
  2. Extract the zip file into a new folder.
  3. Open the file training.csv in a spreadsheet application such as Microsoft Excel or Google Sheets.
  4. Each row pairs a quote with its speaker: the quote column holds the text of the quote, and the character column holds the character who said it.
  5. To use this dataset for natural language processing research, treat each quote as a separate document and apply standard text mining techniques such as tokenization and term counting.
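
The steps above can be sketched in Python using only the standard library. This is a minimal illustration, not the dataset author's method: the helper names (`load_quotes`, `tokenize`, `term_counts`) are hypothetical, the column names are assumed to match those listed in the Columns section, and the sample rows are placeholders rather than real quotes from the dataset.

```python
import csv
import io
import re
from collections import Counter


def load_quotes(handle):
    """Read an open CSV handle and return a list of (character, quote) pairs."""
    reader = csv.DictReader(handle)
    return [(row["character"], row["quote"]) for row in reader]


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def term_counts(quotes):
    """Treat each quote as one document and count its tokens."""
    return [Counter(tokenize(quote)) for _, quote in quotes]


# Placeholder rows for illustration only; they are NOT taken from the dataset.
# With the real file, pass open("training.csv", newline="", encoding="utf-8").
sample = io.StringIO(
    "quote_id,quote,character\n"
    "1,This is a placeholder quote,Speaker A\n"
    "2,Another placeholder quote here,Speaker B\n"
)

quotes = load_quotes(sample)
docs = term_counts(quotes)
print(len(docs), docs[0]["placeholder"])
```

Reading columns by name rather than by position keeps the sketch working whether or not the file carries a leading quote_id column.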

Research Ideas

  • Develop a model that can identify the character speaking, given a quote from the show.
  • Build a bot that can generate new quotes in the style of the show.
  • Identify patterns in the dialogue of different characters on the show.
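
As one possible starting point for the first idea, the sketch below implements a small multinomial naive Bayes baseline with add-one smoothing in plain Python. It is an assumption-laden illustration, not a method the dataset prescribes: the function names (`train`, `predict`) are hypothetical, and the training pairs are placeholders rather than real quotes.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(pairs):
    """Build per-character word counts from (character, quote) pairs."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    vocab = set()
    for character, quote in pairs:
        tokens = tokenize(quote)
        word_counts[character].update(tokens)
        doc_counts[character] += 1
        vocab.update(tokens)
    return word_counts, doc_counts, vocab


def predict(model, quote):
    """Return the character with the highest smoothed log-probability."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    tokens = [t for t in tokenize(quote) if t in vocab]  # skip unseen words
    best, best_score = None, -math.inf
    for character in doc_counts:
        score = math.log(doc_counts[character] / total_docs)
        total_words = sum(word_counts[character].values())
        for token in tokens:
            # Add-one (Laplace) smoothing avoids log(0) for unseen pairs.
            count = word_counts[character][token]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best, best_score = character, score
    return best


# Placeholder training pairs; NOT real quotes from the dataset.
model = train([
    ("Speaker A", "paper sales paper clients"),
    ("Speaker B", "beets farm beets bears"),
])
print(predict(model, "selling paper"))
```

On real data, the pairs would come from training.csv and accuracy would be measured on test.csv.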

Acknowledgements

If you use this dataset in your research, please credit the original authors.

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: talking_head.csv

Column name   Description
quote         The quote said by the character. (Text)

File: test.csv

Column name   Description
quote         The quote said by the character. (Text)
character     The character who said the quote. (Text)

File: training.csv

Column name   Description
quote         The quote said by the character. (Text)
character     The character who said the quote. (Text)


Tables

Talking Head

@kaggle.thedevastator_the_office_a_great_dataset_for_nlp_research.talking_head
  • 212.11 KB
  • 1749 rows
  • 3 columns

CREATE TABLE talking_head (
  "quote_id" BIGINT,
  "quote" VARCHAR,
  "character" VARCHAR
);

Test

@kaggle.thedevastator_the_office_a_great_dataset_for_nlp_research.test
  • 25.32 KB
  • 175 rows
  • 3 columns

CREATE TABLE test (
  "quote_id" BIGINT,
  "quote" VARCHAR,
  "character" VARCHAR
);

Training

@kaggle.thedevastator_the_office_a_great_dataset_for_nlp_research.training
  • 195.17 KB
  • 1574 rows
  • 3 columns

CREATE TABLE training (
  "quote_id" BIGINT,
  "quote" VARCHAR,
  "character" VARCHAR
);
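
To experiment with the schema above locally, the following sketch recreates the training table in an in-memory SQLite database and counts quotes per character. It assumes SQLite as a stand-in engine (it accepts the BIGINT and VARCHAR type names through type affinity), and the inserted rows are placeholders, not rows from the dataset.

```python
import sqlite3

# In-memory database; nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Same column names and declared types as the table definition above.
conn.execute(
    'CREATE TABLE training ('
    '"quote_id" BIGINT, "quote" VARCHAR, "character" VARCHAR)'
)

# Placeholder rows for illustration only; NOT taken from the dataset.
rows = [
    (1, "placeholder quote one", "Speaker A"),
    (2, "placeholder quote two", "Speaker A"),
    (3, "placeholder quote three", "Speaker B"),
]
conn.executemany("INSERT INTO training VALUES (?, ?, ?)", rows)

# Count quotes per character, most frequent first.
counts = conn.execute(
    'SELECT "character", COUNT(*) AS n FROM training '
    'GROUP BY "character" ORDER BY n DESC, "character"'
).fetchall()
print(counts)
```

The same GROUP BY query gives a quick view of class balance before training a speaker classifier.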
