45,000+ lines of dialogue from 9 seasons of The Office
Dataset Description
Data mined from transcripts of the show.
Content
Season: Season Number
Episode: Episode Number
Title: Name of the Episode
Scene: Scene number (running value from start of dataset)
Speaker: Character name
Line: Dialogue of character
Version 3 Updates - based on feedback from @saradata
- Added the missing lines from S9E4 and S7E17
- Fixed the issue with the running scene number
- Remove some censoring and special characters (ex: *, Ä etc)
- Cleaned up some lines that had scene context artifacts (ex: [on phone])
Related Datasets
-
The Office Quotes
@kaggle
-
Cases Completed 2015
@usgov
-
Cases Completed 2014
@usgov