South Park Scripts Dataset
All the Words, All the Time
@kaggle.thedevastator_south_park_scripts_dataset
By [source]
This dataset contains every word spoken by a character in the first 16 seasons of the TV show South Park. That's over 1 million words in all! Whether or not you're a fan of South Park, this is an interesting corpus for exploring natural language processing and seeing what insights can be gleaned from such a large body of text.
This dataset contains all of the words spoken by characters in the South Park TV show. It is divided into seasons, with each season containing a number of episodes. For each episode, there is a transcript of what was said by each character.
This dataset can be used to study the language of the South Park TV show and how the dialogue changes over time. Some ideas:
- Sentiment analysis of the South Park scripts
- Word clouds for each character
- Finding the most common words used in each season
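As a rough illustration of the last idea, the sketch below counts the most common words per season using only the Python standard library. The `SAMPLE` rows are invented for the example and the helper name `top_words_by_season` is hypothetical; the only thing taken from the dataset is the documented column layout (`Season`, `Episode`, `Character`, `Line`). The tokenizer is a deliberately simple regular expression, not a proper NLP pipeline.

```python
import csv
import io
import re
from collections import Counter, defaultdict

# Invented sample rows in the documented column layout. For the real data,
# replace io.StringIO(SAMPLE) with
# open("All-seasons.csv", newline="", encoding="utf-8").
SAMPLE = """Season,Episode,Character,Line
1,1,Stan,"Hello there, hello!"
1,1,Kyle,"Hello Stan."
2,1,Cartman,"No, no, no."
"""


def top_words_by_season(rows, n=3):
    """Return {season: [(word, count), ...]} with the n most common words."""
    counts = defaultdict(Counter)
    for row in rows:
        # Lowercase, then keep runs of letters and apostrophes as words.
        words = re.findall(r"[a-z']+", row["Line"].lower())
        counts[row["Season"]].update(words)
    return {season: c.most_common(n) for season, c in counts.items()}


result = top_words_by_season(csv.DictReader(io.StringIO(SAMPLE)))
for season, words in result.items():
    print(season, words)
```

On real scripts the top of each list will be dominated by function words such as "the" and "you", so filtering against a stop-word list is a likely next step.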
If you use this dataset in your research, please credit the original authors.
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission.
File: All-seasons.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
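A minimal sketch of reading a file with these four columns is shown below, assuming a standard comma-separated layout with a header row. The table describes `Season` and `Episode` as numeric, but the accompanying schema types them as `VARCHAR` for the combined file, so the loader converts them defensively and skips rows that do not parse. The sample text, including the stray repeated header row, is invented to exercise that path, and `load_lines` is a hypothetical helper name.

```python
import csv
import io

# Invented sample; the second "Season,Episode,..." row stands in for any
# row whose Season or Episode is not an integer.
SAMPLE = """Season,Episode,Character,Line
10,1,Stan,"You guys, come on."
Season,Episode,Character,Line
10,2,Kyle,"Dude."
"""


def load_lines(fileobj):
    """Yield one dict per spoken line, with season and episode as ints."""
    for row in csv.DictReader(fileobj):
        try:
            season = int(row["Season"])
            episode = int(row["Episode"])
        except (TypeError, ValueError):
            continue  # non-numeric or missing value: skip the row
        yield {
            "season": season,
            "episode": episode,
            "character": (row["Character"] or "").strip(),
            "line": (row["Line"] or "").strip(),
        }


rows = list(load_lines(io.StringIO(SAMPLE)))
print(len(rows), rows[0]["character"])
```

To run it against the real file, pass `open("All-seasons.csv", newline="", encoding="utf-8")` instead of the `io.StringIO` sample; the encoding is an assumption.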
File: Season-1.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
File: Season-10.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
File: Season-11.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
File: Season-12.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
File: Season-13.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
File: Season-14.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
File: Season-15.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
File: Season-16.csv
| Column name | Description |
|---|---|
| Season | The season the episode is from. (Numeric) |
| Episode | The episode number. (Numeric) |
| Character | The character who spoke the line. (String) |
| Line | The line spoken by the character. (String) |
CREATE TABLE all_seasons (
"season" VARCHAR,
"episode" VARCHAR,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_1 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_10 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_11 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_12 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_13 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_14 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_15 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_16 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_17 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_18 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_19 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_2 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_3 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_4 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_5 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_6 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_7 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_8 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);

CREATE TABLE season_9 (
"season" BIGINT,
"episode" BIGINT,
"character" VARCHAR,
"line" VARCHAR
);
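
One detail in the schema above is worth a hedged example: `all_seasons` declares `season` and `episode` as `VARCHAR`, while the per-season tables declare them as `BIGINT`. Comparing text values numerically can give surprising results (as strings, '9' sorts after '10'), so a cast is a reasonable precaution when filtering the combined table. The sketch below uses Python's built-in `sqlite3` module as a stand-in engine, which is an assumption since the source does not name the database, and the inserted rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Same column names and declared types as the all_seasons table above.
conn.execute(
    'CREATE TABLE all_seasons ('
    '"season" VARCHAR, "episode" VARCHAR, "character" VARCHAR, "line" VARCHAR)'
)

# Invented rows; season and episode are stored as text, as the schema implies.
conn.executemany(
    "INSERT INTO all_seasons VALUES (?, ?, ?, ?)",
    [
        ("9", "1", "Cartman", "a"),
        ("10", "1", "Stan", "b"),
        ("10", "2", "Stan", "c"),
        ("11", "1", "Kyle", "d"),
    ],
)

# Count lines per character from season 10 onward. The CAST makes the
# comparison numeric, so the season "9" row is excluded.
query = """
SELECT "character", COUNT(*) AS n_lines
FROM all_seasons
WHERE CAST("season" AS INTEGER) >= 10
GROUP BY "character"
ORDER BY n_lines DESC, "character"
"""
top = conn.execute(query).fetchall()
print(top)
conn.close()
```

The per-season tables already type these columns as `BIGINT`, so the cast should not be needed there.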