Spotify Music Dataset
Popular and Not so popular songs
@kaggle.solomonameh_spotify_music_dataset
Popular and Not so popular songs
@kaggle.solomonameh_spotify_music_dataset
The dataset is collated from Spotify's API using two separate python scripts to extract popular and non-popular songs and their associated audio and descriptive features. Descriptive features of a song include information about the song such as the artist name, album name and release date. Audio features include key, valence , danceability and energy which are results of spotify's audio analysis. The combined features used in the experimentation phase of this project include:
Feature | Description |
---|---|
Energy | A measure of intensity and activity. Typically, energetic tracks feel fast, loud, and noisy. |
Tempo | The speed of a track, measured in beats per minute (BPM). |
Danceability | A score describing how suitable a track is for dancing based on tempo, rhythm stability, beat strength and overall regularity. |
Loudness | The overall loudness of a track in decibels (dB). Higher values indicate louder tracks overall. |
Liveness | The likelihood of a track being performed live. Higher values suggest more audience presence. |
Valence | The overall musical positiveness(emotion) of a track. High valence sounds happy; low valence sounds sad or angry. |
Speechiness | Measures the presence of spoken words. |
Instrumentalness | The likelihood a track contains no vocals. Values closer to 1.0 suggest solely instrumental tracks. |
Mode | Indicates the modality of the track. |
Key | The musical key, represented as an integer from 0 to 11, mapping to standard Pitch class notation. |
Duration_ms | The length of the track in milliseconds. |
Acousticness | A confidence measure of whether a track is acoustic(1) or not(0). |
Feature | Description |
---|---|
Track Name | The name of the track. |
Track Artist | The artist(s) performing the track. |
Track Album Name | The album in which the track is featured. |
Track Album Release Date | The release date of the album containing the track. |
Track ID | A unique identifier assigned to the track by Spotify. |
Track Album ID | A unique identifier for the album. |
Playlist Name | The name of the playlist where the track is included. |
Playlist Genre | The main genre associated with the playlist (e.g., pop, rock, classical). |
Playlist Subgenre | A more specific subgenre tied to the playlist (e.g., indie pop, punk rock). |
Playlist ID | A unique identifier for the playlist. |
Track Popularity | A score (0–100) which is calculated based on total number of streams in relation to other songs. |
CREATE TABLE high_popularity_spotify_data (
"energy" DOUBLE,
"tempo" DOUBLE,
"danceability" DOUBLE,
"playlist_genre" VARCHAR,
"loudness" DOUBLE,
"liveness" DOUBLE,
"valence" DOUBLE,
"track_artist" VARCHAR,
"time_signature" BIGINT,
"speechiness" DOUBLE,
"track_popularity" BIGINT,
"track_href" VARCHAR,
"uri" VARCHAR,
"track_album_name" VARCHAR,
"playlist_name" VARCHAR,
"analysis_url" VARCHAR,
"track_id" VARCHAR,
"track_name" VARCHAR,
"track_album_release_date" TIMESTAMP,
"instrumentalness" DOUBLE,
"track_album_id" VARCHAR,
"mode" BIGINT,
"key" BIGINT,
"duration_ms" BIGINT,
"acousticness" DOUBLE,
"id" VARCHAR,
"playlist_subgenre" VARCHAR,
"type" VARCHAR,
"playlist_id" VARCHAR
);
CREATE TABLE low_popularity_spotify_data (
"time_signature" DOUBLE,
"track_popularity" BIGINT,
"speechiness" DOUBLE,
"danceability" DOUBLE,
"playlist_name" VARCHAR,
"track_artist" VARCHAR,
"duration_ms" DOUBLE,
"energy" DOUBLE,
"playlist_genre" VARCHAR,
"playlist_subgenre" VARCHAR,
"track_href" VARCHAR,
"track_name" VARCHAR,
"mode" DOUBLE,
"uri" VARCHAR,
"type" VARCHAR,
"track_album_release_date" TIMESTAMP,
"analysis_url" VARCHAR,
"id" VARCHAR,
"instrumentalness" DOUBLE,
"track_album_id" VARCHAR,
"playlist_id" VARCHAR,
"track_id" VARCHAR,
"valence" DOUBLE,
"key" DOUBLE,
"tempo" DOUBLE,
"loudness" DOUBLE,
"acousticness" DOUBLE,
"liveness" DOUBLE,
"track_album_name" VARCHAR
);
Anyone who has the link will be able to view this.