Structured MIDI-like sequences with tempo, key, genre, style, and cluster labels

This dataset contains 2765 rows of musical phrase data designed to support research, education, and creative projects in music technology. Each entry represents a unique musical phrase with detailed attributes describing its melodic, rhythmic, and expressive qualities.

The dataset includes note sequences, durations, velocities, tempo, musical key, genre, style label, and a target cluster label. These features reflect structured, MIDI-like representations of music suitable for analysis, classification, and generative tasks.

✅ Key Features
2765 musical phrases

MIDI-style note sequences

Note durations and velocities

Tempo and musical key annotations

Genre and style labels

Cluster label as a target column

CSV format for easy use in data analysis projects

🗂️ Example Columns
Column Description
phrase_id Unique phrase identifier
note_sequence MIDI note numbers (space-separated)
duration_sequence Note durations in beats
velocity_sequence MIDI velocities
tempo Beats per minute (BPM)
key Musical key (e.g., Cmaj, Amin)
genre Genre label (e.g., jazz, classical)
style_label Style or idiom within the genre
cluster_label Target label for grouping or classification

Musical Improvisation Dataset

Structured MIDI-like sequences with tempo, key, genre, style, and cluster labels

Related Datasets

Music Features

Dataset Of Thermostable In Vitro Transcription-translation Compatible With Microfluidic Droplets

MoTT: A Speech Dataset For Modular Composition Of Turn-Taking Conversations

Historical Series Of Phenological Data For Cherry Tree Flowering At Kyoto City (and March Mean Temperature Reconstructions)

Trust Questions In The European Social Survey, Latinobarómetro And Afrobarometer

Ethnic Power Relations Dataset (ETH, 2021)