The following is the data validation report that the seller chose to provide:
Data description
Context
This is version 3.0.0 of the MAESTRO dataset, downloaded from The MAESTRO Dataset by magenta.js.
MAESTRO (MIDI and Audio Edited for Synchronous TRacks and Organization) is a dataset composed of virtuosic piano performances captured with fine alignment (~3 ms) between note labels and audio waveforms.
Content
The MIDI data includes key strike velocities and sustain/sostenuto/una corda pedal positions. Audio and MIDI files are aligned with ~3 ms accuracy and sliced to individual musical pieces, which are annotated with composer, title, and year of performance. Uncompressed audio is of CD quality or higher (44.1–48 kHz 16-bit PCM stereo).
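As a rough illustration of the pedal data and the alignment figure mentioned above, the sketch below maps MIDI control-change numbers to pedal names and converts the ~3 ms tolerance into audio samples. It assumes the files use the standard MIDI controller numbers (64 for sustain, 66 for sostenuto, 67 for una corda); the helper names `describe_control_change` and `alignment_in_samples` are hypothetical and not part of the dataset or any library.

```python
# Standard MIDI controller numbers for the three piano pedals.
# Assumption: the MAESTRO MIDI files follow this standard assignment.
PEDAL_CONTROLLERS = {64: "sustain", 66: "sostenuto", 67: "una corda"}


def describe_control_change(control, value):
    """Return (pedal name, position scaled to 0..1), or None for non-pedal controllers."""
    name = PEDAL_CONTROLLERS.get(control)
    if name is None:
        return None
    # MIDI controller values range from 0 to 127.
    return name, value / 127.0


def alignment_in_samples(tolerance_s, sample_rate_hz):
    """Convert an alignment tolerance in seconds to a number of audio samples."""
    return tolerance_s * sample_rate_hz


print(describe_control_change(64, 127))
print(alignment_in_samples(0.003, 44100))
print(alignment_in_samples(0.003, 48000))
```

At the sample rates quoted above, a 3 ms tolerance corresponds to somewhere between roughly 130 and 145 samples.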
MAESTRO is provided as a zip file containing the MIDI files as well as metadata in CSV and JSON formats.
The metadata files have the following fields for every MIDI file:
Field | Description |
---|---|
composer | Composer of the piece. We have attempted to standardize on a single spelling for a given name. |
canonical_title | Title of the piece. Not guaranteed to be standardized to a single representation. |
split | Suggested train/validation/test split. |
year | Year of performance. |
midi_filename | MIDI filename. |
duration | Duration in seconds, based on the MIDI file. |
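The fields above could be read with Python's standard `csv` module, for example to total the duration of each suggested split. This is a minimal sketch: the function names `load_metadata` and `duration_by_split` are hypothetical, and the sample rows are invented stand-ins rather than real entries from the dataset.

```python
import csv
import io
from collections import defaultdict


def load_metadata(csv_file):
    """Read metadata rows from an open text file and convert the numeric fields."""
    rows = []
    for row in csv.DictReader(csv_file):
        row["year"] = int(row["year"])
        row["duration"] = float(row["duration"])
        rows.append(row)
    return rows


def duration_by_split(rows):
    """Sum the duration in seconds of all pieces in each suggested split."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["split"]] += row["duration"]
    return dict(totals)


# Tiny in-memory stand-in for the metadata CSV; these rows are made up.
SAMPLE_CSV = """composer,canonical_title,split,year,midi_filename,duration
Composer A,Piece One,train,2004,2004/piece_one.midi,120.5
Composer B,Piece Two,validation,2006,2006/piece_two.midi,95.25
Composer A,Piece Three,train,2008,2008/piece_three.midi,60.0
"""

rows = load_metadata(io.StringIO(SAMPLE_CSV))
totals = duration_by_split(rows)
print(totals)
```

To run this against the real metadata, replace the `io.StringIO` object with a file opened via `open(path, newline="", encoding="utf-8")`, where `path` points to the CSV file inside the extracted zip archive (the exact file name is not stated here, so check the archive contents).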
Acknowledgements
MAESTRO was originally introduced in the paper "Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset".
