mirdata
Mirdata is an open-source Python library that provides tools for working with common Music Information Retrieval (MIR) datasets, including tools for:
downloading datasets to a common location and format
validating that the files for a dataset are all present
loading annotation files to a common format, consistent with
mir_eval
parsing track level metadata for detailed evaluations.
pip install mirdata
For more details on how to use the library see the Tutorial.
Citing mirdata
If you are using the library for your work, please cite the version you used as indexed at Zenodo:
If you refer to mirdata’s design principles, motivation etc., please cite the following paper [1]:
When working with datasets, please cite the version of Mirdata that you are using (given by the DOI
above)
AND include the reference of the dataset, which can be found in the respective dataset loader using the cite()
method.
Contributing to mirdata
We welcome contributions to this library, especially new datasets. Please see Contributing for guidelines.