Skip to content

GulaerChen/gulaerchen.github.io

Repository files navigation

MIR-SingerSeparation Dataset & Auto-selection

Dataset Description

The dataset consists of 3 types (Please refer to paper to introduce various categories.) of singer separation datasets, each track 10 seconds long, segments from 476 English and 500 Chinese songs, and male/female vocalist ratio for English songs was 269:207, while that for Chinese songs was 223:277. The tracks are all 8kHz Mono 16-bit audio files in .wav format.

Versions

1.0.0 (default): No release notes.

System Demo

Singer Separation

Download datasets

Download size

  • EN-D: 17.0 GB
  • CH-D: 8.25 GB
  • EN-S: 11.6 GB

Dataset size

  • EN-D: 22 GB
  • CH-D: 12 GB
  • EN-S: 15 GB

Splits

Auto Selection

python auto_selection.py 

Citation

Please cite the paper to use the dataset.

@misc{chen2021singer,
    title={Singer separation for karaoke content generation},
    author={Hsuan-Yu Chen and Xuanjun Chen and Jyh-Shing Roger Jang},
    year={2021},
    eprint={2110.06707},
    archivePrefix={arXiv},
    primaryClass={cs.SD}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published