MIR-SingerSeparation Dataset & Auto-selection

Dataset Description

The dataset consists of 3 types (Please refer to paper to introduce various categories.) of singer separation datasets, each track 10 seconds long, segments from 476 English and 500 Chinese songs, and male/female vocalist ratio for English songs was 269:207, while that for Chinese songs was 223:277. The tracks are all 8kHz Mono 16-bit audio files in .wav format.

Versions

1.0.0 (default): No release notes.

System Demo

Singer Separation

Download datasets

Download size

EN-D: 17.0 GB
CH-D: 8.25 GB
EN-S: 11.6 GB

Dataset size

EN-D: 22 GB
CH-D: 12 GB
EN-S: 15 GB

Splits

Auto Selection

Pitch Data

python auto_selection.py

Citation

Please cite the paper to use the dataset.

@misc{chen2021singer,
    title={Singer separation for karaoke content generation},
    author={Hsuan-Yu Chen and Xuanjun Chen and Jyh-Shing Roger Jang},
    year={2021},
    eprint={2110.06707},
    archivePrefix={arXiv},
    primaryClass={cs.SD}
}

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
MIR-SingerSeparation		MIR-SingerSeparation
pitch_data/CH_D		pitch_data/CH_D
static		static
README.md		README.md
auto_selection.py		auto_selection.py
lead_vocal_separation.yml		lead_vocal_separation.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MIR-SingerSeparation Dataset & Auto-selection

Dataset Description

Versions

System Demo

Download datasets

Download size

Dataset size

Splits

Auto Selection

Citation

About

Releases

Packages

Languages

GulaerChen/gulaerchen.github.io

Folders and files

Latest commit

History

Repository files navigation

MIR-SingerSeparation Dataset & Auto-selection

Dataset Description

Versions

System Demo

Download datasets

Download size

Dataset size

Splits

Auto Selection

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages