Speech2Phone

This is the official implementation of the paper Speech2Phone: A Multilingual and Text Independent Speaker Identification Model

Speech2Phone is a multilingual, text-independent speaker identification system. In addition, the embeddings extracted from this model can be used to represent speakers in speech synthesis systems, speech cloning and voice transfer between languages.

In this repository the Paper directory has the implementation of all the experiments and topologies explored in the article. The Speech2Phone directory presents the implementation and checkpoints of the best model of the article.

Colab Notebook Demos:

Identification of speakers in Spanish

Identification of speakers in Chinese spoken in Taiwan

Citation

@article{casanova2020speech2phone,
  title={Speech2Phone: A Multilingual and Text Independent Speaker Identification Model},
  author={Casanova, Edresson and Junior, Arnaldo Candido and Shulby, Christopher and da Silva, Hamilton Pereira and Cordeiro, Alessandro Ferreira and Guedes, Victor de Oliveira and Aluisio, Sandra Maria and others},
  journal={arXiv preprint arXiv:2002.11213},
  year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Paper		Paper
Speech2Phone		Speech2Phone
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Speech2Phone

Colab Notebook Demos:

Citation

About

Uh oh!

Releases

Packages

Languages

nilc-nlp/Speech2Phone

Folders and files

Latest commit

History

Repository files navigation

Speech2Phone

Colab Notebook Demos:

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages