Speech2Phone: A Multilingual and Text Independent Speaker Identification Model


nilc-nlp/Speech2Phone

 
 


Speech2Phone

This is the official implementation of the paper "Speech2Phone: A Multilingual and Text Independent Speaker Identification Model".

Speech2Phone is a multilingual, text-independent speaker identification system. The embeddings extracted from the model can also be used to represent speakers in speech synthesis, voice cloning, and cross-language voice transfer systems.
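As a rough illustration of how speaker embeddings can drive identification, the sketch below compares a query embedding against enrolled speakers and picks the closest one. It is not this repository's API: the function names (`cosine_similarity`, `identify_speaker`), the toy three-dimensional vectors, and the choice of cosine similarity as the comparison measure are all assumptions made for the example. In practice the vectors would come from the trained model.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def identify_speaker(query, enrolled):
    """Return (name, score) for the enrolled speaker most similar to query.

    `enrolled` maps a speaker name to one reference embedding.
    Both names are hypothetical helpers, not part of Speech2Phone.
    """
    if not enrolled:
        raise ValueError("no enrolled speakers")
    scores = {name: cosine_similarity(query, emb) for name, emb in enrolled.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


# Toy embeddings (assumed values, for illustration only).
enrolled = {
    "speaker_a": [0.9, 0.1, 0.0],
    "speaker_b": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]

name, score = identify_speaker(query, enrolled)
print(name, round(score, 3))
```

Because the comparison uses only the embedding vectors, the same procedure applies regardless of the language or the text spoken, which is the property the description above relies on.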

The repository has two main directories: Paper contains the implementation of every experiment and topology explored in the article, and Speech2Phone contains the implementation and checkpoints of the article's best model.

Colab Notebook Demos:

  • Identification of speakers in Spanish
  • Identification of speakers in Chinese spoken in Taiwan

Citation

@article{casanova2020speech2phone,
  title={Speech2Phone: A Multilingual and Text Independent Speaker Identification Model},
  author={Casanova, Edresson and Junior, Arnaldo Candido and Shulby, Christopher and da Silva, Hamilton Pereira and Cordeiro, Alessandro Ferreira and Guedes, Victor de Oliveira and Aluisio, Sandra Maria and others},
  journal={arXiv preprint arXiv:2002.11213},
  year={2020}
}

Languages

  • Jupyter Notebook 70.3%
  • Python 29.7%