
Sharif-SLPL/speaker-diarization

 
 


speaker-diarization

Speaker diarization is the process of partitioning an audio stream containing human speech into homogeneous segments according to the identity of each speaker.

This project includes an API, a web UI, and a Telegram bot. The API module provides three endpoints with different response types. All three endpoints accept an audio file and run a speaker diarization model on it. The responses are an RTTM file, a TF plot, and the diarization combined with ASR results. The web UI provides an interface to upload an audio file or record your voice for speaker diarization. The Telegram bot is a simple way to use speaker diarization: you can record a voice message or forward voice messages from your chats, and the bot converts them to text alongside a speaker tag.
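To make the RTTM and ASR-combined responses more concrete, here is a minimal sketch of how a client might read RTTM text and attach speaker labels to transcribed segments. It assumes the standard RTTM `SPEAKER` record layout and a hypothetical ASR result shaped as `(start, end, text)` tuples. The names `Turn`, `parse_rttm`, `label_segments`, and the sample data are illustrative and are not taken from this project's code or its actual response format.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    speaker: str
    start: float  # seconds
    end: float    # seconds


def parse_rttm(text: str) -> list[Turn]:
    """Parse SPEAKER records from RTTM text into speaker turns."""
    turns = []
    for line in text.splitlines():
        fields = line.split()
        # Standard RTTM SPEAKER record fields:
        # type file chan tbeg tdur ortho stype name conf slat
        if len(fields) < 8 or fields[0] != "SPEAKER":
            continue
        start = float(fields[3])
        duration = float(fields[4])
        turns.append(Turn(speaker=fields[7], start=start, end=start + duration))
    return turns


def label_segments(turns, segments):
    """Give each (start, end, text) segment the speaker with the most overlap."""
    labeled = []
    for seg_start, seg_end, text in segments:
        overlap = {}
        for turn in turns:
            shared = min(seg_end, turn.end) - max(seg_start, turn.start)
            if shared > 0:
                overlap[turn.speaker] = overlap.get(turn.speaker, 0.0) + shared
        speaker = max(overlap, key=overlap.get) if overlap else "unknown"
        labeled.append((speaker, text))
    return labeled


# Hypothetical sample data, not output produced by this project.
SAMPLE_RTTM = """\
SPEAKER meeting 1 0.00 4.50 <NA> <NA> spk_0 <NA> <NA>
SPEAKER meeting 1 4.50 3.00 <NA> <NA> spk_1 <NA> <NA>
"""
SAMPLE_ASR = [(0.2, 4.0, "hello there"), (4.6, 7.0, "hi, how are you")]

if __name__ == "__main__":
    for speaker, text in label_segments(parse_rttm(SAMPLE_RTTM), SAMPLE_ASR):
        print(f"{speaker}: {text}")
```

Picking the speaker by largest time overlap is one simple choice for aligning the two outputs; the project's own combination step may work differently.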

Installation

Use


 
 
 
