This program serves as a speaker recognition software that can identify whether someone is speaking and the current speaker in run-time.
How to use it:
-
install the following packages on the client side:
pip install socket
pip install sounddevice
install PyTorch and the following packages on the server side:
pip install speechbrain
pip install socket
-
To set up server connection, follow these steps:
-
Use regular ssh to connect to the Linux box:
-
Activate conda, and then run "python server.py":
conda activate speechbrain
-
On Windows terminal, use "ssh -L..." to open a SSH tunnel:
ssh -L 12345:localhost:12345 [email protected] -N
-
-
To start the program, run user_interface.py
In the "enrollment" page, user can type in the name of the speaker and talk to save a 5 second voice sample to server for later identification.
In the "recognition" page, the program will load the sample audio and display who the speaker was every second once the recognition button is pressed