GitHub

This program serves as a speaker recognition software that can identify whether someone is speaking and the current speaker in run-time.

How to use it:

install the following packages on the client side:

pip install socket

pip install sounddevice

install PyTorch and the following packages on the server side:

pip install speechbrain

pip install socket

To set up server connection, follow these steps:
1. Use regular ssh to connect to the Linux box:
  
  ssh [email protected]
2. Activate conda, and then run "python server.py":
  
  conda activate speechbrain
3. On Windows terminal, use "ssh -L..." to open a SSH tunnel:
  
  ssh -L 12345:localhost:12345 [email protected] -N
To start the program, run user_interface.py

In the "enrollment" page, user can type in the name of the speaker and talk to save a 5 second voice sample to server for later identification.

In the "recognition" page, the program will load the sample audio and display who the speaker was every second once the recognition button is pressed

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
local_machine		local_machine
server		server
README.md		README.md

Provide feedback