Skip to content

jwhitehill/realtimespeech

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

This program serves as a speaker recognition software that can identify whether someone is speaking and the current speaker in run-time.

How to use it:

  1. install the following packages on the client side:

    pip install socket

    pip install sounddevice

install PyTorch and the following packages on the server side:

pip install speechbrain

pip install socket
  1. To set up server connection, follow these steps:

    1. Use regular ssh to connect to the Linux box:

      ssh [email protected]

    2. Activate conda, and then run "python server.py":

      conda activate speechbrain

    3. On Windows terminal, use "ssh -L..." to open a SSH tunnel:

      ssh -L 12345:localhost:12345 [email protected] -N

  2. To start the program, run user_interface.py

    In the "enrollment" page, user can type in the name of the speaker and talk to save a 5 second voice sample to server for later identification.

    In the "recognition" page, the program will load the sample audio and display who the speaker was every second once the recognition button is pressed

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages