speaker-recognition

Here are 326 public repositories matching this topic...

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr speech-translation speaker-diariazation generative-ai

Updated Dec 7, 2025
Python

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Dec 3, 2025
Python

pyannote / pyannote-audio

Star

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Dec 7, 2025
Jupyter Notebook

google / uis-rnn

Star

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

machine-learning clustering supervised-learning speaker-recognition speaker-diarization supervised-clustering uis-rnn

Updated Sep 25, 2024
Python

mravanelli / SincNet

Star

SincNet is a neural architecture for efficiently processing raw audio samples.

Updated Apr 28, 2021
Python

yeyupiaoling / VoiceprintRecognition-Pytorch

Star

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods

pytorch voice-recognition speaker-recognition arcface ecapa-tdnn

Updated Jun 10, 2025
Python

clovaai / voxceleb_trainer

Star

In defence of metric learning for speaker recognition

metric-learning speaker-recognition speaker-verification voxceleb

Updated Mar 26, 2024
Python

wenet-e2e / wespeaker

Star

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Updated Dec 5, 2025
Python

FluidInference / FluidAudio

Star

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

audio macos swift ios real-time avfoundation nvidia vad automatic-speech-recognition speech-to-text ane speaker-recognition asr speaker-diarization voice-activity-detection coreml speaker-identification speaker-embedding parakeet

Updated Dec 7, 2025
Swift

athena-team / athena

Star

an open-source implementation of sequence-to-sequence based speech processing engine

deployment tensorflow tts speech-synthesis transformer speech-recognition sequence-to-sequence unsupervised-learning speaker-recognition asr ctc wfst

Updated Dec 2, 2022
C++

astorfi / 3D-convolutional-speaker-recognition

Sponsor

Star

🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

deep-learning convolutional-neural-networks speaker-recognition 3d

Updated Mar 3, 2020
Python

TaoRuijie / ECAPA-TDNN

Star

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

speaker-recognition speaker-verification voxceleb1 voxceleb2 ecapa-tdnn

Updated Apr 11, 2024
Python

cvqluu / Angular-Penalty-Softmax-Losses-Pytorch

Star

Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)

pytorch face-recognition metric-learning speaker-recognition embedding loss-functions face-verification sphereface normface fashion-mnist arcface am-softmax fmnist-dataset loss-function

Updated Dec 13, 2023
Python

taylorlu / Speaker-Diarization

Star

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

speaker-recognition speaker-diarization uis-rnn ghostvlad vgg-speaker-recognition

Updated Jul 1, 2021
Python

google / speaker-id

Star

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition speaker-verification source-separation speaker-diarization speaker-identification

Updated Aug 12, 2025
Python

nuaazs / VAF_2

Star

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

microservices speech-recognition speaker-recognition antifraud speaker-diarization

Updated Apr 16, 2024
Python

speechbrain / speechbrain.github.io

Star

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Updated Jun 18, 2025
HTML

SamirPaulb / real-time-voice-translator

Star

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Updated Jan 22, 2024
Tcl

yeyupiaoling / VoiceprintRecognition-Tensorflow

Star

使用Tensorflow实现声纹识别

tensorflow voice-recognition speaker-recognition arcface

Updated Jun 16, 2024
Python

manojpamk / pytorch_xvectors

Star

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

speaker-recognition speaker-verification speaker-diarization speaker-embeddings

Updated Nov 11, 2020
Python

Improve this page

Add a description, image, and links to the speaker-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speaker-recognition topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speaker-recognition

Here are 326 public repositories matching this topic...

NVIDIA-NeMo / NeMo

speechbrain / speechbrain

pyannote / pyannote-audio

google / uis-rnn

mravanelli / SincNet

yeyupiaoling / VoiceprintRecognition-Pytorch

clovaai / voxceleb_trainer

wenet-e2e / wespeaker

FluidInference / FluidAudio

athena-team / athena

astorfi / 3D-convolutional-speaker-recognition

TaoRuijie / ECAPA-TDNN

cvqluu / Angular-Penalty-Softmax-Losses-Pytorch

taylorlu / Speaker-Diarization

google / speaker-id

nuaazs / VAF_2

speechbrain / speechbrain.github.io

SamirPaulb / real-time-voice-translator

yeyupiaoling / VoiceprintRecognition-Tensorflow

manojpamk / pytorch_xvectors

Improve this page

Add this topic to your repo