Unified-Audio: An Open-Source Project to Unify Audio Processing and Generation

This project collects a series of works on audio processing and generation, covering speech, music, and general audio events, to support reproducible research in the field. The goal of Unified-Audio is to explore a unified framework that handles different audio processing and generation tasks, including:

SR: Speech Restoration (⛳ supported)
TSE: Target Speaker Extraction (⛳ supported)
SS: Speech Separation (⛳ supported)
VC: Voice Conversion (⛳ supported)
LASS: Language-Queried Audio Source Separation (⛳ supported)
CODEC: Audio Tokenization (⛳ supported)
AE: Audio Editing (⛳ in development)
TTA: Text to Audio (⛳ in development)
more...

Beyond the frameworks for specific audio tasks, Unified-Audio also includes work on neural audio codecs (NAC), the fundamental module for combining the audio modality with language models.
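To make the idea of audio tokenization concrete, here is a minimal sketch of how a waveform can be turned into discrete token ids that a language model could consume. It is not the project's codec: a neural audio codec learns its quantizer, whereas this example swaps in classic 8-bit mu-law quantization as a simple stand-in. The function names `encode` and `decode` and the 16 kHz test tone are assumptions made for illustration only.

```python
import math

MU = 255  # 8-bit mu-law: yields a 256-entry token vocabulary (illustrative choice)


def encode(samples):
    """Map float samples in [-1, 1] to integer token ids in [0, 255]."""
    tokens = []
    for x in samples:
        x = max(-1.0, min(1.0, x))  # clip to the valid range
        # Mu-law companding: compress large amplitudes logarithmically.
        y = math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)
        # Shift from [-1, 1] to [0, MU] and round to the nearest token id.
        tokens.append(int(round((y + 1) / 2 * MU)))
    return tokens


def decode(tokens):
    """Map token ids back to approximate float samples in [-1, 1]."""
    samples = []
    for t in tokens:
        y = 2 * t / MU - 1  # back to [-1, 1]
        # Inverse mu-law expansion.
        samples.append(math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y))
    return samples


# Hypothetical input: 10 ms of a 440 Hz tone sampled at 16 kHz.
wave = [math.sin(2 * math.pi * 440 * n / 16000) for n in range(160)]
tokens = encode(wave)      # discrete sequence a language model could model
restored = decode(tokens)  # lossy reconstruction of the waveform
print(tokens[:8])
```

The token sequence plays the role that text tokens play for a text language model. A learned codec serves the same purpose but typically reaches a much lower token rate for comparable quality.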

🚀 News

2025/09/22: We release UniSE, a foundation model for unified speech generation. The system supports target speaker extraction and universal speech enhancement. A demo is available; code is coming soon.
2025/10/26: We release UniTok-Audio. The system supports target speaker extraction, universal speech enhancement, speech restoration, voice conversion, language-queried audio source separation, and audio tokenization. A demo is available.

Key Works

UniSE

UniSE: A Unified Framework for Decoder-Only Autoregressive LM-Based Speech Enhancement
Supported tasks: SR, TSE, SS

alibaba/unified-audio
