Unified-Audio: An Open-Source Project to Unify Audio Processing and Generation

This project contains a series of works developed for audio (including speech, music, and general audio events) processing and generation, which helps reproducible research in the field of audio. The target of Unified-Audio is to explore a unified framework to handle different audio processing and generation tasks, including:

SR: Speech Restoration (⛳ supported)
TSE: Target Speaker Extraction (⛳ supported)
SS: Speech Separation (⛳ supported)
VC: Voice Conversion (⛳ supported)
LASS: Language-Queried Audio Source Separation (⛳ supported)
AE: Audio Editing (⛳ developing)
TTA: Text to Audio (⛳ developing)
more...

In addition to the frameworks for specific audio tasks, Unified-Audio also provides works involving neural audio codec (NAC), which is the fundamental module to combine audio modality with language models.

🚀 News

2025/09/22: We release UniSE, a foundation model for unified speech generation. The system supports target speaker extraction, universal speech enhancement.demo, Code will comming soon.

key Works

UniSE

UniSE: A Unified Framework for Decoder-Only Autoregressive LM-Based Speech Enhancement supported tasks: SR, TSE, SS

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
UniSE		UniSE
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Unified-Audio: An Open-Source Project to Unify Audio Processing and Generation

🚀 News

key Works

UniSE

About

Uh oh!

Releases

Packages

License

melo1998/unified-audio

Folders and files

Latest commit

History

Repository files navigation

Unified-Audio: An Open-Source Project to Unify Audio Processing and Generation

🚀 News

key Works

UniSE

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages