This is the demo repository for DART, accepted at the Audio Imagination workshop of NeurIPS 2024.
The code used in the paper is provided here.
Audio samples are on the associated webpage: https://amaai-lab.github.io/DART/
This code is based on https://github.com/keonlee9420/Comprehensive-Transformer-TTS
To train on L2-ARCTIC, you can run:
CUDA_VISIBLE_DEVICES=0 python train.py --dataset L2ARCTIC
For inference from a checkpoint, you can use the two included scripts, synthesize_converted.py or synthesize_stats_valset.py. The former synthesizes the sentences hard-coded in the script (looping over speakers, accents, and sentences), while the latter synthesizes from a metadata .txt file. Note that before running either synthesis script, you must first run extract_stats.py on your current checkpoint to extract and save the MLVAE embeddings for speakers and accents.
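For example, assuming extract_stats.py accepts the same --dataset and --restore_step arguments as the training and synthesis scripts (this is an assumption, not confirmed by the paper), the extraction step might look like:

CUDA_VISIBLE_DEVICES=0 python extract_stats.py --dataset L2ARCTIC --restore_step 704000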
An example use of the synthesis scripts is:
CUDA_VISIBLE_DEVICES=0 python synthesize_converted.py --dataset L2ARCTIC --restore_step 704000
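Similarly, assuming synthesize_stats_valset.py takes the same arguments and reads the path to the metadata .txt file from the script or its config (again an assumption), a run might look like:

CUDA_VISIBLE_DEVICES=0 python synthesize_stats_valset.py --dataset L2ARCTIC --restore_step 704000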