-
Notifications
You must be signed in to change notification settings - Fork 157
Getting Started
The reported performance of the FullSuBNet in the paper is on this dataset.
It consists of 65000 speech clips (30s per clip) and 65000 noise clips (10s per clip). You can download this dataset from https://github.com/microsoft/DNS-Challenge.git.
This Git repository contains the DNS Challenge dataset (INTERSPEECH 2020) and the newer DNS Challenge dataset (ICASSP 2021). The default branch of the
Git repository is the ICASSP 2021 Dataset. You need to check out the default branch to the interspeech2020
branch.
Use the default training configuration:
cd FullSubNet/recipes/dns_interspeech_2020
# Use a default config and two GPUs to train the FullSubNet model
CUDA_VISIABLE_DEVICES=0,1 python train.py -C fullsubnet/train.toml -N 2
# Use default config and one GPU to train the Fullband baseline model
CUDA_VISIABLE_DEVICES=0 python train.py -C fullband_baseline/train.toml -N 1
# Resume the experiment using "-R" parameter
CUDA_VISIABLE_DEVICES=0,1 python train.py -C fullband_baseline/train.toml -N 2 -R
Assuming that:
- The file path of the training configuration is
fullsubnet/train.toml
- In the training configuration,
save_dir = "~/Experiments/FullSubNet/FullSubNet"
The logs will be stored in the ~/Experiments/FullSubNet/FullSubNet/train
directory. This directory contains the following:
-
logs/
directory: store the Tensorboard related data, including loss curves, audio files, and spectrogram figures. -
checkpoints/
directory: stores all checkpoints of the model, from which you can resume the training or start an inference. -
*.toml
file: the backup of the training configuration.
In the logs/
directory, use the following command to visualize loss curves, spectrogram figures, and audio files during the training and the
validation.
tensorboard --logdir ~/Experiments/FullSubNet/FullSubNet/train
# specify a port
tensorboard --logdir ~/Experiments/FullSubNet/FullSubNet/train --port 45454
TODO
TODO
# DNS-INTERSPEECH-2020
python tools/calculate_metrics.py \
-R ~/Datasets/DNS-Challenge-INTERSPEECH/datasets/test_set/synthetic/no_reverb/clean \
-E ~/Enhancement/fullsubnet_dns_interspeech_no_reverb/enhanced_1155 \
-M SI_SDR,STOI,WB_PESQ,NB_PESQ \
-S DNS_1