Authors: Yupeng Shi, Jiatong Shi, Xi Chen, Nengheng Zheng

Audio Samples

Descriptions

WB: the reference wideband speech.

NB: the narrowband speech, compressed with the G.711 codec.

GAN_MSE: the wideband speech estimated by the GAN-based SSR system trained with the mean squared error (MSE) loss.

GAN_PWF: the wideband speech estimated by the GAN-based SSR system trained with the perceptual weighting filter (PWF) loss.

GAN_PM: the wideband speech estimated by the GAN-based SSR system trained with the psychoacoustic masking (PM) loss.

GAN_PE: the wideband speech estimated by the GAN-based SSR system trained with the perceptual entropy (PE) loss.
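
As an illustration of how a condition like NB might be simulated, the sketch below applies 8-bit mu-law companding, the nonlinearity behind one variant of G.711. It rests on several assumptions: the page does not say whether the mu-law or the A-law variant was used, G.711 itself uses a piecewise-linear approximation of this curve rather than the continuous formula, and resampling to G.711's 8 kHz rate is omitted. The function names are hypothetical.

```python
import numpy as np

MU = 255.0  # companding constant of the mu-law variant (assumed here)


def mulaw_encode(x):
    """Map float samples in [-1, 1] to 8-bit codes with continuous mu-law."""
    x = np.clip(x, -1.0, 1.0)
    # Logarithmic compression: small amplitudes get finer resolution.
    y = np.sign(x) * np.log1p(MU * np.abs(x)) / np.log1p(MU)
    # Uniformly quantize the compressed value to 256 levels.
    return np.round((y + 1.0) / 2.0 * 255.0).astype(np.uint8)


def mulaw_decode(codes):
    """Invert mulaw_encode, returning float samples in [-1, 1]."""
    y = codes.astype(np.float64) / 255.0 * 2.0 - 1.0
    return np.sign(y) * np.expm1(np.abs(y) * np.log1p(MU)) / MU


if __name__ == "__main__":
    t = np.arange(8000) / 8000.0
    tone = 0.5 * np.sin(2 * np.pi * 440.0 * t)  # synthetic test tone
    decoded = mulaw_decode(mulaw_encode(tone))
    print("max abs error:", np.max(np.abs(tone - decoded)))
```

The round trip is lossy by design: the quantization error grows with amplitude, which is the distortion a codec of this kind adds on top of the bandwidth limitation.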

Spectrogram

Spectrograms of some samples are also shown for visual comparison.
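
For readers who want to reproduce this kind of comparison on the audio samples, a log-magnitude spectrogram can be computed roughly as follows. This is a minimal sketch, not the procedure used to produce the figures on this page: the frame length, hop size, Hann window, and 16 kHz sample rate are all assumptions, and `log_spectrogram` is a hypothetical helper name.

```python
import numpy as np


def log_spectrogram(signal, frame_len=512, hop=128):
    """Return a (frames, bins) array of log-magnitude STFT values in dB.

    Assumes len(signal) >= frame_len.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop: i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # A small offset avoids taking the log of zero in silent frames.
    return 20.0 * np.log10(magnitude + 1e-10)


if __name__ == "__main__":
    sample_rate = 16000  # assumed wideband rate, not stated on this page
    t = np.arange(sample_rate) / sample_rate
    tone = np.sin(2 * np.pi * 1000.0 * t)  # synthetic stand-in for a sample
    spec = log_spectrogram(tone)
    print(spec.shape)
```

Under these assumptions, plotting the result for WB next to the result for an estimated signal makes the reconstructed high band, the upper rows of the spectrogram, directly comparable.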