Authors: Yupeng Shi, Jiatong Shi, Xi Chen, Nengheng Zheng
Audio Samples
Descriptions
WB: the reference wideband speech
NB: the narrowband speech compressed by G.711
GAN_MSE: the estimated wideband speech from the GAN-based SSR system trained by Mean Square Error Loss.
GAN_PWF: the estimated wideband speech from the GAN-based SSR system trained by Perceptual Weighting Filter Loss.
GAN_PM: the estimated wideband speech from the GAN-based SSR system trained by Psychoacoustic Masking Loss.
GAN_PE: the estimated wideband speech from the GAN-based SSR system trained by Perceptual Entropy Loss.
S1 | ||||||
S2 | ||||||
S3 | ||||||
S4 | ||||||
S5 | ||||||
S6 | ||||||
S7 | ||||||
S8 | ||||||
S9 | ||||||
S10 |
Spectrogram
Some spectrogram samples are also shown intuitively.