Input
STFT-SEANet: Reconstruction loss only (better)
STFT-SEANet: Adversarial + recon. loss (worse)
0 0
1 1
2 2
3 3
4 4