Audio samples from "Disentangling speech from surroundings in a neural audio codec"

Authors: Ahmed Omran, Neil Zeghidour, Zalán Borsos, Félix de Chaumont Quitry, Malcolm Slaney, Marco Tagliasacchi


The audio examples on this page were randomly selected from evaluation splits of the datasets. For noisy samples, neither the speech nor noise components were seen during training. For examples using reverberant speech, the room impulse responses were also drawn from an evaluation set withheld during training.


Disentangling speech from noise


Input sample
Reconstruction
Decoding speech partition
Decoding noise partition
Example 1
   
Example 2
   
Example 3
   
Example 4
   

Scaling the noise partition by a weight factor


1
0.75
0.5
0.25
0
Example 1
Example 2
Example 3
Example 4

Swapping noise between samples


Input A
Input B
Output A
Output B
Example 1
   
Example 2
   
Example 3
   
Example 4
   

Disentangling speech from reverberation


Input sample
Reconstruction
Decoding speech partition
Example 1
   
Example 2
   
Example 3
   
Example 4
   

Scaling the reverberation partition by a weight factor


1
0.75
0.5
0.25
0
Example 1
Example 2
Example 3
Example 4

Swapping reverberation between samples


Input A
Input B
Output A
Output B
Example 1
   
Example 2
   
Example 3
   
Example 4