Interaural coherence (icProc.m)

The IC is estimated by determining the maximum value of the normalised CCF. It has been suggested that the IC can be used to select TF units where the binaural cues (ITDs and ILDs) are dominated by the direct sound of an individual sound source, and thus, are likely to reflect the true location of one of the active sources [Faller2004]. The IC processor does not have any controllable parameters itself, but it depends on the settings of the CCF processor, which is described in Cross-correlation (crosscorrelationProc.m). The IC representation is computed by using the request entry ’ic’.

The application of the IC processor is demonstrated by the script DEMO_IC, which produces the following four plots shown in Fig. 41. The top left and bottom left panels show the anechoic and reverberant speech signal, respectively. It can be seen that the time domain signal is smeared due to the influence of the reverberation. The IC for the anechoic signal is close to one for most of the individual TF units, which indicates that the corresponding binaural cues are reliable. In contrast, the IC for the reverberant signal is substantially lower for many TF units, suggesting that the corresponding binaural cues might be unreliable due to the impact of the reverberation.


Fig. 41 Time domain signals and the corresponding interaural coherence as a function of time frames and frequency channels estimated for a speech signal in anechoic and reverberant conditions. Anechoic speech (top left panel) and the corresponding IC (top right panel). Reverberant speech (bottom left panel) and the corresponding IC (bottom right panel).

[Faller2004]Faller, C. and Merimaa, J. (2004), “Source localization in complex listening situations: Selection of binaural cues based on interaural coherence,” Journal of the Acoustical Society of America 116(5), pp. 3075–3089.