Research PaperNoise and pitch interact during the cortical segregation of concurrent speech

Article ID	Journal	Published Year	Pages	File Type
5739352	Hearing Research	2017	11 Pages	PDF

Abstract

â¢Measured cortical correlates of concurrent speech segregation via ERPs.â¢Varied pitch (F0) and noise (SNR) cues to assess intrinsic and extrinsic factors.â¢P2 (200Â ms) paralleled behavioral accuracy and predicted perceptual segregation.â¢Neural classifiers show speech activity segregated earlier for SNR than F0 cues.â¢Noise/pitch interact in deciphering concurrent speech mixtures within 200Â ms.

Behavioral studies reveal listeners exploit intrinsic differences in voice fundamental frequency (F0) to segregate concurrent speech sounds-the so-called “F0-benefit.” More favorable signal-to-noise ratio (SNR) in the environment, an extrinsic acoustic factor, similarly benefits the parsing of simultaneous speech. Here, we examined the neurobiological substrates of these two cues in the perceptual segregation of concurrent speech mixtures. We recorded event-related brain potentials (ERPs) while listeners performed a speeded double-vowel identification task. Listeners heard two concurrent vowels whose F0 differed by zero or four semitones presented in either clean (no noise) or noise-degraded (+5Â dB SNR) conditions. Behaviorally, listeners were more accurate in correctly identifying both vowels for larger F0 separations but F0-benefit was more pronounced at more favorable SNRs (i.e., pitchÂ ÃÂ SNR interaction). Analysis of the ERPs revealed that only the P2 wave (â¼200Â ms) showed a similar F0 x SNR interaction as behavior and was correlated with listeners' perceptual F0-benefit. Neural classifiers applied to the ERPs further suggested that speech sounds are segregated neurally within 200Â ms based on SNR whereas segregation based on pitch occurs later in time (400-700 ms). The earlier timing of extrinsic SNR compared to intrinsic F0-based segregation implies that the cortical extraction of speech from noise is more efficient than differentiating speech based on pitch cues alone, which may recruit additional cortical processes. Findings indicate that noise and pitch differences interact relatively early in cerebral cortex and that the brain arrives at the identities of concurrent speech mixtures as early as â¼200Â ms.

Keywords

Neural classification Speech-in-noise perception