Modulation of the Primary Auditory Thalamus When Recognizing Speech with Background Noise

Abstract
Recognising speech in background noise is a strenuous daily activity, yet most humans can master it. An explanation of how the human brain deals with such sensory uncertainty during speech recognition is to-date missing. Previous work has shown that recognition of speech without background noise involves modulation of the auditory thalamus (medial geniculate body, MGB): There are higher responses in left MGB for speech recognition tasks that require tracking of fast-varying stimulus properties in contrast to relatively constant stimulus properties (e.g., speaker identity tasks) despite the same stimulus input. Here we tested the hypotheses that (i) this task-dependent modulation for speech recognition increases in parallel with the sensory uncertainty in the speech signal, i.e., the amount of background noise and that (ii) this increase is present in the ventral MGB, which corresponds to the primary sensory part of the auditory thalamus. In accordance with our hypothesis, we show—by using ultra-high-resolution functional magnetic resonance imaging in male and female human participants—that the task-dependent modulation of the left vMGB for speech is particularly strong when recognizing speech in noisy listening conditions in contrast to situations where the speech signal is clear. The results imply that speech in noise recognition is supported by modifications at the level of the subcortical sensory pathway providing driving input to the auditory cortex. SIGNIFICANCE STATEMENT: Speech recognition in noisy environments is a challenging everyday task. One reason why humans can master this task is the recruitment of additional cognitive resources as reflected in recruitment of non-language cerebral cortex areas. Here, we show that also modulation in the primary sensory pathway is specifically involved in speech in noise recognition. We found that the left primary sensory thalamus (ventral medial geniculate body, vMGB) is more involved when recognizing speech signals as opposed to a control task (speaker identity recognition) when heard in background noise vs. when the noise was absent. This finding implies that the brain optimises sensory processing in subcortical sensory pathway structures in a task-specific manner to deal with speech recognition in noisy environments.