Skip to main content icon/video/no-internet

Temporal processing refers to the encoding of the time-varying features of sounds by the auditory system. Temporal cues in acoustic stimuli are important for pitch perception and sound localization. Moreover, temporal cues in speech signals carry crucial information for speech understanding, including the manner and place of articulation, voicing, prosody, and intonation. Therefore, temporal processing is one of the fundamental aspects of auditory perception and plays an essential role in speech communication.

Temporal features contained in an acoustical signal are typically classified into two types: temporal envelope and temporal fine structure. Temporal envelope reflects the intensity of fluctuations over time. For example, the temporal envelope of a brief sound event describes how the sound is gated on and off. Temporal fine structure describes the periodicity information contained in sound and closely relates to the perceived pitch. In the context of speech perception, temporal envelope mainly codes information related to the dynamic articulatory changes of the vocal track, while temporal fine structure codes information related to voice pitch and intonation. This entry discusses both envelope processing and fine-structure processing, including the physiological mechanisms, psychophysical studies, and role in speech understanding of each.

Envelope Processing

Physiological Mechanisms for Envelope Processing

For a listener to utilize the cues in the temporal envelopes of sound stimuli, these temporal cues need to be coded efficiently and accurately by the auditory system. Envelope processing by neurons at various stages of the ascending auditory pathway has been extensively studied using amplitude-modulated stimuli in laboratory animals. In a typical experiment, a pure-tone stimulus is amplitude-modulated using a sinusoidal function, and the response from an auditory neuron is analyzed in terms of its synchronization to the stimulus envelope. A synchronization index is typically defined to quantify the degree of synchronization between the stimulus and neural response. If the pattern of neural spikes matches well to that of the stimulus envelope (i.e., the sinusoidal amplitude modulation), the result is a high synchronization index; on the other hand, if the pattern of neural spikes is independent of the stimulus envelope, it would lead to a low synchronization index. The function relating the synchronization index to the amplitude-modulation rate is called the synchronization modulation transfer function. The function relating the firing rate to the amplitude modulation rate is called the rate modulation transfer function.

At the level of the auditory nerve, the synchronization modulation transfer function typically exhibits a low-pass characteristic. That is, the spiral ganglion neurons are able to code the temporal envelope information via synchronized neural activities up to a certain cutoff modulation rate, above which the fidelity of envelope coding degrades as the amplitude modulation rate increases. Across spiral ganglion neurons, the cutoff rate for the synchronization modulation transfer function is higher for neurons tuned to higher frequencies, corresponding to more basal locations in the cochlea. Therefore, the upper rate limit for envelop coding by the spiral ganglion neurons reflects the time constant in peripheral filtering, which is governed by cochlear mechanics. For basal spiral ganglion neurons, the cutoff modulation rate is typically between 600 and 1000 Hz.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading