Neuronal encoding of sound
This article explores the basic physiological principles of sound perception, and traces hearing mechanisms from sound as pressure waves in air to the transduction of these waves into electrical impulses (action potentials) along auditory nerve fibers, and further processing in the brain.
- 1 Introduction
- 2 Transduction
- 3 Brainstem and midbrain
- 4 Auditory cortex
- 5 Recent ideas
- 6 References
The complexities of contemporary neuroscience are continually redefined. Thus what is known now of the auditory system has changed in the recent times and thus conceivably in the next two years or so, much of this will change.
This article is structured in a format that starts with a small exploration of what sound is followed by the general anatomy of the ear which in turn will finally give way to explaining the encoding mechanism of the engineering marvel that is the ear. This article traces the route that sound waves first take from generation at an unknown source to their integration and perception by the auditory cortex.
Basic physics of sound
Sound waves are what physicists call longitudinal waves, which consist of propagating regions of high pressure (compression) and corresponding regions of low pressure (rarefaction).
Amplitude is the size of the pressure variations in a sound wave, and primarily determines the loudness with which the sound is perceived. In a sinusoidal function such as , C represents the amplitude of the sound wave.
Frequency and wavelength
The frequency of a sound is defined as the number of repetitions of its waveform per second, and is measured in hertz; it is inversely proportional to the wavelength. The wavelength of a sound is the distance between any two consecutive matching points on the waveform. The audible frequency range for humans is about 20 Hz to 20 000 Hz at infants. Hearing of higher frequencies decreases with age limiting to about 16000 Hz for adults and even down to 3000 Hz for elders.
Anatomy of the ear
Given the simple physics of sound, the anatomy and physiology of hearing can be studied in greater detail.
The Outer ear consists of the pinna or auricle (visible parts including ear lobes and concha), and the auditory meatus (the passage way for sound). The fundamental function of this part of the ear is to gather sound energy and deliver it to the eardrum. Resonances of the external ear selectively boost sound pressure with frequency in the range 2–5 kHz.
The pinna as a result of its asymmetrical structure is able to provide further cues about the elevation from which the sound originated. The vertical asymmetry of the pinna selectively amplifies sounds of higher frequency from high elevation thereby providing spatial information by virtue of its mechanical design.
The middle ear plays a crucial role in the auditory process, as it essentially converts pressure variations in air to perturbations in the fluids of the inner ear. In other words, it is the mechanical transfer function that allows for efficient transfer of collected sound energy between two different media. The three small bones that are responsible for this complex process are the malleus, the incus, and the stapes, collectively known as the ear ossicles. The impedance matching is done through via lever ratios and the ratio of areas of the tympanic membrane and the footplate of the stapes, creating a transformer-like mechanism. Furthermore the ossicles are arranged in such a manner as to resonate at 700–800 Hz while at the same time protecting the inner ear from excessive energy. A certain degree of top-down control is present at the middle ear level primarily through two muscles present in this anatomical region: the tensor tympani and the stapedius. These two muscles can restrain the ossicles so as to reduce the amount of energy that is transmitted into the inner ear in loud surroundings.
The cochlea has over 32,000 hair cells. Outer hair cells primarily provide amplification of traveling waves that are induced by sound energy, while inner hair cells detect the motion of those waves and excite the (Type I) neurons of the auditory nerve. The basal end of the cochlea, where sounds enter from the middle ear, encodes the higher end of the audible frequency range while the apical end of the cochlea encodes the lower end of the frequency range. This tonotopy plays a crucial role in hearing, as it allows for spectral separation of sounds. A cross section of the cochlea will reveal an anatomical structure with three main chambers (scala vestibuli, scala media, and scala tympani). At the apical end of the cochlea, at an opening known as the helicotrema, the scala vestibuli merges with the scala tympani. The fluid found in these two cochlear chambers is perilymph, while scala media, or the cochlear duct, is filled with endolymph.
Auditory hair cells
The auditory hair cells in the cochlea are at the core of the auditory system's special functionality (similar hair cells are located in the semicircular canals). Their primary function is mechanotransduction, or conversion between mechanical and neural signals. The relatively small number of the auditory hair cells is surprising when compared to other sensory cells such as the rods and cones of the visual system. Thus the loss of low number (in the order of thousands) of auditory hair cells can be devastating while the loss of a larger number of retinal cells (in the order to hundreds of thousands) will not be as bad from a sensory standpoint.
Cochlear hair cells are organized as inner hair cells and outer hair cells; inner and outer refer to relative position from the axis of the cochlear spiral. The inner hair cells are the primary sensory receptors and a significant amount of the sensory input to the auditory cortex occurs from these hair cells. Outer hair cells on the other hand boost the mechanical signal by using electromechanical feedback.
The apical surface of each cochlear hair cell contains a hair bundle. Each hair bundle contains approximately 300 fine projections known as stereocilia, formed by actin cytoskeletal elements. The stereocilia in a hair bundle are arranged in multiple rows of different heights. In addition to the stereocilia, a true ciliary structure known as the kinocilium exists and is believed to play a role in hair cell degeneration that is caused by exposure to high frequencies.
A stereocilium is able to bend at its point of attachment to the apical surface of the hair cell. The actin filaments that form the core of a stereocilium are highly interlinked and cross linked with fibrin, and are therefore stiff and inflexible at positions other than the base. When stereocilia in the tallest row are deflected in the positive-stimulus direction, the shorter rows of stereocilia are also deflected. These simultaneous deflections occur due to filaments called tip links that attach the side of each taller stereocilium to the top of the shorter stereocilium in the adjacent row. When the tallest stereocilia are deflected, tension is produced in the tip links and causes the stereocilia in the other rows to deflect as well. At the lower end of each tip link is one or more mechano-electrical transduction (MET) channels, which are opened by tension in the tip links. These MET channels are cation-selective transduction channels that allow potassium and calcium ions to enter the hair cell from the endolymph that bathes its apical end.
The influx of cations, particularly potassium, through the open MET channels causes the membrane potential of the hair cell to depolarize. This depolarization opens voltage-gated calcium channels to allow the further influx of calcium. This results in an increase in the calcium concentration, which triggers the exocytosis of neurotransmitter vesicles at ribbon synapses at the basolateral surface of the hair cell. The release of neurotransmitter at a ribbon synapse, in turn, generates an action potential in the connected auditory-nerve fiber. Hyperpolarization of the hair cell, which occurs when potassium leaves the cell, is also important, as it stops the influx of calcium and therefore stops the fusion of vesicles at the ribbon synapses. Thus, as elsewhere in the body, the transduction is dependent on the concentration and distribution of ions. The perilymph that is found in the scala tympani has a low potassium concentration, whereas the endolymph found in the scala media has a high potassium concentration and an electrical potential of about 80 millivolts compared to the perilymph. Mechanotransduction by stereocilia is highly sensitive and able to detect perturbations as small as fluid fluctuations of 0.3 nanometers, and can convert this mechanical stimulation into an electrical nerve impulse in about 10 microseconds.
Nerve fibers from the cochlea
There are two types of afferent neurons found in the cochlear nerve: Type I and Type II. Each type of neuron has specific cell selectivity within the cochlea. The mechanism that determines the selectivity of each type of neuron for a specific hair cell has been proposed by two diametrically opposed theories in neuroscience known as the peripheral instruction hypothesis and the cell autonomous instruction hypothesis. The peripheral instruction hypothesis states that phenotypic differentiation between the two neurons are not made until after these undifferentiated neurons attach to hair cells which in turn will dictate the differentiation pathway. The cell autonomous instruction hypothesis states that differentiation into Type I and Type II neurons occur following the last phase of mitotic division but preceding innervations. Both types of neuron participate in the encoding of sound for transmission to the brain.
Type I neurons
Type I neurons innervate inner hair cells. There is significantly greater convergence of this type of neuron towards the basal end in comparison with the apical end. A radial fiber bundle acts as an intermediary between Type I neurons and inner hair cells. The ratio of innervation that is seen between Type I neurons and inner hair cells is 1:1 which results in high signal transmission fidelity and resolution.
Type II neurons
Type II neurons on the other hand innervate outer hair cells. However, there is significantly greater convergence of this type of neuron towards the apex end in comparison with the basal end. A 1:30-60 ratio of innervation is seen between Type II neurons and outer hair cells which in turn make these neurons ideal for electromechanical feedback. Type II neurons can be physiologically manipulated to innervate inner hair cells provided outer hair cells have been destroyed either through mechanical damage or by chemical damage induced by drugs such as gentamicin.
Brainstem and midbrain
Primary auditory neurons carry action potentials from the cochlea into the transmission pathway shown in the image to the right. Multiple relay stations act as integration and processing centers. The signals reach the first level of cortical processing at the primary auditory cortex (A1), in the superior temporal gyrus of the temporal lobe. Most areas up to and including A1 are tonotopically mapped (that is, frequencies are kept in an ordered arrangement). However, A1 participates in coding more complex and abstract aspects of auditory stimuli without coding well the frequency content, including the presence of a distinct sound or its echos.  Like lower regions, this region of the brain has combination-sensitive neurons that have nonlinear responses to stimuli.
Recent studies conducted in bats and other mammals have revealed that the ability to process and interpret modulation in frequencies primarily occurs in the superior and middle temporal gyri of the temporal lobe. Lateralization of brain function exists in the cortex, with the processing of speech in the left cerebral hemisphere and environmental sounds in the right hemisphere of the auditory cortex. Music, with its influence on emotions, is also processed in the right hemisphere of the auditory cortex. While the reason for such localization is not quite understood, lateralization in this instance does not imply exclusivity as both hemispheres do participate in the processing, but one hemisphere tends to play a more significant role than the other.
- Alternation in encoding mechanisms have been noticed as one progresses through the auditory cortex. Encoding shifts from synchronous responses in the cochlear nucleus and later becomes dependent on rate encoding in the inferior colliculus.
- Despite advances in gene therapy that allows for the alteration of the expression of genes that affect audition, such as ATOH1, and the use of viral vectors for such end, the micro-mechanical and neuronal complexities that surrounds the inner ear hair cells, artificial regeneration in vitro remains a distant reality.
- Recent studies suggest that the auditory cortex may not be as involved in top down processing as was previous thought. In studies conducted on primates for tasks that required the discrimination of acoustic flutter, Lemus found that the auditory cortex played only a sensory role and had nothing to do with the cognition of the task at hand.
- Due to the presence of the tonotopic maps in the auditory cortex at an early age, it has been assumed that cortical reorganization had little to do with the establishment of these maps. However, recent work by Kandler et al. has shown that these maps are formed as a result of plastic reorganization on a sub-cellular and circuit level. It should be stressed that the cortex seems to perform a more complex processing than spectral analysis or even spectro-temporal analysis.
- Hudspeth, AJ. (Oct 1989). "How the ear's works work.". Nature 341 (6241): 397–404. doi:10.1038/341397a0. PMID 2677742.
- Hudspeth, AJ. (2001). "How the ear's works work: mechanoelectrical transduction and amplification by hair cells of the internal ear.". Harvey Lect 97: 41–54. PMID 14562516.
- Hudde, H.; Weistenhofer, C. (2006). "Key features of the human middle ear.". ORL J Otorhinolaryngol Relat Spec 68 (6): 324–8. doi:10.1159/000095274. PMID 17065824.
- Hudspeth, AJ.; Konishi, M. (Oct 2000). "Auditory neuroscience: development, transduction, and integration.". Proc National Academy of Sciences U S A 97 (22): 11690–1. doi:10.1073/pnas.97.22.11690. PMC 34336. PMID 11050196.
- Kaas, JH.; Hackett, TA.; Tramo, MJ. (Apr 1999). "Auditory processing in primate cerebral cortex.". Current Opinions in Neurobiology 9 (2): 164–70. doi:10.1016/S0959-4388(99)80022-1. PMID 10322185.
- Fettiplace, R.; Hackney, CM. (Jan 2006). "The sensory and motor roles of auditory hair cells.". Nat Rev Neurosci 7 (1): 19–29. doi:10.1038/nrn1828. PMID 16371947.
- Beurg, M.; Fettiplace, R.; Nam, JH.; Ricci, AJ. (May 2009). "Localization of inner hair cell mechanotransducer channels using high-speed calcium imaging.". Nature Neuroscience 12 (5): 553–8. doi:10.1038/nn.2295. PMC 2712647. PMID 19330002.
- Rubel, EW.; Fritzsch, B. (2002). "Auditory system development: primary auditory neurons and their targets.". Annual Reviews in Neuroscience 25: 51–101. doi:10.1146/annurev.neuro.25.112701.142849. PMID 12052904.
- Chechik, Gal; Nelken (2012). "Auditory abstraction from spectro-temporal features to coding auditory entities". Proceedings of the National Academy of Sciences of the United States of America 108 (44). doi:10.1073/pnas.1111242109. PMC 3503225. PMID 23112145.
- Frisina, RD. (Aug 2001). "Subcortical neural coding mechanisms for auditory temporal processing.". Hearing Research 158 (1-2): 1–27. doi:10.1016/S0378-5955(01)00296-9. PMID 11506933.
- Brigande, JV.; Heller, S. (Jun 2009). "Quo vadis, hair cell regeneration?". Nature Neuroscience 12 (6): 679–85. doi:10.1038/nn.2311. PMC 2875075. PMID 19471265.
- Lemus, L.; Hernández, A.; Romo, R. (Jun 2009). "Neural codes for perceptual discrimination of acoustic flutter in the primate auditory cortex.". Proceeding of the National Academy of Sciences U S A 106 (23): 9471–6. doi:10.1073/pnas.0904066106. PMC 2684844. PMID 19458263.
- Kandler, K.; Clause, A.; Noh, J. (Jun 2009). "Tonotopic reorganization of developing auditory brainstem circuits.". Nature Neuroscience 12 (6): 711–7. doi:10.1038/nn.2332. PMC 2780022. PMID 19471270.