The anatomy of the auditory region of fossil hominins may shed light on the emergence of human spoken language. Humans differ from other great apes in several features of the external, middle and ...inner ear (e.g., short external ear canal, small tympanic membrane, large oval window). However, the functional implications of these differences remain poorly understood as comparative audiometric data from great apes are scarce and conflicting. Here, we measure the sound transfer function of the external and middle ears of humans, chimpanzees and bonobos, using laser-Doppler vibrometry and finite element analysis. This sound transfer function affects auditory thresholds, which relate to speech reception thresholds in humans. Unexpectedly we find that external and middle ears of chimpanzees and bonobos transfer sound better than human ones in the frequency range of spoken language. Our results suggest that auditory thresholds of the last common ancestor of Homo and Pan were already compatible with speech reception as observed in humans. Therefore, it seems unlikely that the morphological evolution observed in the bony auditory region of fossil hominins was driven by the emergence of spoken language. Instead, the peculiar human configuration may be a by-product of morpho-functional constraints linked to brain expansion.
Articulatory synthesis is based on modeling various physical phenomena of speech production, including sound radiation from the mouth. With regard to sound radiation, the most common approach is to ...approximate it in terms of a simple spherical source of strength equal to the mouth volume velocity. However, because this approximation is only valid at very low frequencies and does not account for the diffraction by the head and the torso, we simulated two alternative radiation characteristics that are potentially more realistic: the radiation from a vibrating piston in a spherical baffle, and the radiation from the mouth of a detailed model of the human head and torso. Using the articulatory speech synthesizer VocalTractLab, a corpus of 10 sentences was synthesized with the different radiation characteristics combined with three different phonation types. The synthesized sentences were acoustically compared with natural recordings of the same sentences in terms of their long-term average spectra (LTAS), and evaluated in terms of their naturalness and intelligibility. The intelligibility was not affected by the type of radiation characteristic. However, it was found that the more similar their LTAS was to real speech, the more natural the synthetic sentences were perceived to be. Hence, the naturalness was not directly determined by the realism of the radiation characteristic, but by the combined spectral effect of the radiation characteristic and the voice source. While the more realistic radiation models do not per se improve synthesis quality, they provide new insights in the study of speech production and articulatory synthesis.
Endoscopic optical coherence tomography (OCT) offers a non-invasive approach to perform the morphological and functional assessment of the middle ear in vivo. However, interpreting such OCT images is ...challenging and time-consuming due to the shadowing of preceding structures. Deep neural networks have emerged as a promising tool to enhance this process in multiple aspects, including segmentation, classification, and registration. Nevertheless, the scarcity of annotated datasets of OCT middle ear images poses a significant hurdle to the performance of neural networks. We introduce the Dresden in vivo OCT Dataset of the Middle Ear (DIOME) featuring 43 OCT volumes from both healthy and pathological middle ears of 29 subjects. DIOME provides semantic segmentations of five crucial anatomical structures (tympanic membrane, malleus, incus, stapes and promontory), and sparse landmarks delineating the salient features of the structures. The availability of these data facilitates the training and evaluation of algorithms regarding various analysis tasks with middle ear OCT images, e.g. diagnostics.
Acousto-mechanical properties of the human tympanic membrane mainly depend on the connective tissue in its layered structure. Using polarization-sensitive optical coherence tomography, a ...depth-resolved imaging technique which provides additional tissue specific contrast, polarization changes of the birefringent layers in the human tympanic membrane were detected. By depicting estimated local retardances, distinguishing different tissue types was possible. This suggests the ability to image pathological alterations of the connective tissue with PSOCT, which extends the conventional diagnostic methods in middle ear surgery.
Tympanic membrane (TM) thickness is an important parameter for differentiation between a healthy and a pathologic TM. Furthermore, it is needed for modeling the middle ear function. Endoscopic ...optical coherence tomography (eOCT) provides the opportunity to measure the TM thickness of the entire TM in vivo.
A total of 27 healthy ears were examined by eOCT. The system uses a light source with a central wavelength of 1,300 nm. The endoscope with an outer diameter of 3.5 mm provides a field of view of 10 mm and a working distance of 10 mm. Thickness measurements were carried out at 8 points on the TM. Additionally, the existing literature was analyzed, and a mean TM thickness value was determined.
The mean thickness of the TM over all measurement points of the pars tensa was 120.2 μm, and the pars flaccida was significantly thicker with a mean thickness of 177.9 μm. Beyond that, there were no significant differences between the single quadrants. The mean TM thickness in the literature was 88.8 μm.
EOCT provides the possibility for in vivo thickness determination of the TM. The mean thickness seems to be higher than in the previous studies, which were mostly carried out ex vivo. Our study takes the three-dimensional refraction into account and provides a method for the refraction correction.
Although the Vibrant Soundbridge is one of the most frequently used active middle ear implants, data regarding how middle ear ventilation disorders may affect the transmission behavior of its ...floating mass transducer are still insufficient. Studies involving coupling the floating mass transducer to the stapes head are particularly lacking.
This temporal bone study evaluated the influence of simulated middle ear ventilation disorders on the middle ear transfer function in the reconstructed middle ear. The middle ear transfer function was measured using Laser Doppler Vibrometry after vibroplasty onto the stapes head, with or without tympanic membrane reconstruction. Middle ear ventilation disorders were simulated through changes in static pressure via the external ear channel with a maximum pressure of +3 kPa. Slice thickness of tympanic membrane reconstruction material was measured using micro-CT.
When the reconstructed ossicular chain and the reconstructed tympanic membrane were mechanically excited by the floating mass transducer under conditions of ambient static pressure, the transmission behavior was found to be independent of the type of tissue used. Increase in static pressure up to +3 kPa caused maximum low frequency transmission loss of 15 dB when elastic grafts were used and 5 dB when stiff tissue was inserted. At high frequencies, measured loss of up to 5 dB was relatively independent of the tissue stiffness.
Increase in static pressure led to displacement of the tissues towards the vestibulum and caused stiffening, especially of the annular ligament. Stiffening-induced transmission losses were mainly found at low frequencies and could not be compensated by the floating mass transducer in this range. Above 1300 Hz, the continuous force spectrum of the actuator sufficiently protected against loss of amplitude. To minimize postoperative transmission loss due to persisting ventilation disorders, choosing a very stiff tympanic membrane reconstruction material seems to be appropriate.
•VSB implantation can be recommended in cases of middle ear ventilation disorders.•The force spectrum of the FMT minimizes high-frequency loss of amplitude.•For TM reconstruction, the use of stiff tissue is recommended.•Cartilage's stiffness can be modified by varying the slice thickness.
Despite the complete embodiment of the stapedius muscle (SM) into the pyramidal eminence, it is possible to safely gain access to the SM belly via a retrofacial approach. This presents a novel ...approach to directly measure the electrically evoked stapedius reflex threshold (eSRT).
Objective fitting of maximum comfortable loudness levels for cochlear implant users can improve the benefit introduced by the device. Sensing SM activity via direct surgical access represents a potential tool for objective eSRT fitting.
Eighteen human temporal bones (TBs) were used. Micro-computed tomography was performed for six TBs. Standard computed tomography for six TBs. Manual 3D-segmentation of the relevant middle and inner ear anatomy was performed on 12 TBs. Mastoidectomy and posterior tympanotomy allowed the access to middle ear of all 18 the TBs. Once identified the mastoidal segment of the facial nerve (FN), the retrofacial access to the SM was drilled.
The total access rate was 72.2%. Only in the first three cases the posterior semi-circular canal was hit. The SM access was identified posterior to the FN at a 4 ± 0.78 mm distance from the stapes' head, almost halfway to the chorda tympani's branching point along the FN direction. The drilling depth to access the SM posterior to the external surface of FN on average was 2 ± 0.30 mm. The exposure took on average of 5 to 8 minutes.
The retrofacial approach seems to offer a feasible and reproducible access to the SM belly opening an avenue to electromyographic sensing of eSRT.