Assessing individuals’ abundance, residency (presence at a site within a certain period) and site fidelity (tendency to return to the same site in subsequent seasons or years) is crucial for ...evaluating and improving the effectiveness of spatial conservation/management measures regarding ecologically and socio-economically valuable species. Using underwater visual census (UVC) and photo-identification (photo-ID) techniques, we estimated the abundance, residency and site fidelity of the dusky grouper,
Epinephelus marginatus
, at two protected sites within the Tavolara-Punta Coda Cavallo Marine Protected Area (Sardinia, Italy) in the summers of 2017–2018. The scope and spatio-temporal resolution of the study was extended by involving volunteer recreational divers in the photo collection. Grouper mean densities varied significantly across sampling dates, with a significant variability between the 2 years and the two investigated sites. At least 94 grouper visited the study sites in the summers of 2017–2018 based on the analysis of 968 high-quality photos using a semi-automated software to photo-identify individuals. Overall, the most frequently sighted grouper was recorded on 32 different days and 21 individuals (22%) identified in 2017 were re-sighted in 2018. The participation of volunteer recreational divers helped detect the inter-site (3.5–4 km apart) movements of a female and a male, supporting previous findings regarding the occurrence of reproduction-related movements. This study provides novel insights into the residency and site-fidelity patterns of the dusky grouper, and its small-scale movements probably related to reproduction. Specifically, we provide indications that effective protection from fishing should encompass the entire area used by grouper for reproductive movements.
Measurements of speech directivity patterns show that it differs according to the phoneme pronounced. In the case of the vowels, these differences can be attributed to variations of the vocal tract ...shape. The multimodal method, which takes into account higher order propagation modes, is used to simulate directivity patterns (2 kHz–15 kHz) for simplified vocal tract geometries of the vowels a, e, i, o and u. The directivity patterns of a simplified and a realistic replica of the vowel a are measured experimentally. The comparison of the experimental data with the simulations shows a good agreement (average difference of 1.7 dB). It is observed that the amplitude, orientation and number of lobes can change significantly for some small frequency intervals, of the order of 100 Hz; these are shown to be caused by higher order modes. These changes can occur as low as 3 kHz if a wide cavity is present near the mouth exit. A small mouth exit limits the effect to specific frequency intervals, and a narrow channel limits the transmission of the higher order mode effect to the mouth exit. The comparison of the directivity measured on a realistic replica corresponding to the vowel a with simulations performed on two simplified geometries shows that a fully asymmetric shape (for which the centers of the cross-sectional contours are not aligned) is qualitatively more realistic than a partially asymmetric shape (when the centers of the cross-sectional contours lie along the same axis).
The human voice is a directional sound source. This property has been explored for more than 200 years, mainly using measurements of human participants. Some efforts have been made to understand the ...anatomical parameters that influence speech directivity, e.g., the mouth opening, diffraction and reflections due to the head and torso, the lips and the vocal tract. However, these parameters have mostly been studied separately, without being integrated into a complete model or replica. The aim of this work was to study the combined influence of the torso, the lips and the vocal tract geometry on speech directivity. For this purpose, a simplified head and torso simulator was built; this simulator made it possible to vary these parameters independently. It consisted of two spheres representing the head and the torso into which vocal tract replicas with or without lips could be inserted. The directivity patterns were measured in an anechoic room with a turntable and a microphone that could be placed at different angular positions. Different effects such as torso diffraction and reflections, the correlation of the mouth dimensions with directionality, the higher-order modes and the increase in directionality due to the lips were confirmed and further documented. Interactions between the different parameters were found. It was observed that torso diffraction and reflections were enhanced by the presence of the lips, that they could be modified or masked by the effect of higher-order modes and that the lips tend to attenuate the effect of higher-order modes.
A detailed understanding of how the acoustic patterns of speech sounds are generated by the complex 3D shapes of the vocal tract is a major goal in speech research. The Dresden Vocal Tract Dataset ...(DVTD) presented here contains geometric and (aero)acoustic data of the vocal tract of 22 German speech sounds (16 vowels, 5 fricatives, 1 lateral), each from one male and one female speaker. The data include the 3D Magnetic Resonance Imaging data of the vocal tracts, the corresponding 3D-printable and finite-element models, and their simulated and measured acoustic and aerodynamic properties. The dataset was evaluated in terms of the plausibility and the similarity of the resonance frequencies determined by the acoustic simulations and measurements, and in terms of the human identification rate of the vowels and fricatives synthesized by the artificially excited 3D-printed vocal tract models. According to both the acoustic and perceptual metrics, most models are accurate representations of the intended speech sounds and can be readily used for research and education.
Acoustic simulation of sound propagation inside the vocal tract is a key element of speech research, especially for articulatory synthesis, which allows one to relate the physics of speech production ...to other fields of speech science, such as speech perception. Usual methods, such as the transmission line method, have a very low computational cost and perform relatively good up to 4-5 kHz, but are not satisfying above. Fully numerical 3D methods such as finite elements achieve the best accuracy, but have a very high computational cost. Better performances are achieved with the state of the art semi-analytical methods, but they cannot describe the vocal tract geometry as accurately as fully numerical methods (e.g. no possibility to take into account the curvature). This work proposes a new semi-analytical method that achieves a better description of the three-dimensional vocal-tract geometry while keeping the computational cost substantially lower than the fully numerical methods. It is a multimodal method which relies on two-dimensional finite elements to compute transverse modes and takes into account the curvature and the variations of cross-sectional area. The comparison with finite element simulations shows that the same degree of accuracy (about 1 % of difference in the resonance frequencies) is achieved with a computational cost about 10 times lower.
Three-dimensional (3-D) numerical approaches for voice production are currently being investigated and developed. Radiation losses produced when sound waves emanate from the mouth aperture are one of ...the key aspects to be modeled. When doing so, the lips are usually removed from the vocal tract geometry in order to impose a radiation impedance on a closed cross-section, which speeds up the numerical simulations compared to free-field radiation solutions. However, lips may play a significant role. In this work, the lips' effects on vowel sounds are investigated by using 3-D vocal tract geometries generated from magnetic resonance imaging. To this aim, two configurations for the vocal tract exit are considered: with lips and without lips. The acoustic behavior of each is analyzed and compared by means of time-domain finite element simulations that allow free-field wave propagation and experiments performed using 3-D-printed mechanical replicas. The results show that the lips should be included in order to correctly model vocal tract acoustics not only at high frequencies, as commonly accepted, but also in the low frequency range below 4 kHz, where plane wave propagation occurs.
Articulatory synthesis generates speech sounds by simulating the physical phenomena involved in speech production. The accuracy of the physical modelling is expected to affect the naturalness of the ...synthesis: the more realistic the description is, the greater the naturalness is expected to be. In this work, the accuracy of acoustic wave propagation in the vocal tract was evaluated with two perceptual experiments. Sustained vowels generated using a one-dimensional acoustic model, a three-dimensional acoustic model and an artificial bandwidth extension algorithm (without a physical basis) were compared. Since the difference between the acoustic methods tested affects mainly the frequencies above 4 kHz, we ensured that the low frequency part of the stimuli, up to 4 kHz, was similar. Thus, the participants' responses were based only on the differences at high frequency. The first experiment was a pair comparison, in which the participants had to select the more natural sounding stimuli. In the second experiment, the participants had to rate the naturalness of the stimuli on a linear scale. The results confirmed that a more accurate physical modeling leads to greater naturalness. However, this was limited to the phonemes /o/ and /u/, for which transverse resonances in the anterior vocal tract may play an important role that only a 3D acoustic simulation can accurately represent. It was also found that male stimuli were perceived as significantly more natural than female ones. However, voice quality did not affect naturalness.
Groupers are marine fishes particularly vulnerable to overexploitation owing to their reproduction‐related traits. The dusky grouper Epinephelus marginatus is classified as ‘Endangered’ in the ...Mediterranean Sea, where protection measures have proven critical for allowing population recovery and persistence of the species. However, knowledge gaps remain regarding its reproductive behaviours and spawning sites. As with other grouper species, the dusky grouper is known to produce courtship‐associated sounds, which were validated in captivity, but no study has ever established a link between sound production and observed behaviours in the wild.
This study aims to characterize, both visually and acoustically, the behavioural patterns of wild dusky grouper, thus specifically linking visual courtship behaviours to the associated sound production. Visual as well as passive acoustic methods were employed at two presumptive spawning sites within a Mediterranean Marine Protected Area (MPA) in the north‐western Mediterranean Sea over two consecutive summer seasons (2017 and 2018).
The temporal patterns of sound production showed site specificity over the month in which reproductive activity was most intense. Courtship calls were mostly recorded around sunset at both study sites and were significantly associated with visual reproductive behaviours. Results also indicated a temporal partitioning in the acoustic activity of the species, with different sound types being recorded at distinct times of the day, likely corresponding to the occurrence of various behaviours during those periods.
These findings have important implications for identifying and monitoring dusky grouper spawning sites using passive acoustic methods, therefore providing valuable information for the development, implementation, and/or enforcement of effective conservation measures therein.
Articulatory synthesis is based on modeling various physical phenomena of speech production, including sound radiation from the mouth. With regard to sound radiation, the most common approach is to ...approximate it in terms of a simple spherical source of strength equal to the mouth volume velocity. However, because this approximation is only valid at very low frequencies and does not account for the diffraction by the head and the torso, we simulated two alternative radiation characteristics that are potentially more realistic: the radiation from a vibrating piston in a spherical baffle, and the radiation from the mouth of a detailed model of the human head and torso. Using the articulatory speech synthesizer VocalTractLab, a corpus of 10 sentences was synthesized with the different radiation characteristics combined with three different phonation types. The synthesized sentences were acoustically compared with natural recordings of the same sentences in terms of their long-term average spectra (LTAS), and evaluated in terms of their naturalness and intelligibility. The intelligibility was not affected by the type of radiation characteristic. However, it was found that the more similar their LTAS was to real speech, the more natural the synthetic sentences were perceived to be. Hence, the naturalness was not directly determined by the realism of the radiation characteristic, but by the combined spectral effect of the radiation characteristic and the voice source. While the more realistic radiation models do not per se improve synthesis quality, they provide new insights in the study of speech production and articulatory synthesis.