Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across ...all languages, an automatic speech recognition system was developed to divide the waveforms into consonantal and vocalic regions. The resulting duration measurements rest strictly on acoustic criteria. Machine classification showed that rhythm measures could separate languages at rates above chance. Within-language variability in rhythm measures, however, was large and comparable to that between languages. Therefore, different languages could not be identified reliably from single paragraphs. In experiments separating pairs of languages, a rhythm measure that was relatively successful at separating one pair often performed very poorly on another pair: there was no broadly successful rhythm measure. Separation of all five languages at once required a combination of three rhythm measures. Many triplets were about equally effective, but the confusion patterns between languages varied with the choice of rhythm measures.
While segmental contrasts are under prosodically strong conditions, acoustic properties encoding the contrastive features are generally exaggerated, which gives rise to phonological enhancement. One ...exception that previous research found for such prosodic effects is on consonantal place of articulation (Cole, Kim, Choi, & Hasegawa-Johnson, 2007; Silbert & de Jong, 2008). Whether this is an issue of the nature of the segments under study, or a language-specific phenomenon is worth further investigation. This paper builds on Chuang and Fon's (2010) study of Taiwan Mandarin alveolar and retroflex sibilants and extends the examination to another dialect of Mandarin, Beijing Mandarin. With a series of map tasks to elicit natural yet well-controlled data, this study asks whether contrastive focus realizations of the alveolar–retroflex contrast vary across vowel contexts between the two dialects. Results show that, consistent with Silbert and de Jong's finding for place distinction in English fricatives, focal prominence may result in the exaggeration of non-contrastive dimensions (i.e., longer syllable and frication duration as well as higher frication amplitude) without enhancing feature-defining properties (i.e., a greater acoustic distance between alveolar and retroflex sibilants). It is suggested that the place feature, particularly in coronal sibilants, is generally less subject to cue-enhancing hyperarticulation, regardless of languages and dialects.
•We examine focus effects on sibilant place contrasts in two dialects of Mandarin.•Map tasks are devised to elicit alveolar and retroflex productions.•Focal prominence increases frication duration and amplitude.•Focal prominence does not result in enhanced spectral distance.•We confirm the limitation of focus effects on acoustic cues marking place contrasts.
•Mandarin neutral tone can be greatly influenced by anticipatory tonal coarticulation.•Long-distance anticipatory tonal coarticulation can be assimilatory and dissimilatory.•Anticipatory tonal ...coarticulation is reduced when across a strong boundary.•Boundary-induced coarticulatory variation is not due to boundary-induced lengthening.
The goal of this study is to examine (1) whether and how the F0 contour of a sequence of neutral tones in Standard Mandarin is affected by the following full tone due to anticipatory tonal coarticulation, (2) whether and how the degree of anticipatory tonal coarticulation varies as a function of the strength of an intervening boundary, and (3) whether boundary-conditioned anticipatory tonal coarticulatory variations can be accounted for by boundary-induced durational changes. By using generalized additive mixed models to analyze time-varying F0 contours of neutral-tone sequence, we are able to formally detect anticipatory tonal coarticulation effects across different boundary conditions while simultaneously taking into account individual variations of 20 speakers. Results show that (a) F0 contours of neutral-tone sequence can be affected by both assimilatory and dissimilatory anticipatory effects and anticipation could spread to non-adjacent neutral tones, that (b) anticipatory tonal coarticulation becomes weaker when crossing a strong boundary than when crossing a weak boundary, and that (c) boundary-induced anticipatory tonal coarticulation reduction cannot be simply accounted for by boundary-induced lengthening effect. Based on the results, we argue that the anticipatory tonal coarticulation patterns reported in this study are more consistent with the soft template mark-up language (Stem-ML) model than with the parallel encoding and target approximation (PENTA) model, and that boundary-induced anticipatory tonal coarticulation reduction is primarily the result of active speech planning.
Abstract There is a prevailing belief that second language (L2) textbooks should strive for authenticity, aiming to accurately and reliably represent natural language use. However, assessing the ...authenticity of language textbooks is not a straightforward task, as it requires both a comparison between instructional texts and real-world language use and a consideration of the comparison within the broader context of language development. This study employs a corpus-based approach to examine word use in 31 widely used commercial textbooks across different proficiency levels, which serve as the primary resource for teaching, learning, and assessment in Chinese language programs at US universities. Using hierarchical cluster analysis, Zipf’s law analysis, and descriptive statistics, we found supportive evidence for the authenticity of word use in Chinese textbooks from a developmental perspective. Specifically, our findings reveal a nuanced gradation in the distribution of words across proficiency levels within these Chinese textbooks. Moreover, as proficiency levels increase, there is a discernible register shift in word use from spoken to written discourse. These observations carry significant implications for the utility of textbooks in Chinese language programs, particularly in the realms of instruction and assessment.
This two‐phased study explored a corpus‐driven, curriculum‐based approach to assessing proficiency development of Mandarin Chinese in US universities. In Phase I, we developed an elicited imitation ...(EI) test based on the Chinese curriculum at a large US university. This test demonstrated higher psychometric qualities to (1) place L2 learners into three curricular levels, and (2) assess learners' production of key vocabulary and grammatical items, providing useful information for placement, diagnosis, and achievement purposes as compared with a non‐curriculum‐based EI test. In Phase II, we revised the EI test to make it applicable across university contexts. We identified a lexico‐grammar core from a corpus‐driven analysis of 36 widely‐used Chinese textbooks in US universities and incorporated this core in EI item development. The revised EI test showed improved psychometric qualities. This study has broad methodological implications for the assessment of Chinese as well as other less commonly taught languages in university contexts.
The Challenge
There is an increasing number of general language proficiency tests available for teachers and program administrators in less commonly taught languages programs. However, are these tests well linked to the language curriculum in your context? How useful are they for placement, diagnosis, and achievement purposes? This article presents a corpus‐driven, curriculum‐based approach to assessing proficiency development of Mandarin Chinese, a less commonly taught language (LCTL) in US universities. The full version of the test is provided, and the methodological implications of the corpus‐driven, curriculum‐based approach for the assessment of Chinese as well as other LCTLs are also discussed.
Some studies have argued that native speakers of tonal languages have been shown to perceive lexical tone continua in a more categorical manner than speakers of non-tonal languages. Among these, ...Zhang and colleagues (NeuroReport 23 (1): 35-9) conducted an event-related potential (ERP) study using an oddball paradigm showing that native Mandarin speakers exhibit different sensitivity to deviant tones that cross category boundaries compared to deviants that belong to the same category as the standard. Other recent ERP findings examining consonant voicing categories question whether perception is truly categorical. The current study investigated these discrepant findings by replicating and extending the Zhang et al. study. Native Mandarin speakers and naïve English speakers performed an auditory oddball detection test while ERPs were recorded. Naïve English speakers were included to test for language experience effects. We found that Mandarin speakers and English speakers demonstrated qualitatively similar responses, in that both groups showed a larger N2 to the across-category deviant and a larger P3 to the within-category deviant. The N2/P3 pattern also did not differ in scalp topography for the within- versus across-category deviants, as was reported by Zhang et al. Cross-language differences surfaced in behavioral results, where Mandarin speakers showed better discrimination for the across-category deviant, but English speakers showed better discrimination for within-category deviants, though all results were near-ceiling. Our results therefore support models suggesting that listeners remain sensitive to gradient acoustic differences in speech even when they have learned phonological categories along an acoustic dimension.
This paper explores the effects of talker-to-listener distance (TLD) on tone and examines its impact on tone perception by both native listeners and second language learners.
Speakers naturally ...adjust vocal effort to talk to people at different distances, which leads to changes not only in intensity, duration and formant frequencies, but also in fundamental frequency (f0). Fundamental frequency is the primary acoustic cue that differentiates Mandarin lexical tones from one another. This study aims to answer the question of whether changes in f0 as a function of TLD affect tone perception by native (L1) and second language (L2) listeners. If so, what are the specific changes that have an impact on tone perception?
The production study investigates the acoustic correlates of the effects of TLD on tone, using 7959 monosyllabic Mandarin speech files recorded by three speakers under 11 levels of TLD. The perception study explores the effects of TLD on L1 and L2 tone perception by 2 native listeners and 52 L2 listeners.
The effects of TLD on speech production are systematic, and we present models and analyses with sufficient detail to simulate these effects. Intensity, duration, and initial and maximum f0 increase along with TLD, while time-normalized toneshapes remain invariant. The results of the perception study show that native listeners' performance is robust under changes in TLD, while L2 listeners' perception of tone interacts with TLD. One significant finding is that tone 3 recognition improves with TLD.
This work investigates the relationship between speech production and perception when acoustic attributes change naturally in response to the demands of speech communication. The results have potential applications in speech synthesis, pronunciation training and second language testing.
•We model the effects of talker-to-listener distance (TLD) on tone.•We examine the effects of TLD on tone perception by L1 and L2 listeners.•Farther TLDs help tone perception by increasing pitch, intensity and duration.•Two invariant tone features preserve tone information under changes in TLD.
•Musical tone discrimination was correlated with Mandarin tone discrimination.•Internal representations of rapid pitch changes appear to differ in three language groups.•Pitch-shift responses ...contribute to the classification of the speaker groups.
Adult second language learners (L2) of Mandarin have to acquire both new perceptual categories for discriminating and identifying lexical pitch variation and new sensorimotor skills to produce rapid tone changes. Perceptual learning was investigated using two perceptual tasks, musical tone discrimination and linguistic tone discrimination, which were administered to 10 naïve adults (native speakers of English with no tonal language exposure), 10 L2 adults, and 9 Mandarin-speaking adults. Changes in sensorimotor skills were examined with a pitch-shift paradigm that examines rapid responses to unexpected pitch perturbations in auditory feedback. Discrimination of musical tones was correlated significantly with discrimination of Mandarin tones, with the clearest advantage (better performance) among Mandarin speakers and some advantage among L2 learners. Group differences were found in the fundamental frequency (F0) contours of responses to pitch-shift stimuli. The F0 contours of Mandarin speakers were least affected quantitatively by the amplitude and direction of pitch perturbations, suggesting more stable internal tone models, while the F0 contours of naïve speakers and L2 learners were significantly altered by the perturbations. Discriminant analysis suggests that pitch-shift responses and tone discrimination predict class membership for the three groups. Discrimination of variations in tone appears to change early in L2 learning, possibly reflecting a process whereby new pitch representations are internalized. These findings indicate that tone discrimination and internal models for audio–vocal control are sensitive to language experience.
In acoustic studies of vowel nasalization, it is sometimes assumed that the primary articulatory difference between an oral vowel and a nasal vowel is the coupling of the nasal cavity to the rest of ...the vocal tract. Acoustic modulations observed in nasal vowels are customarily attributed to the presence of additional poles affiliated with the naso-pharyngeal tract and zeros affiliated with the nasal cavity. We test the hypothesis that oral configuration may also change during nasalized vowels, either enhancing or compensating for the acoustic modulations associated with nasality. We analyze tongue position, nasal airflow, and acoustic data to determine whether American English /i/ and /a/ manifest different oral configurations when they are nasalized, i.e. when they are followed by nasal consonants. We find that tongue position is higher during nasalized ĩ than it is during oral i but do not find any effect for nasalized ã. We argue that speakers of American English raise the tongue body during nasalized ĩ in order to counteract the perceived F1-raising (centralization) associated with high vowel nasalization.
► Many analyses presuppose that nasal and oral vowels have the same oral properties. ► F1-modulation can be attributed to both nasalization and tongue height. ► We find that the tongue is higher in American English /i/ when preceded by a nasal. ► Speakers may be compensating for F1-modulation associated with nasality. ► Speakers may be resisting phonological vowel nasalization by raising the tongue.