•The simplification universal hypothesis has not been fully supported.•Entropy was used to analyse the complexity between translated and native Chinese.•Translational Chinese tends to be simpler at ...the lexical level.•Such a finding was not obtained at the syntactic level.
For a long time, translation researchers, particularly those working in corpus-based translation studies, have held the presumption that translated texts tend to be simpler in lexical and syntactical features than non-translated native texts. Such claims have led to the formulation of the simplification universal hypothesis in translation studies. However, this line of research which focuses predominantly on the investigation of individual linguistic features has failed to provide sufficient evidence to confirm the existence of the simplification universal. To a large extent, the lack of global quantitative indicators for evaluating the complexity level of the translated and non-translated texts has hindered progress in this field. The current study, using entropy as an indicator, analysed the linguistic complexity between translated and native Chinese from the information-theoretical perspective. Our research found that translational Chinese tends to be simpler than its non-translated counterpart at the lexical level based on unigram entropy, but not the syntactic level based on part-of-speech entropy. Our study has confirmed the use of entropy as a reliable measure for lexical and syntactic complexity in the field of translation studies.
A methodology for the development and implementation of inflectional rules for verbs in Karelian Proper is presented. The materials for this study were lemmas and word forms from the Open corpus of ...Veps and Karelian languages (VepKar) and the electronic version of the Karelian language Dictionary. The system of rules for automatic verb inflection for the Karelian Proper supradialect is of both practical and theoretical scientific interest. The new rules have already enabled entering 141 000 Karelian Proper word forms in the VepKar dictionary. The new program for word form generation has significantly reduced the time for adding the full inflectional paradigm of any Karelian Proper verb to the VepKar dictionary. One only needs to fill in several template parameters instead of 125 word forms.
Kokkuvõte. Natalia Krizhanovskaya, Irina Novak, Andrew Krizhanovsky, Nataliya Pellinen: Morfoloogilised muutereeglid päriskarjala verbide jaoks. Artiklis esitletakse metodoloogiat, mida kasutati muutereeglite väljatöötamisel ja rakendamisel päriskarjala verbide jaoks. Materjali moodustasid vepsa ja karjala keele avatud korpusest (VepKar) ning karjala keele sõnaraamatu elektroonilisest versioonist kogutud lemmad ja sõnavormid. Esmakordselt arendati välja reeglite süsteem päriskarjala verbivormide automaatseks genereerimiseks. See on teaduslikult huvipakkuv nii praktilise kui ka teoreetilise poole pealt. Uued reeglid on juba võimaldanud lisada 141 000 päriskarjala sõnavormi VepKar sõnaraamatusse. Uus sõnavormide genereerimise programm on oluliselt vähendanud aega, mis kulub täieliku muuteparadigma lisamisele mingi päriskarjala verbi juurde Vepkar sõnaraamatus. 125 sõnavormi asemel on selleks üksnes vaja täita mallid mõningate parameetritega.
Four sets of word-form tasks were administered during fMRI scanning to 18 child dyslexics and 21 controls to identify unique brain activation associated with four kinds of mapping—orthographic, ...morpheme with and without phonological shift, and phoneme—before treatment, and to measure the effect on each kind of mapping after orthographic and morphological spelling treatment (to which dyslexics were randomly assigned). Dyslexics and/or controls showed significant pretreatment activation in group maps in 18 brain regions during one or more of the mapping tasks. Average fMRI
z-scores were used to determine for each kind of fMRI mapping which of the 18 brain areas (a) differentiated dyslexics and controls before treatment; (b) showed significant pre- to post-treatment activation change in dyslexics; (c) showed post-treatment ‘normalization’ of activation; and (d) changed differently for dyslexics as a function of the kind of treatment received. Dyslexics in orthographic treatment showed reliable change, normalization, and treatment-specific response in right inferior frontal gyrus and right posterior parietal gyrus. Implications of the findings of the combined group map and individual (region of interest) analyses for neurolinguistics, including assessment, treatment and brain plasticity, and the role of different word forms in spelling at a specific developmental stage, are discussed.
•An inattentional blindness paradigm with word form stimuli was combined with ERPs.•Words and consonant strings helped test for orthographic and lexical processing.•A 3-phase design helped separate ...awareness of word forms from task-based reporting.•ERPs suggest orthographic but not lexical processing during inattentional blindness.•VAN correlated closely with awareness, while P3b correlated with the reporting task.
A three-phase inattentional blindness paradigm was combined with ERPs. While participants performed a distracter task, line segments in the background formed words or consonant-strings. Nearly half of the participants failed to notice these word-forms and were deemed inattentionally blind. All participants noticed the word-forms in phase 2 of the experiment while they performed the same distracter task. In the final phase, participants performed a task on the word-forms. In all phases, including during inattentional blindness, word-forms elicited distinct ERPs during early latencies (∼200–280ms) suggesting unconscious orthographic processing. A subsequent ERP (∼320–380ms) similar to the visual awareness negativity appeared only when subjects were aware of the word-forms, regardless of the task. Finally, word-forms elicited a P3b (∼400–550ms) only when these stimuli were task-relevant. These results are consistent with previous inattentional blindness studies and help distinguish brain activity associated with pre- and post-perceptual processing from correlates of conscious perception.
The sudden appearance of a new epidemic disease in China created the need for names identifying that disease. Between December 2019 and January 2020, a variety of severe pneumonia-related disease ...names suddenly appeared, and more name varieties kept coming up afterwards. To better understand the introduction and spread of these names, 16 different COVID-19-related name varieties were selected covering the period from the end of December 2019, when the epidemic started, to mid-March 2020, a moment at which the term competition had stabilized. By way of big data analysis, the initiation and distribution of the 16 names across the media landscape was traced with regard to the impact of different media platforms, while the distribution frequency of each of the selected terms was mapped, resulting in a distinction of three groups of disease names, each with a different media and time profile. The results were discussed based on the hypotheses of disease confusion by name variety and management failures in absence of clear language governance at the national and global levels. The analysis of the data led to a refutation of both hypotheses. Based on this discussion, the study offers empirically based suggestions for the WHO in their naming practices and further research.
This study focused on the associative learning of new word forms in the first language and haptic stimuli. In this study, healthy Japanese participants performed three-step tasks. First, participants ...made nine subjective evaluations of haptic stimuli using five-point semantic differential scales (e.g., regarding stickiness, scored from 1 (not sticky) to 5 (sticky)). Second, the participants carried out learning and recognition tasks for associative pairs of new (meaningless) word forms in their first language (Japanese) and haptic stimulus (H condition), and performed learning and recognition tasks for new (meaningless) word forms only (W condition). The order of conditions was counterbalanced among participants. Third, participants performed free recall tasks. The results of the recognition tasks showed that the proportions and response times of the W condition were better and faster, respectively, than those of the H condition. Furthermore, preference of haptic features negatively correlated with free recall scores of the H condition; however, there was no significant difference between the free recall scores of the H and W conditions. Our results suggest that new word forms were learned better than associative pairs of new word forms and haptic stimuli in a single day of learning. Furthermore, the free recall performance of word forms associated with haptic features could also be affected by their subjective evaluation (preference).
The present book contains all of the articles that were accepted and published in the Special Issue of MDPI’s journal Mathematics titled "Computational Intelligence and Human–Computer Interaction: ...Modern Methods and Applications". This Special Issue covered a wide range of topics connected to the theory and application of different computational intelligence techniques to the domain of human–computer interaction, such as automatic speech recognition, speech processing and analysis, virtual reality, emotion-aware applications, digital storytelling, natural language processing, smart cars and devices, and online learning. We hope that this book will be interesting and useful for those working in various areas of artificial intelligence, human–computer interaction, and software engineering as well as for those who are interested in how these domains are connected in real-life situations.
Automatic speech recognition is essential for establishing natural communication with a human–computer interface. Speech recognition accuracy strongly depends on the complexity of language. Highly ...inflected word forms are a type of unit present in some languages. The acoustic background presents an additional important degradation factor influencing speech recognition accuracy. While the acoustic background has been studied extensively, the highly inflected word forms and their combined influence still present a major research challenge. Thus, a novel type of analysis is proposed, where a dedicated speech database comprised solely of highly inflected word forms is constructed and used for tests. Dedicated test sets with various acoustic backgrounds were generated and evaluated with the Slovenian UMB BN speech recognition system. The baseline word accuracy of 93.88% and 98.53% was reduced to as low as 23.58% and 15.14% for the various acoustic backgrounds. The analysis shows that the word accuracy degradation depends on and changes with the acoustic background type and level. The highly inflected word forms’ test sets without background decreased word accuracy from 93.3% to only 63.3% in the worst case. The impact of highly inflected word forms on speech recognition accuracy was reduced with the increased levels of acoustic background and was, in these cases, similar to the non-highly inflected test sets. The results indicate that alternative methods in constructing speech databases, particularly for low-resourced Slovenian language, could be beneficial.
We explored 12-month-olds' flexibility in accepting phonotactically illegal or ill-formed word forms in a modified associative-learning task. Sixty-four English-learning infants were presented with a ...training phase that either clarified the purpose of a sound-object association task or left the task ambiguous. Infants were then habituated to sets of Czech words with onsets that are illegal in English (e.g., ptak), consonantal sounds (e.g., /l/), or novel functionlike words (e.g., iv). When infants were provided with a training phase that highlighted the purpose of the task, they associated the phonotactically illegal Czech words, but not the consonantal sounds or novel functionlike words, with objects. Thus, English-learning 12-month-old infants' flexibility in associating various sound forms with novel objects is limited to labels that share the structural shape of well-formed nounlike words.
Prefix Tag Clouds Burch, Michael; Lohmann, Steffen; Pompe, Daniel ...
2013 17th International Conference on Information Visualisation,
2013-July
Conference Proceeding
Tag clouds are a popular way to visually represent word frequencies. However, one major limitation is that they do not relate different word forms but treat every form as an individual tag. This ...results not only in a non-efficient use of screen space but, in particular, leaves the viewer with no indication whether there are other forms of a word or not. To overcome this limitation, we introduce prefix tag clouds: a visualization technique that uses a prefix tree to group different word forms and visualizes the sub trees as tag cloud. The grouping is emphasized by color, while the relative frequencies of the word forms are indicated by font size. A circular tag cloud layout supports the quick identification of the most frequent words and word forms. We show the usefulness of the approach for a large dataset of paper titles from the computer science bibliography DBLP.