This paper focuses on the symmetry of positive morphological first-language influence, based on examples from EILC and ICLFI that were chosen according to the convergence and divergence between Estonian and Finnish morphological structures. Five groups of elative singular forms were analyzed in texts at proficiency levels B1 and B2 on the scale of the Common European Framework of Reference for Languages: Learning, teaching, assessment (CEFR), written by Estonian learners of Finnish and Finnish learners of Estonian.
Opsomming: Voor Pampus liggen het 'n oordragtelike betekenis en idiomatiese status in Nederlands verwerf, maar is gewoonlik onbekend aan die Afrikaanse leksikograaf buite die Nederlandse konteks. Vir 'n Afrikaanse leksikograaf met sy unieke verwysingsraamwerk en kulturele en sosiale waardeoordele word dit slegs in 'n Nederlandse milieu duidelik waarom die leksikografiese redigering van Nederlandse lemmata in 'n vertalende Nederlands-Afrikaanse woordeboek so 'n groot uitdaging inhou, ten spyte van die feit dat die twee tale so nou verwant is. Taalversteuring en -beïnvloeding vind makliker plaas tussen nou verwante tale. Taalrelatiwisme impliseer dat elkeen van die betrokke tale 'n unieke soort perspektief op die taaleie werklikheid bied vanweë 'n unieke lewens- en wêreldbeskouing wat 'n inherente deel uitmaak van 'n betrokke moedertaal. Belangrike metaleksikografiese riglyne sluit die oordeelkundige aanwending van pragmatiese leiding (bv. etikette), goedgekose, verhelderende sitate en poëme en relevante gebruiksleiding in. Dit kan die teikengebruiker van 'n vertalende woordeboek 'n insig gee in die unieke en kultuurspesifieke woordeskatelemente wat die tale van mekaar onderskei.
Sleutelwoorde: NOU VERWANTE TALE, VERTALENDE NEDERLANDS–AFRIKAANSE WOORDEBOEK, BRONTAAL, DOELTAAL, SKUIFFUNKSIE, OMKERING, KOMBINASIELEMMA, UNIEKE VERWYSINGSRAAMWERK, SOSIO-KULTURELE WAARDEOORDELE, TAALVERSTEURING EN -BEÏNVLOEDING, TAALRELATIWISME, SAPIR–WHORF-HIPOTESE, LINGUISTIESE RELATIWITEIT EN DETERMINISME, METALEKSIKOGRAFIESE RIGLYNE, KOLLOKASIES, ETIKETTE, PRAGMATIESE LEIDING, SITATE, POËME, GEBRUIKSLEIDING, RAME, METATAAL, TWEETALIGE LEKSIKOGRAFIE
Abstract: Challenges for the Afrikaans Lexicographer with regard to Dutch Lemmata in a Dutch-Afrikaans Translation Dictionary. Voor Pampus liggen has acquired a metaphorical meaning and idiomatic status in Dutch, but is generally unknown to the Afrikaans lexicographer outside the Dutch context. For the Afrikaans lexicographer with his unique frame of reference and cultural and social value judgements, it becomes clear only in a Dutch milieu why the lexicographical editing of Dutch lemmata in a Dutch-Afrikaans translation dictionary is such a big challenge, despite the fact that the two languages are so closely related. Language interference and influence occur more readily between closely related languages. Language relativity implies that each of the languages concerned gives a unique perspective on its own reality, because of a unique life- and world-view which forms an inherent part of a particular mother tongue. Important metalexicographic guidelines include the judicious application of pragmatic signals (e.g. labels), well-chosen clarificatory citations and made-up examples, and relevant guidance as to usage. This can give the target user of such a dictionary an insight into the unique and culture-specific lexical elements which distinguish the two languages from each other.
Keywords: CLOSELY-RELATED LANGUAGES, DUTCH-AFRIKAANS TRANSLATION DICTIONARY, SOURCE LANGUAGE, TARGET LANGUAGE, SHIFT FUNCTION, REVERSAL, COMBINATION LEMMA, UNIQUE FRAME OF REFERENCE, SOCIO-CULTURAL VALUE JUDGEMENTS, LINGUISTIC INTERFERENCE AND INFLUENCE, LANGUAGE RELATIVISM, SAPIR-WHORF HYPOTHESIS, LINGUISTIC RELATIVITY AND DETERMINISM, METALEXICOGRAPHICAL GUIDELINES, COLLOCATIONS, LABELS, PRAGMATIC SIGNALS, CITATIONS, MADE-UP EXAMPLES, GUIDANCE FOR USE, FRAMES, METALANGUAGE, BILINGUAL LEXICOGRAPHY
Our aim is to discuss the language strategies of a trilingual child who uses Estonian, Finnish and Norwegian on a daily basis. In particular, we focus on the influence of the Finnish language on the Estonian language used by our informant. Estonian and Finnish are closely related languages, and both have rich morphological systems. We argue that the interaction between morphologically rich inflectional systems is different from the interaction between morphologically very different systems.
Our informant speaks Estonian with her mother and brother and Norwegian with her father. She lives in Finland, and Finnish is her school language as well as the language of many daily situations. At the time of collecting the data, she was 10 years old.
Because Hindi and Punjabi are a closely related language pair, a hybrid machine translation approach has been used to develop a Hindi-to-Punjabi machine translation system. The non-availability of lexical resources, spelling variations in the source-language text, ambiguous words in the source text, named entity recognition and collocations are the major challenges faced while developing this system. The key stages of the translation process are preprocessing, the translation engine and post-processing. Lookup algorithms, pattern-matching algorithms, etc. formed the basis for solving these issues. The system's accuracy has been evaluated using an intelligibility test, an accuracy test and the BLEU score. The hybrid system is found to perform better than the constituent systems.
Machine Translation (MT) is very useful in supporting multicultural communication. The resources required by existing approaches are scarce for low-resource languages: Statistical Machine Translation (SMT) requires corpora of high quality and quantity, and Rule-Based Machine Translation (RBMT) requires bilingual dictionaries and morphological, syntactic, and semantic analyzers. Due to this lack of language resources, it is difficult to create MT from high-resource languages to low-resource languages such as Indonesian ethnic languages. Nevertheless, the characteristics of Indonesian ethnic languages motivate us to introduce Pivot-Based Hybrid Machine Translation (PHMT), which combines SMT and RBMT with Indonesian as a pivot and which we further utilize in a multilingual communication support system. We evaluate PHMT translation quality with fluency and adequacy as metrics and then evaluate the usability of the system. Despite the medium average translation quality (fluency score 3.05, adequacy score 3.06), the average usability score of 3.71 indicates that the system is useful for supporting multilingual collaboration.
The constraint-based approach has proven useful for inducing bilingual lexicons for closely related low-resource languages. When we want to create multiple bilingual dictionaries linking several languages, we need to consider manual creation by bilingual language experts if no machine-readable dictionaries are available as input. Because planning the creation of bilingual dictionaries means weighing various methods and their costs, plan optimization is essential. We adopt the Markov Decision Process (MDP) to formalize plan optimization for creating bilingual dictionaries; the goal is to better predict the most feasible optimal plan with the least total cost before fully implementing the constraint-based bilingual dictionary induction framework. We define heuristics based on input language characteristics to devise a baseline plan for evaluating our MDP-based approach, with total cost as the evaluation metric. The MDP-based proposal outperformed heuristic planning on total cost for all datasets examined.
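The planning idea in this abstract can be illustrated with a toy value-iteration sketch. Everything below is an assumption made for illustration and is not taken from the paper: the three language labels, the two actions (manual creation and pivot-based induction), their costs, and the success probability of induction. A state is the set of dictionaries already built, and the value of a state is the minimum expected total cost of completing all dictionaries from it.

```python
from itertools import combinations

# All names and numbers here are hypothetical, chosen only for illustration.
LANGS = ("A", "B", "C")
PAIRS = [frozenset(p) for p in combinations(LANGS, 2)]
MANUAL_COST = 10.0     # assumed cost of manual creation by experts
INDUCE_COST = 2.0      # assumed cost of one induction attempt via a pivot
INDUCE_SUCCESS = 0.8   # assumed probability that an induction attempt succeeds


def actions(state):
    """Yield (name, cost, [(probability, next_state), ...]) for each missing pair."""
    for pair in PAIRS:
        if pair in state:
            continue
        done = state | {pair}
        # Manual creation always succeeds in this toy model.
        yield (f"manual {sorted(pair)}", MANUAL_COST, [(1.0, done)])
        x, y = sorted(pair)
        for z in LANGS:
            if z in pair:
                continue
            # Induction needs both dictionaries that connect the pair through pivot z.
            if frozenset((x, z)) in state and frozenset((z, y)) in state:
                yield (
                    f"induce {sorted(pair)} via {z}",
                    INDUCE_COST,
                    [(INDUCE_SUCCESS, done), (1.0 - INDUCE_SUCCESS, state)],
                )


def all_states():
    """Enumerate every subset of dictionaries that could already exist."""
    for r in range(len(PAIRS) + 1):
        for subset in combinations(PAIRS, r):
            yield frozenset(subset)


def value_iteration(tol=1e-9):
    """Return a dict mapping each state to its minimum expected total cost."""
    goal = frozenset(PAIRS)
    values = {s: 0.0 for s in all_states()}
    while True:
        delta = 0.0
        for s in values:
            if s == goal:
                continue  # nothing left to build, cost stays 0
            best = min(
                cost + sum(p * values[t] for p, t in outcomes)
                for _, cost, outcomes in actions(s)
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values


if __name__ == "__main__":
    values = value_iteration()
    print(round(values[frozenset()], 3))
```

Under these assumed numbers, the cheapest plan from an empty start builds two dictionaries manually and induces the third, for an expected total cost of about 22.5 (10 + 10 + 2 / 0.8). A heuristic baseline could be compared against this value in the same units.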
Dice's coefficient on trigram profiles as metric for language similarity Oco, Nathaniel; Romeritch Syliongka, Leif; Roxas, Rachel Edita ...
2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE),
2013-Nov.
Conference Proceeding
In this study, we present Dice's coefficient on trigram profiles as a metric for language similarity. As a testbed, we focused on eight Philippine languages. No known language similarity value for these languages exists. Documents containing transcribed audio recordings, news articles, religious and literary texts were taken from an online corpus and used as training data. Character trigram profiles were then generated using an n-gram generator and language similarity was computed. The results were matched against those reported in the literature and against the language family tree. To evaluate the metric, it was applied to five languages with known similarity values. The results were then compared with an existing lexical similarity metric. The average difference is 27%. Analyses of the results reveal that phonetic spelling plays an important role in language similarity. As future work, the metric can be used on phonetic transcriptions.
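The computation described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how profiles are normalized, truncated or compared, so the sketch assumes lowercased text, collapsed whitespace, and Dice's coefficient taken over sets of distinct character trigrams, 2|A ∩ B| / (|A| + |B|). The function names and the optional `top_k` cut-off are hypothetical.

```python
from collections import Counter
from typing import Optional, Set


def trigram_profile(text: str, top_k: Optional[int] = None) -> Set[str]:
    """Return the set of character trigrams in `text`.

    Assumption: text is lowercased and runs of whitespace are collapsed.
    If `top_k` is given, only the `top_k` most frequent trigrams are kept.
    """
    cleaned = " ".join(text.lower().split())
    counts = Counter(cleaned[i:i + 3] for i in range(len(cleaned) - 2))
    if top_k is None:
        return set(counts)
    return {gram for gram, _ in counts.most_common(top_k)}


def dice(profile_a: Set[str], profile_b: Set[str]) -> float:
    """Dice's coefficient between two trigram sets: 2|A & B| / (|A| + |B|)."""
    if not profile_a and not profile_b:
        return 0.0  # convention chosen here for two empty profiles
    shared = len(profile_a & profile_b)
    return 2.0 * shared / (len(profile_a) + len(profile_b))
```

To compare two languages under these assumptions, build one profile per language from its training documents, for example `dice(trigram_profile(corpus_a), trigram_profile(corpus_b))`. The score ranges from 0 (no shared trigrams) to 1 (identical profiles).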
Computational approaches to language identification often result in a high number of false positives and low recall rates, especially if the languages involved come from the same subfamily. In this paper, we aim to determine the cause of this problem by measuring language similarity through trigrams. Religious and literary texts were used as training data. Our experiments involving language identification show that the number of common trigrams for a given language pair is inversely proportional to precision and recall rates, whereas the average word length is directly proportional to the number of true positives. Future directions include improving language modeling and providing an approach to increase precision and recall.