In today's globalized era, language corpora serve as a teaching tool and a multifunctional electronic linguistic system. Because a corpus combines lexicographic resources with a text base as a source of information, users can acquire basic knowledge and expand their vocabulary. Every language has phrasemes, which are regarded as linguocultural units. Phrasemes carry a figurative meaning and make the expression of thought more attractive and persuasive. In education, it is important to enrich students' speech with phrasemes. It is therefore necessary to build a phraseme base into linguistic digital technologies, including language corpora, and for this purpose the lexicographic database should include a phraseme database. This article discusses why linguistic corpora should contain a database of expressions and what significance such a database has. It also shows methods of working with phrasemes in the Uzbek language educational corpus.
The essay presents the project of a Church Slavonic-Italian lexicon undertaken in 2012 and currently about to be accepted among the digital resources of ILIESI. The first part of the essay explains the reasons for the project and its aims, the methodology initially adopted, the first results obtained, the turning point in 2018 with the fine-tuning of the methodology currently in use, and the most recent results. The second part introduces the transition of the project to the web, describing the features of the platform that will host it in terms of content and functions, and reflecting on possible future developments. The project is conceived to overcome the lack of specific tools for the translation of religious and philosophical-theological vocabulary from Church Slavonic to Italian and the difficulties posed by the transition from one culture to another. The research, which does not disregard existing vocabularies, is mainly based on the resources offered by corpora.
This contribution presents the access routes to spoken-language corpora developed in the ZuMult project ("Zugänge zu multimodalen Korpora gesprochener Sprache", i.e. access to multimodal corpora of spoken language; Leibniz-Institut für Deutsche Sprache (IDS) Mannheim / Herder-Institut, Univ. Leipzig / Hamburger Zentrum für Sprachkorpora (HZSK), Univ. Hamburg). These are digital applications created specifically for language teachers, to give them access to authentic spoken-language data that matches their needs as closely as possible. The new access options are tailored in particular to the needs of language teaching. Conversation examples from the spoken corpora FOLK and GWSS can now be selected for classroom treatment on the basis of difficulty-related parameters (such as vocabulary level, closeness to or distance from the standard, speech rate, and the share of typically spoken phenomena). In addition, ways of working with an individual transcript are offered that are oriented towards language-teaching usage scenarios. Using one such scenario, the contribution shows concretely how the digital applications developed in ZuMult can be used.
Using language corpora and surveys, we analysed 200 paremiological expressions from four different sources. We were interested in the current usage of these expressions (and their variants) and in how far their meaning can be determined from their context. We present the list of the 200 analysed units together with the results of the analysis and an assessment of the quality of the sources from which this paremiological material was drawn. Units with demonstrated current usage and an established meaning will be included in the first edition of the growing online Slovar pregovorov in sorodnih paremioloških izrazov (Dictionary of Proverbs and Related Paremiological Expressions).
In the field of German linguistics, several important contributions focusing on the value of spoken language corpora for language teaching have been published in recent years (e.g. Costa 2008, Römer 2008, Paschke 2018, Günthner/Schopf/Weidner 2021, Fandrych/Meißner/Wallner 2021). Taking these theoretical approaches as a starting point, the contribution highlights the often neglected potential that spoken language corpora offer for DaF teaching in secondary schools and shows how they can be used as a database for designing thematic CLIL content. To this end, it proposes some language teaching applications taken from the corpus Fluchtgeschichten aus Ostpreußen (FGOP), edited by Lucia Cinato and available on the Datenbank für Gesprochenes Deutsch (DGD) platform of the Institut für Deutsche Sprache (IDS) in Mannheim.
On the semiotic diversity of language
Lepeut, Alysson; Beukeleers, Inez
Belgian journal of linguistics, 12/2022, Volume 36, Issue 1
Journal Article
Peer reviewed
Abstract
Language is complex in many respects. When conceived as a system that is to be analysed at all levels of linguistic structure, it is interpreted as a static and abstract phenomenon in which the rules are disconnected from their context of use. However, the ability to do language, construed as a fundamentally social practice grounded in our in situ face-to-face interactions, does not exclusively rely on knowing the rules that govern the grammatical principles in a given language, nor does it limit itself to understanding the lexical content of utterances. Language is more than that; it is fundamentally social and inherently multimodal in that it enables all humans to create, express, and construe meaningful utterances through their bodies. For a long time, however, linguistic theories have neglected to consider the diverse and rich ways humans do language using their bodies. In this introduction, particular attention is paid to the different roles the body plays across a range of distinct sign languages and contexts. In that respect, a short historical detour into the evolutive stages of sign language research is provided first. Next, the aims and the different contributions of this volume are outlined. Finally, some conclusions are drawn.
This article discusses a pilot project aimed at giving tertiary students a wider repertoire of resources to use in language learning, with a particular focus on Italian. This project responds to the exponential increase in and access to online data and the potential value such data represent for students studying additional languages at tertiary level. By examining whether current language students are aware of online resources, such as linguistic corpora and other potential applications of big data, we aim to provide an insight into the possible uses of corpus-assisted learning in the language classroom. In this paper, we detail a project undertaken in 2017 with undergraduate students of Italian in a major metropolitan university. Our project directed students to complete a translation task using corpora-based resources and assessed their experience through a post-assessment survey. Subsequently, we present our initial findings in relation to the possibilities of a corpus-based approach to language teaching and learning. While today’s students are already predisposed to relying on online resources as part of their language studies, our results suggest students are not aware of emerging online resources such as corpora. Moreover, even when these resources are presented to students, the complex nature of the software programs used to interrogate corpora often results in their underutilisation.
Children's language development can reflect the developmental process of children's cognition and social emotions. The present study focuses on preschool children's conversational competence and narrative competence, aiming to explore developmental features of preschool children's pragmatic competence by analyzing data from self-built oral language corpora using the INCA-A coding system and CLAN. Results show that with respect to using pragmatic indicators, frequency increases significantly with age in the dimensions of knowledge and language. In the language dimension, frequency is highest in the middle group and in the behavior dimension, frequency gradually decreases. Results are taken to indicate that preschool children's language competence improves continually, and gradually displaces the use of behavior-assisted language. Meanwhile, children can initiate conversation or discourse narration by using knowledge and language. Based on the findings, this study analyzes the development of preschool children's pragmatic competence at different ages within the framework of perlocutionary acts. Finally, corresponding educational suggestions are offered.
In this paper, we present an overview of freely available web applications providing online access to spoken language corpora. We explore and discuss various solutions with which the corpus providers and corpus platform developers address the needs of researchers who are working with spoken language. The paper aims to contribute to the long-overdue exchange and discussion of methods and best practices in the design of online access to spoken language corpora.
This data article presents a dataset for Siswati, a Bantu language of the Nguni group that is one of the eleven official South African languages and the official language of Eswatini (together with English). The dataset contains parallel textual data between English and Siswati as well as monolingual data for Siswati and was developed for use as training data for machine translation systems, specifically the Autshumato machine translation project. Both corpora can also be used for development and evaluation of Natural Language Processing (NLP) core technologies for Siswati. In addition, the data lends itself to corpus linguistic studies. The article describes how the data was collected, what types of texts it contains, and what clean-up was done. It also provides an overview of the number of words contained in the datasets.