V članku predstavimo primerjavo rabe treh tipično govorjenih diskurznih označevalcev v korpusu govorjene slovenščine Gos in korpusu slovenskih uporabniških spletnih vsebin Janes. Rezultati ...potrjujejo, da so ti izrazi na spletu rabljeni bistveno redkeje kot v spontanem govoru, vendarle pa njihova raba ni zanemarljiva, zlasti v besedilnih tipih s poudarjeno interaktivno oz. dialoško izmenjavo uporabniških sporočil. Pri tem se označevalci na spletu pojavljajo predvsem v semantično motiviranih funkcijah, kot so vzpostavljanje stika z naslovnikom, preverjanje strinjanja ali omiljevanje izrečenega, ki se hkrati prepletajo tudi z besedilnimi funkcijami poudarjanja ter menjave vlog. Prav tako na spletu razvijajo nekatere nove kontekste rabe, kot so nagovarjanje neznanega ali neudeleženega naslovnika, stilizacija in vstopanje v nove stalne besedne zveze.
Prispevek prikazuje postopke zbiranja in raziskovanja dveh primerljivih strokovnih korpusov. Njuna primerjava prikazuje poenostavljeno večdimenzionalno analizo jezikovnih sprememb s primerjavo list ...besed, ključnih besed in z opisom kolokacijskih in sintagmatskih vzorcev rabe besed. Za raziskavo uporabi brezplačne računalniške programe in utemelji rabo strokovnih korpusov kot ustrezno orodje za pomoč učiteljem tujega strokovnega jezika pri luščenju ključnega strokovnega in polstrokovnega besedišča.
From verbal to adjectival Paulsen, Geda; Tuulik, Maria; Lohk, Ahti ...
Slovenscina 2.0,
2022, Volume:
10, Issue:
1
Journal Article
Peer reviewed
Open access
This study addresses categorization issues related to adjective candidates in Estonian, focusing on the category of participles. The aim of the analysis was to assess the ranges of the prototypical ...adjective and to determine its degree of deviation on the prototypicality scale. The investigation was based on a group of validated adjectives – selected adjectives included in the Basic Estonian Dictionary – and two control groups of more and less lexicalized participles. We tested seven morphosyntactic corpus patterns characteristic of adjectives. The test patterns were based on the prototypical features of the adjective, as well as on observations made in the actual lexicographic analysis. To assess the sample words and determine the significance of the test patterns from the point of view of defining adjectivity, we used deviation analysis. The results of this study can be applied to establish a measure of adjectivity for lexicographic judgments when distinguishing, for instance, lexicalized participles from regular ones.
Predmet razprave so teoretsko-metodološka načela, ki so se razvijala v krogih t. i. novofirthijancev, kjer se od vsega začetka opredeljujejo za korpusno analizo, čim manj obremenjeno s predhodnimi ...jezikoslovnimi teorijami. V prispevku najprej pregledamo dela teh avtorjev, iz katerih izhajajo med drugim slovnica vzorcev (angl. pattern grammar), teorija leksikalnega proženja (angl. lexical priming), teorija konvencij in invencij (angl. theory of norms and exploitations) ter teorija kontekstne prozodije (angl. contextual prosodic theory). Nato povzamemo njihove zaledne predpostavke in stališča o jeziku kot rezultatih raziskovanja. Tako definiramo šest skupnih načel: predmet raziskovanja je jezikovna raba, tj. jezik v »kontekstu situacije«, raziskovalni fokus se obrne k temu, kar je običajno, raziskovalčeva intuicija je v vlogi evalvacije avtentičnih jezikovnih rab, jezikovne ravni (slovnica in slovar) so razumljene kot prepletene, na jezikovni sistem se gleda kot visoko dinamičen, v teoretskih temeljih pa zavzame eno osrednjih mest jezikovni vzorec. Nazadnje opozorimo tudi na nekatere omejitve, s katerimi se soočamo pri korpusnem pristopu.
The work presents the set of the 300 most well-known and most frequent proverbs, sayings and similar paremiological units in modern Slovene, its theoretical and methodological basis and also how it ...can be used in different phraseological and phraseographical tasks. The concept of the paremiological optimum was developed by Ďurčo and it combines the concept of the Permjakov’s paremiological minimum of the most well-known units with corpus-based and frequency-oriented analyses. 316 Slovene speakers were included in the sociolinguistic test and the top list of the 300 most well-known out of 918 units presented to them was additionally arranged according to their frequency in the FidaPLUS language corpus. Author presents different search procedures which he used in the language corpora. The paremiological optimum is modernized according to the most frequent form of each unit in the Gigafida language corpus. The comparative research of the Slovak and Slovene paremiology based on the paremiological optimum is presented in details. Author also describes how the data he gained can be used in the lexicography.
The goal of the monograph ('Web texts and language on the Web (the case of blogs and Wikipedia in the Slovenian language)') entitled Web texts and language on the Web (the case of blogs and Wikipedia ...in the Slovenian language) is to give an overview, as complete as possible, of the topic of web texts, although its main part is limited to blog and Wikipedia texts, where, as it turns out, there is a need for placing the topic into a broader context of electronic texts. The first chapter treats the circumstances of the formation of the Web and its definition in relation to the Internet and other electronic media. In the second chapter, corpus and dictionary are presented in relation to the Web, especially in terms of web corpora, the current role of web search engines is discussed, as well as the use of the Web in lexicography. The third and the largest part of the monograph includes a detailed analysis of Slovenian language and texts, especially of the selected material obtained from blogs and Wikipedia.