The paper describes the combined results of several projects which constitute a basic language resource infrastructure for printed historical Slovene. The IMP language resources consist of a digital ...library, an annotated corpus and a lexicon, which are interlinked and uniformly encoded following the Text Encoding Initiative Guidelines. The library holds about 650 units (mostly complete books) consisting of facsimiles with 45,000 pages as well as hand-corrected and structured transcriptions. The hand-annotated corpus has 300,000 tokens, where each word is tagged with its modernised word form, lemma, part-of-speech and, in cases of archaic words, its nearest contemporary equivalents. This information was extracted into the lexicon, which also covers an extended target-annotated corpus, resulting in 20,000 lemmas (of these 4,000 archaic) with 50,000 modern word forms and 70,000 attested forms. We have also developed a program to modernise, tag and lemmatise historical Slovene, and annotated the digital library with it, producing an automatically annotated corpus of 15 million words. To serve the humanities, the digital library and lexicon are available for reading and browsing on the web and the corpora via a concordancer. For language technology research and development the resources are available in source TEI XML under the Creative Commons Attribution licence. The paper presents the IMP resources, available from http://nl.ijs.si/imp/, the process of their compilation, encoding and dissemination, and concludes with directions for future research.
The Dictionary of Legal Terminology' shows the conceptual system of modern Slovenian law. In more than 10,000 dictionary entries, we present the terminology of contemporary Slovenian legal science, ...legal practice and legislation after 1991. The dictionary is primarily intended for legal experts, who practice various professions (including judges, lawyers, notaries, prosecutors as well as heads of HR and legal departments in companies). It is also intended for students of law and related sciences, in order to familiarize them with the conceptual system of Slovenian law in a professionally relevant manner. In order to use the dictionary efficiently at least a basic knowledge of law is required. This does not mean that journalists, translators, proof-readers and others who encounter legal terminology cannot use it. However, it should be borne in mind that definitions contained in a terminological dictionary are very concise and sufficiently informative only for an expert who is familiar with the conceptual system of a specific field of study, while other users are provided with a suitable starting point so they can find additional information if necessary.
The present monography entitled A Study of Slovenian Multi-Word Lexemes from a Lexicographical Perspective provides a typology of newer nominal multi-word lexemes with potential terminological ...meaning, based on the theory of Russian linguist N. M. Shanskiy which brings four precisely defined levels of semantic merger of their components. The analysis of individual components of multi-word lexemes from a morphological, syntactic and semantic perspective reveals typological tendencies of nominal multi-word lexemes which occurred especially in the last twenty years.The typology takes into account the causal relationship between the level of semantic transfer (of the components) of a multi-word lexeme on the one hand and the level of semantic merger of components of a multi-word lexeme and related lexicalization of a multi-word lexeme on the other hand, which is also reflected in the restricted collocability of their components. Lexicalized or non-lexicalized metaphorical and metonymic semantic transfers are contributing to the new semantic and syntactic combinations of individual words in the phrase, to the consequent integration of their meanings to form a single meaning of the multi-word lexeme and thus to a higher level of lexicalization and semantic firmness of a multi-word lexeme.
The article deals with the etymology of the Slovene place-name Trbiž (standard Slovene form), relating to a small settlement in the Canal Valley (Val Canale, North-Eastern Italy) in a traditionally ...multilingual area with people speaking Slovene, Friulian and Bavarian. The name’s contemporary dialectal variants are attested in the Gail Valley dialect and the Resian dialect of the Slovene language, as well as in the Carinthian dialects of Southern Bavarian (the contemporary Friulian form is a late adaptation of the Bavarian variant and thus irrelevant for the analysis). The extant Slovene dialectal forms lead to reconstruct a common Slovene prototype with a satisfactory Slavic etymology. However, for reasons of historical phonology, a Slavic starting point can hardly be the ultimate source of the Bavarian form, which makes it reasonable to conclude that both Slavic and Bavarian forms originate in a Romance (Old Friulian) place-name which must go back to some Pre-Romance source. We are thus probably dealing with an independent integration of the Romance source into the two superstratal linguistic systems in the region (viz. Slavic and Bavarian). In the context of the ethnic and linguistic history of the region, this scenario seems plausible, yet it is very difficult to establish the etymology of the postulated Pre-Romance form. The article provides a detailed analysis of its likely wordformation, which confirms a transparently Indo-European pedigree of the name and enables the author to isolate the stem that is most likely to be related to Celtic *taruo- ‘aurochs’. The author shows that the validity of this already familiar suggestion is etymologically feasible, the same etymon being also attested in the ancient place-name Taruisium in Northern Italy. Nevertheless, there exists a number of extralinguistic factors which prevent from considering this etymology as definitive.
V članku so prikazani odrazi praslovanskih naglasnih tipov samostalnikov a-jevske sklanjatve v govoru vasi Dolnje in Gorenje Mraševo, ki spada v dolenjsko narečno skupino slovenščine. Podan je tudi ...naglasoslovni opis navedenega govora s poudarkom na naboru oblik, v katerih se pojavlja odraz popraslovanskega novega cirkumfleksa. Namen prispevka je pojasniti naglasnomestne, kolikostne in tonemske značilnosti naglasnih tipov a-jevske sklanjatve v mraševskem govoru.