U radu je dan pregled područja povezanog s procesiranjem prirodnih jezika i njihova međusobnog odnosa, počevši od šire domene kao što je umjetna inteligencija, putem strojnog učenja, računalne ...lingvistike, metoda strojnog prevođenja te posebice onih zasnovanim na dubokom učenju. Opisane su karakteristike, primjene, faze i glavni problemi obrade prirodnih jezika s leksičke, sintaktičke, semantičke, govorne i pragmatičke perspektive. Opisane su faze prepoznavanja i analize prirodnog jezika kao i faza generiranja prirodnih jezika. Postupci pre-editinga i post-editinga uz korištenje kontroliranih prirodnih jezika dani su kao primjeri prakse kojom se povećava točnost i kvaliteta automatskog prevođenja i općenito procesiranja teksta. Poseban je fokus stavljen na strojno prevođenje te metode strojnog prevođenja. Pristupi strojnom prevođenju kao statistički, temeljen na pravilima, hibridni i pristup temeljen na dubokom učenju opisani su i predstavljeni s obzirom na njihove prednosti i nedostatke i prikladnu primjenu u praksi. Na kraju su dani još uvijek neriješeni izazovi kao smjer daljnjih istraživanja vezanih uz obradu prirodnih jezika te značaj razvoja pristupa temeljenog na dubokom učenju.
The paper provides an overview of areas related to the processing of natural languages and their interrelationships, starting from a broader
domain such as artificial intelligence, through machine learning, computational linguistics, machine translation methods and especially those based on deep learning. The characteristics, applications, phases and main problems of natural language processing from the lexical, syntactic, semantic, speech and pragmatic perspective
are described. The phases of natural language recognition and analysis
as well as the natural language generation phase are described. Pre-editing and post-editing procedures using controlled natural languages are given as examples of practices that increase the accuracy
and quality of automatic translation and text processing in general. Special focus is given to machine translation and machine translation methods. Approaches to machine translation as statistical, rule-based, example-based, hybrid and deep learning-based approach are described and discussed with regard to their advantages and disadvantages including appropriate application in practice. In the end, still unresolved challenges are given as a direction of future research related to natural language processing and the importance of further development of a deep learning-based approach.
This book explains the concept, framework, implementation, and evaluation of controlled document authoring in this age of translation technologies. Machine translation (MT) is routinely used in many ...situations, by companies, governments, and individuals. Despite recent advances, MT tools are still known to be imperfect, sometimes producing critical errors. To enhance the performance of MT, researchers and language practitioners have developed controlled languages that impose restrictions on the form or length of the source-language text. However, a fundamental, persisting problem is that both current MT systems and controlled languages deal only with the sentence as the unit of processing. To be effective, controlled languages must be contextualised at the document level, consequently enabling MT to generate outputs appropriate for their functional context within the target document. With a specific focus on Japanese municipal documents, this book establishes a framework for controlled document authoring by integrating various research strands including document formalisation, controlled language, and terminology management. It then presents the development and evaluation of an authoring support system, MuTUAL, that is designed to help non-professional writers create well-organised documents that are both readable and translatable. The book provides useful insights for researchers and practitioners interested in translation technology, technical writing, and natural language processing applications.
In the article I compare expressions of translational motion in Finnish texts and their Czech translation. The semantic analysis of verbs of motion is based on Talmy (2001, 2003) and I establish ...which components of a motion event (path, direction, goal, figure, manner) are essential when expressing motion events in the two languages and by what means they can be expressed. I also compare the results of my analysis with Talmyan typology of satellite-framed and verb-framed languages. I conclude that in Finnish many frequent motion verbs express aspects of path together with adverbial expressions, whereas in Czech verbs express manner more often than in Finnish. Finnish and Czech also use differing anchorings of the direction of motion - in Czech the direction is anchored with respect to explicitly mentioned surroundings; in Finnish deictic motion verbs are in frequent use and also several means of anchoring may be used in one clause (for example by a deictic motion verb and an adverbial or by several adverbials).
The noun shimu (师母 ‘ teacher’s wife’) is defined in Chinese as a so-called honorific, i.e., chengweiyu (称谓语), one of many honorifics used in academic and educational contexts. We analyze this word ...through Chinese philology (wenzixue, 文字学), that is, the science of written language and characters, and through anthropological linguistics. The use of honorifics in Chinese culture is based on the Chinese concept of politeness, Confucian philosophy, and the understanding of hierarchy in Chinese society. In the end, we give a brief overview of the need to learn and acquire certain pragmatic (as well as anthropolinguistic) information about the honorific shimu (师母) when learning Chinese as a foreign language (some of this information is applicable in the use of other honorifics in the Chinese language). We also provide a short review of how this honorific is viewed in scholarship on Chinese culture, traditionally, and in modern Chinese society (especially in the context of the use of gender-sensitive language), as well as a short analysis of the frequency of use of the honorific shimu, based on examples from BCC, a contemporary Chinese corpus developed by the Beijing Language and Culture University.
U radu se analiziraju teorijsko-metodološke osnove suvremenoga pristupa nastavi stranoga jezika struke. Brojni utjecaji koji su izgradili taj pristup sagledani su u meðupovezanosti sinkronijske i ...dijakronijske razine u svjetlu smjene dviju znanstvenih paradigmi koje su obilježile 20. i 21. stoljeće: strukturalizma i kognitivizma. Znanje o teorijskoj i metodološkoj pozadini suvremene nastave u ovome području tvori siguran oslonac u suočavanju s izazovima ove zahtjevne profesije kao i za kritičko i promišljeno vrednovanje novih trendova i njihovo integriranje u nastavu u okolnostima brzih društvenih promjena suvremenoga tehnološkog doba. Cilj je rada analizirati složenu mrežu utjecaja koji utiru evolucijski put ove discipline prema suvremenosti ne zanemarujući pritom i najsuvremenije trendove koji ju neprestano oblikuju i pridonose njezinome daljnjem razvoju. Rezultati analize pokazuju da je suvremeni pristup nastavi u ovome području sinteza bogatoga nasljeða društveno-povijesnih okolnosti i interdisciplinarnih utjecaja izmeðu kojih ključnu ulogu imaju kognitivna psihologija i kognitivna lingvistika. Rad argumentira primjenjivost temeljnih kognitivno-lingvističkih postavki na teorijska istraživanja jezika raznih struka kao i njihovih nazivlja te njihovo integriranje u nastavni proces kao osnove budućega razvoja.
Plato's Cratylus is a brilliant but enigmatic dialogue. It bears on a topic, the relation of language to knowledge, which has never ceased to be of central philosophical importance, but tackles it in ...ways which at times look alien to us. In this reappraisal of the dialogue, Professor Sedley argues that the etymologies which take up well over half of it are not an embarrassing lapse or semi-private joke on Plato's part. On the contrary, if taken seriously as they should be, they are the key to understanding both the dialogue itself and Plato's linguistic philosophy more broadly. The book's main argument is so formulated as to be intelligible to readers with no knowledge of Greek, and will have a significant impact both on the study of Plato and on the history of linguistic thought.
Publikuojamas amerikiečių etnografo, antropologo ir lingvisto F. Boaso straipsnio „Kai kurios visuomenės mokslų metodologijos problemos“ vertimas. Autorius aptaria visuomenės mokslų metodologijos ...problemas, su kuriomis jis susiduria kaip antropologas. Teigiama, kad individą galima suprasti tik kaip dalį visuomenės, kuriai jis priklauso, ir kad visuomenę galima suprasti tik ją sudarančių individų tarpusavio santykių pagrindu. Visuomenės mokslų problemos susijusios su pavienių individų ir jų grupių reakcijų į išorinius stimulus formomis, su jų tarpusavio sąveikomis ir su šių procesų sukurtomis socialinėmis formomis. Nagrinėjami gyvulio ir žmogaus socialinio elgesio skirtumai, primityvios kultūros sričių sąveika, kultūros santykis su geografinėmis sąlygomis. Autorius kritikuoja bandymus susieti socialinę organizaciją ir ekonomines sąlygas bei aiškinimą, kad kultūros formos priklauso vien nuo ekonominių sąlygų. Mėginimai suformuluoti bendruosius kultūros integracijos dėsnius neduoda reikšmingų rezultatų. Bandymai nustatyti įvairių kultūros pusių santykius reiškia, kad būtinai reikia tirti jų ryšių dinamiką.
Straipsnis skirtas vokiečių kilmės amerikiečių etnografo, antropologo, lingvisto F. Boaso gyvenimui ir kūrybai aptarti. Aprašoma jo mokslinė veikla, priklausymas įvairioms mokslinėms organizacijoms, ...jų steigimas, vadovavimas joms. Antropologija F. Boasui yra bendras mokslas apie žmogų, apimąs etnografiją, fizinę antropologiją ir lingvistiką. Visose antropologijos šakose jis paliko reikšmingų tyrinėjimų. F. Boasas parengė nemažai eskimų ir indėnų legendų bei mitų rinkinių ir juos išvertė. Šis antropologas yra vienas iš aprašomosios, arba deskriptyvinės, lingvistikos pradininkų, JAV kultūrinės antropologijos istorinės mokyklos įkūrėjas ir ilgametis jos vadovas. Jis teigė, kad tiriant ikiklasinių visuomenių kultūrą, reikia remtis dviem principais: visų rasių ir kultūrų žmonėms būdingu pagrindinių psichologinių procesų panašumu; į bet kokią kultūrą turi būti žiūrima kaip į istorinį kitimo rezultatą. F. Boasas vietoj lyginamojo metodo pasiūlė savo istorinio tyrimo metodą, kuriuo nustatomos kultūros reiškinių atsiradimo konkrečios sąlygos ir naudojamas kontroliuojamasis palyginimas.
Multilingual Speech Processing Tanja Schultz, Katrin Kirchhoff / Tanja Schultz, Katrin Kirchhoff
2006, 2006-04-21T00:00:00, 2006-12-31
eBook
Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech processing from a multilingual perspective. By taking this all- inclusive approach to speech processing, the ...editors have included theories, algorithms, and techniques that are required to support spoken input and output in a large variety of languages. Multilingual Speech Processing presents a comprehensive introduction to research problems and solutions, both from a theoretical as well as a practical perspective, and highlights technology that incorporates the increasing necessity for multilingual applications in our global community. Current challenges of speech processing and the feasibility of sharing data and system components across different languages guide contributors in their discussions of trends, prognoses and open research issues. This includes automatic speech recognition and speech synthesis, but also speech-to-speech translation, dialog systems, automatic language identification, and handling non-native speech. The book is complemented by an overview of multilingual resources, important research trends, and actual speech processing systems that are being deployed in multilingual human-human and human-machine interfaces. Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives. * State-of-the-art research with a global perspective by authors from the USA, Asia, Europe, and South Africa * The only comprehensive introduction to multilingual speech processing currently available * Detailed presentation of technological advances integral to security, financial, cellular and commercial applications