Abstract
Light verb constructions (LVCs) are verb and noun combinations in which the verb has lost its meaning to some degree and the noun is used in one of its original senses, typically denoting an ...event or an action. They exhibit special linguistic features, especially when regarded in a multilingual context. In this paper, we focus on the automatic detection of LVCs in raw text in four different languages, namely, English, German, Spanish, and Hungarian. First, we analyze the characteristics of LVCs from a linguistic point of view based on parallel corpus data. Then, we provide a standardized (i.e., language-independent) representation of LVCs that can be used in machine learning experiments. After, we experiment on identifying LVCs in different languages: we exploit language adaptation techniques which demonstrate that data from an additional language can be successfully employed in improving the performance of supervised LVC detection for a given language. As there are several annotated corpora from several domains in the case of English and Hungarian, we also investigate the effect of simple domain adaptation techniques to reduce the gap between domains. Furthermore, we combine domain adaptation techniques with language adaptation techniques for these two languages. Our results show that both out-domain and additional language data can improve performance. We believe that our language adaptation method may have practical implications in several fields of natural language processing, especially in machine translation.
Examines how violence was described and evaluated in the foundational texts of Islam. How was violence justified in early Islam? What role did violent actions play in the formation and maintenance of ...the Muslim political order? How did Muslim thinkers view the origins and acceptability of violence? These questions are addressed by an international range of eminent authors through both general accounts of types of violence and detailed case studies of violent acts drawn from the early Islamic sources. Violence is understood widely, to include jihad, state repressions and rebellions, and also more personally directed violence against victims (women, animals, children, slaves) and criminals. By understanding the early development of Muslim thinking around violence, our comprehension of subsequent trends in Islamic thought, during the medieval period and up to the modern day, become clearer.
Key Features: Examines the portrayal of violence in a variety of different intellectual contexts
* Takes a broad understanding of violence - from warfare between Muslims (and between Muslims and others) to individual acts of violence
* Enables a better informed debate about the nature of violence in early Islam
* Includes contributions from leading international experts including Michael Cooperson, Maribel Fierro, Geert Jan van Gelder, Christopher Melchert, John Nawas, Andrew Rippin and Dominique Urvoy
We present a web mining system that clusters persons sharing the same name and also extracts bibliographical information about them. The input of our system is the result of web search engine queries ...in English or in Hungarian. For system evaluation in English, our system (RGAI) participated in the third Web People Search Task challenge 1. The chief characteristics of our approach compared to the others are that we focus on the raw textual parts of the web pages instead of the structured parts, we group similar attribute classes together and we explicitly handle their interdependencies. The RGAI system achieved top results on the person attribute extraction subtask, and average results on the person clustering subtask. Following the shared task annotation principles, we also manually constructed a Hungarian person disambiguation corpus and adapted our system from English to Hungarian. We present experimental results on this as well.
Light verb constructions consist of a verbal and a nominal component, where the noun preserves its original meaning while the verb has lost it (to some degree). They are syntactically flexible and ...their meaning can only be partially computed on the basis of the meaning of their parts, thus they require special treatment in natural language processing. For this purpose, the first step is to identify light verb constructions.
In this study, we present our conditional random fields-based tool—called FXTagger—for identifying light verb constructions. The flexibility of the tool is demonstrated on two, typologically different, languages, namely, English and Hungarian. As earlier studies labeled different linguistic phenomena as light verb constructions, we first present a linguistics-based classification of light verb constructions and then show that FXTagger is able to identify different classes of light verb constructions in both languages.
Different types of texts may contain different types of light verb constructions; moreover, the frequency of light verb constructions may differ from domain to domain. Hence we focus on the portability of models trained on different corpora, and we also investigate the effect of simple domain adaptation techniques to reduce the gap between the domains. Our results show that in spite of domain specificities, out-domain data can also contribute to the successful LVC detection in all domains.
Precision oncology is currently based on pairing molecularly targeted agents (MTA) to predefined single driver genes or biomarkers. Each tumor harbors a combination of a large number of potential ...genetic alterations of multiple driver genes in a complex system that limits the potential of this approach. We have developed an artificial intelligence (AI)-assisted computational method, the digital drug-assignment (DDA) system, to prioritize potential MTAs for each cancer patient based on the complex individual molecular profile of their tumor. We analyzed the clinical benefit of the DDA system on the molecular and clinical outcome data of patients treated in the SHIVA01 precision oncology clinical trial with MTAs matched to individual genetic alterations or biomarkers of their tumor. We found that the DDA score assigned to MTAs was significantly higher in patients experiencing disease control than in patients with progressive disease (1523 versus 580, P = 0.037). The median PFS was also significantly longer in patients receiving MTAs with high (1000+ <) than with low (<0) DDA scores (3.95 versus 1.95 months, P = 0.044). Our results indicate that AI-based systems, like DDA, are promising new tools for oncologists to improve the clinical benefit of precision oncology.
In this paper, we focus on various methods for detecting verbal collocations, i.e. verb-particle constructions and light verb constructions in Wikipedia articles. Our results suggest that for ...verb-particle constructions, POS-tagging and restriction on the particle seem to yield the best result whereas the combination of POS-tagging, syntactic information and restrictions on the nominal and verbal component have the most beneficial effect on identifying light verb constructions. The identification of multiword semantic units can be successfully exploited in several applications in the fields of machine translation or information extraction.
In this paper we present a slightly modified machine learning approach for text classification working exclusively from positive and unlabeled samples. Our method can assure that the positive class ...is not underrepresented during the iterative training process and it can achieve 30% better F-value when the amount of positive examples is low.
G. Richter, F. Gabrieli, I. c Abbās and M. Zakeri conducted fundamental research on Al-Adab al-ṣaġīr which is attributed to Ibn al-Muqaffac. They concluded that this work is spurious. This view is ...generally accepted by most of the scholars. The primary objective of the first section of this article is to add some comments and information to their research, as well as to argue that Al-Adab al-ṣaġīr is very likely an authentic work of Ibn al-Muqaffac. It is plausible that he compiled Pahlavi texts which he translated, and prefaced this compilation with an introduction. In the second section of this article I will discuss the question of the titles of some of Ibn al-Muqaffa c ,s compositions, such as the Kitāb al-ādāb al-kabīr, Al-Adab al-ṣaġīr, Al-Yatīma, and the Polemic against Islam. This article is the second of a series on Ibn al-Muqaffa c ,s oeuvre. The first article of this series La Lumière et les Ténèbres dans l'œuvre d'Ibn al-Muqaffa c (Light and Darkness in Ibn al-Muqaffa c ) was published in a previous volume of the AOH (61/3). The next article of this series, "Reason, Religion and Power in Ibn al-Muqaffa c ,,, will be published in one of the next issues.