This article investigates the use and non-use of objects with six transitive verbs in a corpus of English football match reports. The verbs were selected on the basis of their frequency as well as ...their lexico-grammatical features of "footballness" and transitivity. The study suggests that object omission may not be as pervasive as hinted at in previous studies (e.g. Bergh and Ohlander 2016; Ruppenhofer and Michaelis 2010). Regarding potential reasons for object omission, it is uncovered that the football verbs-net, save, play-are more prone to object omission than the general verbs: feed, create, take. This is attributed to the strong attraction of the former to recurrent collocates such as goal and ball. This suggests that verbs used to report on unremarkable and canonical situations (to the game of football) more readily omit the object, albeit not on a general basis, as individual differences between the verbs also emerge.
Characteristics of Malay translated hadith corpus Sazali, Siti Syakirah; Rahman, Nurazzah Abdul; Bakar, Zainab Abu
Journal of King Saud University. Computer and information sciences,
20/May , Letnik:
34, Številka:
5
Journal Article
Recenzirano
Odprti dostop
Annotated corpus can greatly assist in the natural language processing field. For example, computers can understand more of the document context, and indexing and clustering in information retrieval ...can be done precisely with less or no ambiguity of words. However, there are only a few annotated corpora in Malay language, which are not publicly shared. In this paper, we delve into analysing and annotating Malay translated hadith documents in terms of tagging and entities. There are three phases, which are manual filtering and cleaning, analysing the corpus and creating the benchmark. As the result, an analysis and benchmark of Malay translated hadith corpus were produced in term of part-of-speech and named entities tags that follows the Zipf’s law distribution.
Investigating agency has become a pivotal issue in discourse studies, especially organizational discourse. This study aims to identify the controlling agency (who/what) behind G20 leaders' ...declarations in the last decade and how such agency is constructed. To this end, this study offers a concise examination of relevant literature investigating fundamental concepts like discourse and agency in light of the overlapping relationship between form and function in language studies. Further, an eclectic methodological approach has been devised to arrive at a multi-leveled analysis. Two stages of analysis were designed. First, a corpus of the declarations between 2012 and 2021 was created and analyzed using #LancsBox v.6.x. and Wmatrix. At this stage, we was established as a prime agent in the corpus and proven to collocate heavily with agentive speech acts. One sample declaration, Riyadh 2020, was used for minute discourse analysis in the second stage. Inspired by transitivity system, process type analysis, and multivalence frameworks, this stage revealed the profound presence of non-human agency alongside the human one. Nevertheless, further examination demonstrated that this sample still constrains non-human agency due to semantic and textual constraints.
G20; Discourse analysis; Corpus linguistic; Speech acts; Process types; Agency; #LancsBox
Scholars in discourse studies have defined legitimation as the justification (and critique) of powerful institutions and their practices. In moments of crisis, legitimation tactics often shift. This ...article considers how such shifts are incited by unauthorized information leaks. Leaks, I argue, constitute freshly available texts that reveal privileged institutional information presented in a specialized rhetorical style. To explore how leaks are harnessed by institutional critics, I examine the 2013 Snowden/National Security Agency (NSA) crisis. Combining corpus analysis with discourse analysis, I explore how Snowden’s NSA leaks affected the online writing of the American Civil Liberties Union (ACLU). I also consider overlaps between the rhetorical patterns in the leaked NSA documents and those in the ACLU’s post-leaks writing. Findings from my analysis of legitimation and style categories suggest that, prior to the leaks, ACLU writers primarily used a character- and narrative-based style to delegitimize the NSA’s policies as illegal and secretive, and to push for their reform. After the leaks, though, the ACLU mainly used an informationally dense style rife with academic terms and vocabularies of strategic action, portraying NSA surveillance as massive and complex. As the documents moved from the NSA’s secret, technical discourses to public, critical discourses, the latter came to resemble the former rhetorically. These findings raise crucial questions about how critics can make use of leaks without necessarily relegitimizing institutional power.
Corpus linguistic is an area which focuses on a set of procedures or methods for studying language and is not directly about the study of any particular aspect of language. Corpus-based approach is ...an effective way to research many areas of linguistics, which offers empirical data to the researchers and makes the research more convincing. The paper makes use of corpus and extracts the real data from CCL Corpus (Center for Chinese Linguistics Corpus) to investigate LIFE metaphors of Cognitive linguistics.
La Reingenieria de Corpus es el proceso de reconfiguration de muestras de habla para su reutilización en diversos ambitos (Guirado 2014, 2015, 2018). Esta investigación propone crear un subcorpus a ...partir de la reingenieria de corpus orales de tres ciudades venezolanas (Maracaibo, Caracas y Merida), para realizar estudios dialectales en una microdiacronia del siglo XX. La metodologia para el rediseño incluyo: i. describir la estruduración de los corpus originales; ii. evaluar los aspectos estables e inoperantes; Mi. crear el nuevo diseño y estimar su representatividad. Como resultado se obtuvo el CORDIA-VEN-sXX, el cual consta de 144 entrevistas, distribuidas equitativamente segun metropoMs, edad, nivel socioeconómico y sexo. En conclusión, la Reingenieria de Corpus resulta una práctica eficaz dentro de la Linguistica de Corpus para la diversificación de algunos materiales en nuevos contextos interpretativos.
This paper explores how metaphors in Dante's Commedia can be identified, classified and organized in a database so as to provide thorough and solid data to aid their understanding. All metaphors in ...the poem are first identified through a linguistic procedure called MIP (i.e. Metaphor Identification Process), based on the comparison between the contextual and the basic meaning of each word-unit. The metaphors thus identified are later classified based on syntactic, semantic and rhetorical structure criteria, taking into account both medieval explanations of figurative language and the features of Dante's metaphors that have always struck the readers of his poem. The findings from this combined linguistic and stylistic analysis provide evidence to support tenets put forward by some literary scholars (e.g. that metaphors increase in number and complexity throughout the poem) and show that metaphors converge in sections of political and religious invective, where one of the main purposes of the author is to elevate his style. The study suggests that a systematic multi-dimensional analysis of metaphor can shed light on its conceptual importance in a text and raise awareness of all its stylistic traits.
This volume presents results of bilateral research project BeSyMPHONic (ÖAW/Univ. Toulouse) funded by ANR & FWF. Differences between the two languages with respect to the processing of morphonotactic ...(MPH) vs. phonotactic (PH) consonant clusters are shown for the first time, the linguistically challenging claim that differences between MPH and PH are also realized phonetically is refuted, and the importance of the relative morphological richness of a language is illustrated.
Der Band zeigt Ergebnisse des von ANR & FWF geförderten, bilateralen Forschungsprojekts BeSyMPHONic (ÖAW/Univ. Toulouse). Unterschiede zwischen beiden Sprachen in Bezug auf die Verarbeitung morphonotaktischer (MPH) vs. phonotaktischer (PH) Konsonantengruppen werden erstmalig aufgezeigt, die sprachtheoretisch herausfordernde Behauptung, dass Unterschiede zwischen MPH und PH auch phonetisch realisiert werden, widerlegt, und die Wichtigkeit des relativen morphologischen Reichtums einer Sprache veranschaulicht.