Metadiscourse refers to the linguistic elements that writers use to communicate meanings to imagined readers and to express a viewpoint as members of a particular academic community. Accordingly, this study reports the distribution of interactive and interactional metadiscourse markers in a corpus of 99 research articles representing the English Language, Computer Sciences, and Education disciplines. To observe the writers' use of metadiscourse devices in their discourse community, the metadiscourse taxonomy of Hyland (Metadiscourse: exploring interaction in writing. Continuum, New York, 2005) was employed. The data were analysed through descriptive statistics, the chi-square test, the Kruskal–Wallis test, and content analysis. The data revealed that although articles in all disciplines employed both interactive and interactional metadiscourse markers, English Language articles contained the most metadiscourse devices compared with Education and Computer Sciences articles. It was also observed that the book review writers used many more interactive markers, such as transition and evidential devices, than interactional markers. Among interactional markers, however, self-mention markers were used extensively. The data also indicated a statistically significant difference across disciplines in the use of interactive and interactional metadiscourse devices. These findings imply that academic writing teachers should focus on discipline-oriented metadiscourse devices when teaching academic writing skills.
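As an illustration of the chi-square comparison across disciplines mentioned in the abstract above, the following is a minimal sketch in plain Python. The function name `chi_square` and the counts in the usage note are invented for demonstration; they are not the study's data, and the abstract does not say how the authors ran the test.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a contingency table.

    `table` is a list of rows, e.g. one row per discipline and one column per
    marker category (interactive, interactional). Hypothetical helper.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of row and column variables.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof
```

With an invented 2 x 2 table such as `[[10, 20], [20, 10]]`, every expected count is 15, so the statistic is 4 x 25/15 with one degree of freedom. Converting the statistic to a p-value requires the chi-square distribution, which a statistics library would normally supply.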
This article investigates the use and non-use of objects with six transitive verbs in a corpus of English football match reports. The verbs were selected on the basis of their frequency as well as their lexico-grammatical features of "footballness" and transitivity. The study suggests that object omission may not be as pervasive as hinted at in previous studies (e.g. Bergh and Ohlander 2016; Ruppenhofer and Michaelis 2010). Regarding potential reasons for object omission, the study finds that the football verbs (net, save, play) are more prone to object omission than the general verbs (feed, create, take). This is attributed to the strong attraction of the former to recurrent collocates such as goal and ball. The finding suggests that verbs used to report on situations that are unremarkable and canonical in the game of football omit the object more readily, albeit not across the board, as individual differences between the verbs also emerge.
Characteristics of Malay translated hadith corpus
Sazali, Siti Syakirah; Rahman, Nurazzah Abdul; Bakar, Zainab Abu
Journal of King Saud University. Computer and information sciences, 20/May, Volume: 34, Issue: 5
Journal Article
Peer reviewed
Open access
Annotated corpora can greatly assist the field of natural language processing. For example, computers can understand more of a document's context, and indexing and clustering in information retrieval can be done precisely, with little or no word ambiguity. However, only a few annotated corpora exist for the Malay language, and they are not publicly shared. In this paper, we analyse and annotate Malay translated hadith documents in terms of tagging and entities. The work has three phases: manual filtering and cleaning, analysing the corpus, and creating the benchmark. As a result, an analysis and a benchmark of the Malay translated hadith corpus were produced in terms of part-of-speech and named-entity tags, whose frequencies follow a Zipf's law distribution.
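One common way to check whether tag frequencies follow Zipf's law is to fit a straight line to log(frequency) against log(rank); a slope near -1 is consistent with the law. The sketch below assumes this least-squares approach, which the abstract does not confirm as the authors' method. The function name `zipf_slope` and the tag labels in the usage note are hypothetical.

```python
import math
from collections import Counter

def zipf_slope(tags):
    """Least-squares slope of log(frequency) versus log(rank) for a tag sequence.

    Requires at least two distinct tags. Hypothetical helper for illustration.
    """
    # Frequencies sorted from most to least common; rank 1 is the most frequent.
    counts = sorted(Counter(tags).values(), reverse=True)
    xs = [math.log(rank) for rank in range(1, len(counts) + 1)]
    ys = [math.log(count) for count in counts]

    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    return covariance / variance
```

For a toy sequence in which four invented tags occur 12, 6, 4, and 3 times (frequency proportional to 1/rank), the function returns a slope of -1 up to floating-point error. Real tag distributions have far more types and usually deviate from the ideal slope.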
Investigating agency has become a pivotal issue in discourse studies, especially in organizational discourse. This study aims to identify the controlling agency (who/what) behind G20 leaders' declarations in the last decade and how such agency is constructed. To this end, the study offers a concise examination of relevant literature on fundamental concepts such as discourse and agency in light of the overlapping relationship between form and function in language studies. Further, an eclectic methodological approach was devised to arrive at a multi-leveled analysis in two stages. First, a corpus of the declarations between 2012 and 2021 was created and analyzed using #LancsBox v.6.x. and Wmatrix. At this stage, the pronoun "we" was established as a prime agent in the corpus and shown to collocate heavily with agentive speech acts. In the second stage, one sample declaration, Riyadh 2020, was used for minute discourse analysis. Inspired by the transitivity system, process type analysis, and multivalence frameworks, this stage revealed the profound presence of non-human agency alongside human agency. Nevertheless, further examination demonstrated that this sample still constrains non-human agency due to semantic and textual constraints.
G20; Discourse analysis; Corpus linguistics; Speech acts; Process types; Agency; #LancsBox
Scholars in discourse studies have defined legitimation as the justification (and critique) of powerful institutions and their practices. In moments of crisis, legitimation tactics often shift. This article considers how such shifts are incited by unauthorized information leaks. Leaks, I argue, constitute freshly available texts that reveal privileged institutional information presented in a specialized rhetorical style. To explore how leaks are harnessed by institutional critics, I examine the 2013 Snowden/National Security Agency (NSA) crisis. Combining corpus analysis with discourse analysis, I explore how Snowden’s NSA leaks affected the online writing of the American Civil Liberties Union (ACLU). I also consider overlaps between the rhetorical patterns in the leaked NSA documents and those in the ACLU’s post-leaks writing. Findings from my analysis of legitimation and style categories suggest that, prior to the leaks, ACLU writers primarily used a character- and narrative-based style to delegitimize the NSA’s policies as illegal and secretive, and to push for their reform. After the leaks, though, the ACLU mainly used an informationally dense style rife with academic terms and vocabularies of strategic action, portraying NSA surveillance as massive and complex. As the documents moved from the NSA’s secret, technical discourses to public, critical discourses, the latter came to resemble the former rhetorically. These findings raise crucial questions about how critics can make use of leaks without necessarily relegitimizing institutional power.
Following the founding of the Real Academia Española at the beginning of the eighteenth century, various normative works began to be published with the aim of establishing clear patterns and rules for the written codification of the language. However, the reach of these reforms was limited, and processes of linguistic change and variation continued throughout that century. This study analyses several graphic-phonetic phenomena that reveal seseo pronunciations or confusion between liquid consonants. Its methodological basis is the data extracted from a set of eighteenth-century Andalusian inventories of goods in the Oralia diacrónica del español (ODE) corpus. On this basis, the study establishes the dialectal distribution and configuration of these phenomena in Andalusia during the Enlightenment century in the text type analysed.
This paper investigates a special type of recurrent expression, lexical bundles, defined as sequences of three or more words that co-occur frequently in a particular register (Biber et al., 1999). Considering the importance of this group of multi-word sequences in academic prose, this study explores the forms and syntactic structures of three- and four-word bundles in English abstracts written by Iranian scholars in three disciplines: business studies, history, and linguistics. Applying a frequency-based approach suggested by Biber et al. (1999), the data were drawn from a corpus of 660 English abstracts of research articles published in various leading academic journals. The structural characteristics of the lexical bundles were first explored through careful analysis and then classified using a modified version of Biber et al.’s (1999) structural framework. The study also compares the forms, frequencies, and structures of the lexical bundles in the corpus with those found in Hyland’s (2004) research article corpus. This comparison shows not only relatively large differences between the two corpora in the forms and frequencies of the bundles employed, but also some discrepancies in the distribution of the different structural types and tokens.
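A frequency-based approach to lexical bundles can be sketched as counting every n-word sequence and keeping those that pass a frequency cut-off and occur in a minimum number of texts. The sketch below is illustrative only: the function name `extract_bundles`, the whitespace tokenisation, and the threshold values are assumptions, not the cut-offs used in the study.

```python
from collections import Counter, defaultdict

def extract_bundles(texts, n=4, min_freq=2, min_texts=2):
    """Return n-word sequences that meet frequency and dispersion thresholds.

    `texts` is a list of strings (for example, abstracts). Thresholds here are
    placeholder values chosen for the toy example, not published cut-offs.
    """
    freq = Counter()            # total occurrences of each n-gram
    spread = defaultdict(set)   # ids of the texts in which each n-gram occurs

    for text_id, text in enumerate(texts):
        # Naive tokenisation: lowercase and split on whitespace (assumption).
        tokens = text.lower().split()
        for i in range(len(tokens) - n + 1):
            gram = tuple(tokens[i:i + n])
            freq[gram] += 1
            spread[gram].add(text_id)

    return {
        " ".join(gram): count
        for gram, count in freq.items()
        if count >= min_freq and len(spread[gram]) >= min_texts
    }
```

For three toy texts, "the results of the study show", "the results of the survey show", and "on the other hand", only the four-word sequence "the results of the" survives, with a count of 2. The dispersion check across texts keeps a single author's idiosyncratic phrasing from being counted as a bundle; corpus studies typically also normalise frequencies per million words, which this sketch omits.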
Corpus linguistics is an area that focuses on a set of procedures or methods for studying language rather than on any particular aspect of language. A corpus-based approach is an effective way to research many areas of linguistics: it offers researchers empirical data and makes the research more convincing. This paper uses a corpus-based approach, extracting authentic data from the CCL Corpus (Center for Chinese Linguistics Corpus), to investigate LIFE metaphors in cognitive linguistics.
Corpus Reengineering is the process of reconfiguring speech samples for reuse in various fields (Guirado 2014, 2015, 2018). This research proposes creating a subcorpus by reengineering oral corpora from three Venezuelan cities (Maracaibo, Caracas, and Mérida) in order to carry out dialectal studies within a microdiachrony of the twentieth century. The redesign methodology included: i. describing the structure of the original corpora; ii. evaluating their stable and inoperative aspects; iii. creating the new design and estimating its representativeness. The result was the CORDIA-VEN-sXX, which consists of 144 interviews distributed evenly by city, age, socioeconomic level, and sex. In conclusion, Corpus Reengineering proves to be an effective practice within Corpus Linguistics for diversifying some materials into new interpretive contexts.