Este trabajo se inserta en el marco del proyecto VITALEX y en él ofrecemos los datos relativos al léxico agrícola pertenecientes al punto Gr601 que se corresponde conel pueblo de Trevélez. Realizamos ...un estudio de vitalidad léxica de dicho municipio por medio de una metodología contrastiva donde comparamos los datos de VITALEX con los del tomo I del ALEA. Además, podemos observar una relación interesante entre la transformación económica y social de la comarca de La Alpujarra con los porcentajes de vitalidad y mortandad del léxico agrícola. Se hace contrastando los resultados obtenidos por las tres generaciones que configuran VITALEX en relación con los datos del ALEA.
In this piece, we honor the work of Albert Costa. His work focused on how bilinguals manage two languages, the brain mechanisms involved, and the ways in which language and emotion are related. We ...end by discussing ways in which his work will frame research in the field going forward.
Nowadays, the spread of deceptive reviews is a problem that has reached critical dimensions, having a significant economic impact on business activities. This paper aims to estimate – at the ...quantitative and qualitative levels – the possibility of using particular words to disambiguate between truthful and deceptive text, focusing on reviews produced in the cultural heritage domain. For this purpose, a lexicon-based methodology has used two different lexicons: sentiment information, intensifiers, downtoners, and negation operators. As known in the literature, these elements are crucial in a classification process related to deceptiveness. The evaluation phase has considered quantitative metrics such as accuracy and F1 score and ad hoc developed metrics that consider specific linguistic parameters such as polarity and tone of voice intensifiers. A qualitative analysis of a subset of the corpus has also been carried out to understand better factors that impact the classification of deceptive review. Several linguistic features have been considered, ranging from the number of intensifiers to their type and position in phrases and sentences. A comparison between the performances of two different lexicons used has been added to the analysis.
•Lexicon-based approach for classifying Italian fake Cultural Heritage reviews.•A methodology based on intensifiers and downtoners from sentiment analysis.•New Italian Cultural Heritage Corpus for Deceptive Reviews Classification.•Comparison of manual and automatically created lexicons.•New metric considering polarity and linguistic features.
Upaya melakukan analisis emosi pada teks komentar mahasiswa dalam evaluasi pembelajaran sangat penting dilakukan. Komentar dalam kuesener umumnya tidak diolah, padahal data tersebut mengandung ...informasi dalam mengungkap emosi mahasiswa dalam proses pembelajaran. Untuk itu deteksi dan klasifikasi emosi pada opini mahasiswa dapat memperbaiki hasil kuesioner. Penelitian ini bertujuan menerapkan metode klasifikasi emosi pada teks komentar mahasiswa berbasis pada leksikon emosi dari NRC Emolex. Jenis emosi yang akan dideteksi adalah 8 jenis emosi, yaitu marah (Anger), antisipasi (anticipation), jijik (disgust), takut (fear), bahagia (joy), sedih (sadness), terkejut (surprise) dan yakin (trust) . Data diambil dari komentar dan saran mahasiswa pada kuesioner pada Universitas AKPRIND Indonesia tahun 2020-2022 sebanyak 4000 data yang telah dilabeli secara manual. Tujuan lain dari studi ini adalah melihat sejauh mana efektivitas leksikon emosi Emolex untuk klasifikasi emosi teks kuesioner akademis. Hasil penelitian menunjukkan rata-rata akurasi sebesar 56,2%. Dari yang diketahui label emosinya 3 prosentase tertinggi ada pada label Sadness (19,2%), Joy(16,7%) dan Fear (13,5%) yang masing-masing memiliki akurasi 72%, 68% dan 68%. Dari penelitian terungkap bahwa kinerja Emolex untuk klasifikasi emosi masih kurang memuaskan dan memerlukan pengembangan leksikon lebih jauh lagi.
Even though considerable attention has been given to the polarity of words (positive and negative) and the creation of large polarity lexicons, research in emotion analysis has had to rely on limited ...and small emotion lexicons. In this paper, we show how the combined strength and wisdom of the crowds can be used to generate a large, high‐quality, word–emotion and word–polarity association lexicon quickly and inexpensively. We enumerate the challenges in emotion annotation in a crowdsourcing scenario and propose solutions to address them. Most notably, in addition to questions about emotions associated with terms, we show how the inclusion of a word choice question can discourage malicious data entry, help to identify instances where the annotator may not be familiar with the target term (allowing us to reject such annotations), and help to obtain annotations at sense level (rather than at word level). We conducted experiments on how to formulate the emotion‐annotation questions, and show that asking if a term is associated with an emotion leads to markedly higher interannotator agreement than that obtained by asking if a term evokes an emotion.
Brogues and Blarney Walshe, Shane
English text construction,
01/2023, Letnik:
16, Številka:
2
Journal Article
Recenzirano
From the earliest days, Irish characters have played a prominent role in American comics. The hero of Hogan’s Alley, the first American comic strip to feature a speech balloon, was an Irish child ...named Mickey Dugan, a.k.a. “The Yellow Kid”. Likewise, the star of Happy Hooligan, the first US comic strip to employ speech balloons on a regular basis, was Irish. Thus, not only are Irish characters important in the early history of American comics, their speech is too. The way that this was represented resembled existing portrayals in pop culture, involving respellings, non-standard grammar and lexical items deemed typical of the variety. While some of these continued to index Irishness in comics, others also emerged. Building on previous research, this study examines a corpus of American comics from numerous genres and publishing houses to offer the most comprehensive overview yet of Irish speech in the medium.
Este trabajo se centra en el tratamiento de la variación léxica en el ámbito de la enseñanza del español como segunda lengua. En él se revisan oposiciones diatópicas del léxico hispánico y se ensaya ...una nueva metodología de aproximación al fenómeno de la representatividad. El objetivo último del análisis es proponer una renovación y actualización de las Nociones específicas del Plan Curricular del Instituto Cervantes (PCIC) incluyendo: (a) panhispanismos, (b) americanismos y (c) españolismos. El trabajo propone un nuevo índice de representatividad léxica y analiza los resultados del cálculo, comparándolos con los de modelos anteriores. Para la presentación de resultados se incluye una herramienta de acceso gratuito con mapas interactivos de geosinónimos. El estudio aporta una perspectiva lingüística plural, policéntrica y ecolingüística al inventario léxico del PCIC y compensa la restricción que supone describir el vocabulario únicamente a partir de la norma centro-norte peninsular española.
This paper argues that there are lexical items that conventionally express the idea of dividing one quantity by another, and per is one of them. In particular, the proposal is that there are three ...ratio-related senses of per: (i) a quotient function; (ii) a quotient operator; and (iii) quotient of measure functions. The ratio-based approach, which is built up here in order to handle a wider range of data than previous ratio-based approaches could, is contrasted with an opposing view, one on which per is a distributivity marker like each. Four types of evidence are used: (i) cases involving measurement of an object or an event whose measure is smaller than the unit given by per’s complement; (ii) uses in the differential argument of a comparative; (iii) uses modifying a measure function noun; and and (iv) uses modifying a gradable predicate. All of these are problematic for a distributivity- marker analysis, and support the idea that per expresses the concept of ratio. Along the way, we gain diagnostics for whether a given item conventionally expresses the concept of a ratio in a given language.
Social media has become the largest data source of public opinion. The application of sentiment analysis to social media texts has great potential, but faces great challenges because of domain ...heterogeneity. Sentiment orientation of words varies by content domain, but learning context-specific sentiment in social media domains continues to be a major challenge. The language domain poses another challenge since the language used in social media today differs significantly from that used in traditional media. To address these challenges, we propose a method to adapt existing sentiment lexicons for domain-specific sentiment classification using an unannotated corpus and a dictionary. We evaluate our method using two large developing corpora, containing 743,069 tweets related to the stock market and one million tweets related to political topics, respectively, and five existing sentiment lexicons as seeds and baselines. The results demonstrate the usefulness of our method, showing significant improvement in sentiment classification performance.
•We propose a method to adapt existing sentiment lexicons for domain-specific sentiment classification.•The proposed method addresses challenges from both content domain and language domain.•We evaluate our method using two large developing corpora and five existing sentiment lexicons as seeds and baselines.•The evaluation results demonstrate the usefulness of our method.