Subscribe to the gold package and get unlimited access to Shamra Academy

Quantifying Contextual Aspects of Inter-annotator Agreement in Intertextuality Research

تحديد الجوانب السياقية لاتفاق المشتريات في أبحاث InterteXtuality

549 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

quantifying contextual aspects intertextuality research inter-annotator agreement تحديد الجوانب السياقية أبحاث intertextuality اتفاقية المعلقين صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We report on an inter-annotator agreement experiment involving instances of text reuse focusing on the well-known case of biblical intertextuality in medieval literature. We target the application use case of literary scholars whose aim is to document instances of biblical references in the apparatus fontium' of a prospective digital edition. We develop a Bayesian implementation of Cohen's kappa for multiple annotators that allows us to assess the influence of various contextual effects on the inter-annotator agreement, producing both more robust estimates of the agreement indices as well as insights into the annotation process that leads to the estimated indices. As a result, we are able to produce a novel and nuanced estimation of inter-annotator agreement in the context of intertextuality, exploring the challenges that arise from manually annotating a dataset of biblical references in the writings of Bernard of Clairvaux. Among others, our method was able to unveil the fact that the obtained agreement depends heavily on the biblical source book of the proposed reference, as well as the underlying algorithm used to retrieve the candidate match.

References used

https://aclanthology.org/

rate research

Profiling of Intertextuality in Latin Literature Using Word Embeddings

1089 - Association for Computation Linguistics 2021 مقالة

Identifying intertextual relationships between authors is of central importance to the study of literature. We report an empirical analysis of intertextuality in classical Latin literature using word embedding models. To enable quantitative evaluatio n of intertextual search methods, we curate a new dataset of 945 known parallels drawn from traditional scholarship on Latin epic poetry. We train an optimized word2vec model on a large corpus of lemmatized Latin, which achieves state-of-the-art performance for synonym detection and outperforms a widely used lexical method for intertextual search. We then demonstrate that training embeddings on very small corpora can capture salient aspects of literary style and apply this approach to replicate a previous intertextual study of the Roman historian Livy, which relied on hand-crafted stylometric features. Our results advance the development of core computational resources for a major premodern language and highlight a productive avenue for cross-disciplinary collaboration between the study of literature and NLP.

latin literature classical latin literature profiling of intertextuality الأدب اللاتيني الأدب اللاتيني الكلاسيكي تنميط intertextulity. صناعة حمض الفوسفور المزيد..

Introducing CAD: the Contextual Abuse Dataset

871 - Association for Computation Linguistics 2021 مقالة

Online abuse can inflict harm on users and communities, making online spaces unsafe and toxic. Progress in automatically detecting and classifying abusive content is often held back by the lack of high quality and detailed datasets.We introduce a new dataset of primarily English Reddit entries which addresses several limitations of prior work. It (1) contains six conceptually distinct primary categories as well as secondary categories, (2) has labels annotated in the context of the conversation thread, (3) contains rationales and (4) uses an expert-driven group-adjudication process for high quality annotations. We report several baseline models to benchmark the work of future researchers. The annotated dataset, annotation guidelines, models and code are freely available.

contextual abuse dataset introducing cad contextual abuse بيانات الإساءة السياقية تقديم CAD. سوء المعاملة السياقية صناعة حمض الفوسفور المزيد..

Quantifying Cognitive Factors in Lexical Decline

683 - Association for Computation Linguistics 2021 مقالة

Abstract We adopt an evolutionary view on language change in which cognitive factors (in addition to social ones) affect the fitness of words and their success in the linguistic ecosystem. Specifically, we propose a variety of psycholinguistic factor s---semantic, distributional, and phonological---that we hypothesize are predictive of lexical decline, in which words greatly decrease in frequency over time. Using historical data across three languages (English, French, and German), we find that most of our proposed factors show a significant difference in the expected direction between each curated set of declining words and their matched stable words. Moreover, logistic regression analyses show that semantic and distributional factors are significant in predicting declining words. Further diachronic analysis reveals that declining words tend to decrease in the diversity of their lexical contexts over time, gradually narrowing their ecological niches'.

quantifying cognitive factors quantifying cognitive lexical decline تحديد العوامل المعرفية تحديد الكمي المعرفي انخفاض المعجمات صناعة حمض الفوسفور المزيد..

Relational World Knowledge Representation in Contextual Language Models: A Review

672 - Association for Computation Linguistics 2021 مقالة

Relational knowledge bases (KBs) are commonly used to represent world knowledge in machines. However, while advantageous for their high degree of precision and interpretability, KBs are usually organized according to manually-defined schemas, which l imit their expressiveness and require significant human efforts to engineer and maintain. In this review, we take a natural language processing perspective to these limitations, examining how they may be addressed in part by training deep contextual language models (LMs) to internalize and express relational knowledge in more flexible forms. We propose to organize knowledge representation strategies in LMs by the level of KB supervision provided, from no KB supervision at all to entity- and relation-level supervision. Our contributions are threefold: (1) We provide a high-level, extensible taxonomy for knowledge representation in LMs; (2) Within our taxonomy, we highlight notable models, evaluation tasks, and findings, in order to provide an up-to-date review of current knowledge representation capabilities in LMs; and (3) We suggest future research directions that build upon the complementary aspects of LMs and KBs as knowledge representations.

الاهتمام العصبي يدرك التسلسل الهرمي relational world knowledge world knowledge representation المعرفة العالمية العلائقية تمثيل المعرفة العالمي صناعة حمض الفوسفور

Exploiting Image--Text Synergy for Contextual Image Captioning

1170 - Association for Computation Linguistics 2021 مقالة

Modern web content - news articles, blog posts, educational resources, marketing brochures - is predominantly multimodal. A notable trait is the inclusion of media such as images placed at meaningful locations within a textual narrative. Most often, such images are accompanied by captions - either factual or stylistic (humorous, metaphorical, etc.) - making the narrative more engaging to the reader. While standalone image captioning has been extensively studied, captioning an image based on external knowledge such as its surrounding text remains under-explored. In this paper, we study this new task: given an image and an associated unstructured knowledge snippet, the goal is to generate a contextual caption for the image.

text synergy synergy for contextual نص التآزر التآزر إلى السياق صناعة حمض الفوسفور

Quantifying Contextual Aspects of Inter-annotator Agreement in Intertextuality Research

تحديد الجوانب السياقية لاتفاق المشتريات في أبحاث InterteXtuality

Ask ChatGPT about the research

Read More

suggested questions