
Monolingual Word Sense Alignment as a Classification Problem



Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Language: English

Created by Shamra Editor

Keywords: word sense alignment, classification problem, monolingual word sense


Abstract

Different resources define words by their meanings in different ways. Aligning word senses across monolingual lexicographic resources increases domain coverage and enables integration and incorporation of data. In this paper, we explore the application of classification methods using manually extracted features along with representation learning techniques in the task of word sense alignment and semantic relationship detection. We demonstrate that the performance of classification methods varies dramatically with the type of semantic relationship, owing to the nature of the task, but that these methods outperform previous experiments.

References used

https://aclanthology.org/


ConSeC: Word Sense Disambiguation as Continuous Sense Comprehension

Association for Computational Linguistics, 2021, article

Supervised systems have nowadays become the standard recipe for Word Sense Disambiguation (WSD), with Transformer-based language models as their primary ingredient. However, while these systems have certainly attained unprecedented performances, virtually all of them operate under the constraining assumption that, given a context, each word can be disambiguated individually with no account of the other sense choices. To address this limitation and drop this assumption, we propose CONtinuous SEnse Comprehension (ConSeC), a novel approach to WSD: leveraging a recent re-framing of this task as a text extraction problem, we adapt it to our formulation and introduce a feedback loop strategy that allows the disambiguation of a target word to be conditioned not only on its context but also on the explicit senses assigned to nearby words. We evaluate ConSeC and examine how its components lead it to surpass all its competitors and set a new state of the art on English WSD. We also explore how ConSeC fares in the cross-lingual setting, focusing on 8 languages with various degrees of resource availability, and report significant improvements over prior systems. We release our code at https://github.com/SapienzaNLP/consec.

Keywords: continuous sense comprehension

Now, It's Personal: The Need for Personalized Word Sense Disambiguation

Association for Computational Linguistics, 2021, article

Authors of text tend to predominantly use a single sense for a lemma that can differ among different authors. This might not be captured with an author-agnostic word sense disambiguation (WSD) model that was trained on multiple authors. Our work finds that WordNet's first senses, the predominant senses of our dataset's genre, and the predominant senses of an author can all be different and therefore, author-agnostic models could perform well over the entire dataset, but poorly on individual authors. In this work, we explore methods for personalizing WSD models by tailoring existing state-of-the-art models toward an individual by exploiting the author's sense distributions. We propose a novel WSD dataset and show that personalizing a WSD system with knowledge of an author's sense distributions or predominant senses can greatly increase its performance.

Keywords: personalized word sense, personalized word

CombAlign: a Tool for Obtaining High-Quality Word Alignments

Association for Computational Linguistics, 2021, article

Being able to generate accurate word alignments is useful for a variety of tasks. While statistical word aligners can work well, especially when parallel training data are plentiful, multilingual embedding models have recently been shown to give good results in unsupervised scenarios. We evaluate an ensemble method for word alignment on four language pairs and demonstrate that by combining multiple tools, taking advantage of their different approaches, substantial gains can be made. This holds for settings ranging from very low-resource to high-resource. Furthermore, we introduce a new gold alignment test set for Icelandic and a new easy-to-use tool for creating manual word alignments.

Keywords: obtaining high-quality word, obtaining high-quality, high-quality word alignments

Persian SemCor: A Bag of Word Sense Annotated Corpus for the Persian Language

Association for Computational Linguistics, 2021, article

Supervised approaches usually achieve the best performance in the Word Sense Disambiguation problem. However, the unavailability of large sense-annotated corpora for many low-resource languages makes these approaches inapplicable to them in practice. In this paper, we mitigate this issue for the Persian language by proposing a fully automatic approach for obtaining Persian SemCor (PerSemCor), a Persian Bag-of-Word (BoW) sense-annotated corpus. We evaluated PerSemCor both intrinsically and extrinsically and showed that it can be effectively used as a training set for Persian supervised WSD systems. To encourage future research on Persian Word Sense Disambiguation, we release PerSemCor at http://nlp.sbu.ac.ir.

Keywords: word sense annotated, word sense

Parallel Text Alignment and Monolingual Parallel Corpus Creation from Philosophical Texts for Text Simplification

Association for Computational Linguistics, 2021, article

Text simplification is a growing field with many potentially useful applications. Training text simplification algorithms generally requires a lot of annotated data; however, there are not many corpora suitable for this task. We propose a new unsupervised method for aligning text based on Doc2Vec embeddings and a new alignment algorithm, capable of aligning texts at different levels. Initial evaluation shows promising results for the new approach. We used the newly developed approach to create a new monolingual parallel corpus composed of the works of English early modern philosophers and their corresponding simplified versions.

Keywords: creation from philosophical, parallel corpus creation, philosophical texts
