Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Learning Cross-lingual Representations for Event Coreference Resolution with Multi-view Alignment and Optimal Transport

تعلم تمثيلات عبر اللغات لتحليل Aquerence الحدث مع محاذاة متعددة الرؤية والنقل الأمثل

990 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

متخصصة لغة متعددة اللغات صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We study a new problem of cross-lingual transfer learning for event coreference resolution (ECR) where models trained on data from a source language are adapted for evaluations in different target languages. We introduce the first baseline model for this task based on XLM-RoBERTa, a state-of-the-art multilingual pre-trained language model. We also explore language adversarial neural networks (LANN) that present language discriminators to distinguish texts from the source and target languages to improve the language generalization for ECR. In addition, we introduce two novel mechanisms to further enhance the general representation learning of LANN, featuring: (i) multi-view alignment to penalize cross coreference-label alignment of examples in the source and target languages, and (ii) optimal transport to select close examples in the source and target languages to provide better training signals for the language discriminators. Finally, we perform extensive experiments for cross-lingual ECR from English to Spanish and Chinese to demonstrate the effectiveness of the proposed methods.

References used

https://aclanthology.org/

rate research

Aligning Cross-lingual Sentence Representations with Dual Momentum Contrast

558 - Association for Computation Linguistics 2021 مقالة

In this paper, we propose to align sentence representations from different languages into a unified embedding space, where semantic similarities (both cross-lingual and monolingual) can be computed with a simple dot product. Pre-trained language mode ls are fine-tuned with the translation ranking task. Existing work (Feng et al., 2020) uses sentences within the same batch as negatives, which can suffer from the issue of easy negatives. We adapt MoCo (He et al., 2020) to further improve the quality of alignment. As the experimental results show, the sentence representations produced by our model achieve the new state-of-the-art on several tasks, including Tatoeba en-zh similarity search (Artetxe andSchwenk, 2019b), BUCC en-zh bitext mining, and semantic textual similarity on 7 datasets.

dual momentum contrast momentum contrast dual momentum التباين الزخم المزدوج النقيض من الزخم الزخم المزدوج صناعة حمض الفوسفور المزيد..

Using Optimal Transport as Alignment Objective for fine-tuning Multilingual Contextualized Embeddings

752 - Association for Computation Linguistics 2021 مقالة

Recent studies have proposed different methods to improve multilingual word representations in contextualized settings including techniques that align between source and target embedding spaces. For contextualized embeddings, alignment becomes more c omplex as we additionally take context into consideration. In this work, we propose using Optimal Transport (OT) as an alignment objective during fine-tuning to further improve multilingual contextualized representations for downstream cross-lingual transfer. This approach does not require word-alignment pairs prior to fine-tuning that may lead to sub-optimal matching and instead learns the word alignments within context in an unsupervised manner. It also allows different types of mappings due to soft matching between source and target sentences. We benchmark our proposed method on two tasks (XNLI and XQuAD) and achieve improvements over baselines as well as competitive results compared to similar recent works.

optimal transport multilingual contextualized embeddings multilingual contextualized النقل الأمثل تضمينات محتوى متعددة اللغات المحاكيات متعددة اللغات صناعة حمض الفوسفور المزيد..

Specializing Multilingual Language Models: An Empirical Study

751 - Association for Computation Linguistics 2021 مقالة

Pretrained multilingual language models have become a common tool in transferring NLP capabilities to low-resource languages, often with adaptations. In this work, we study the performance, extensibility, and interaction of two such adaptations: voca bulary augmentation and script transliteration. Our evaluations on part-of-speech tagging, universal dependency parsing, and named entity recognition in nine diverse low-resource languages uphold the viability of these approaches while raising new questions around how to optimally adapt multilingual models to low-resource settings.

specializing multilingual language متخصصة لغة متعددة اللغات صناعة حمض الفوسفور

MultiFix: Learning to Repair Multiple Errors by Optimal Alignment Learning

934 - Association for Computation Linguistics 2021 مقالة

We consider the problem of learning to repair erroneous C programs by learning optimal alignments with correct programs. Since the previous approaches fix a single error in a line, it is inevitable to iterate the fixing process until no errors remain . In this work, we propose a novel sequence-to-sequence learning framework for fixing multiple program errors at a time. We introduce the edit-distance-based data labeling approach for program error correction. Instead of labeling a program repair example by pairing an erroneous program with a line fix, we label the example by paring an erroneous program with an optimal alignment to the corresponding correct program produced by the edit-distance computation. We evaluate our proposed approach on a publicly available dataset (DeepFix dataset) that consists of erroneous C programs submitted by novice programming students. On a set of 6,975 erroneous C programs from the DeepFix dataset, our approach achieves the state-of-the-art result in terms of full repair rate on the DeepFix dataset (without extra data such as compiler error message or additional source codes for pre-training).

optimal alignment learning learning optimal alignments optimal alignment التعلم محاذاة الأمثل تعلم المحاذاة المثلى المحاذاة المثلى صناعة حمض الفوسفور المزيد..

Adversarial Attack against Cross-lingual Knowledge Graph Alignment

1171 - Association for Computation Linguistics 2021 مقالة

Recent literatures have shown that knowledge graph (KG) learning models are highly vulnerable to adversarial attacks. However, there is still a paucity of vulnerability analyses of cross-lingual entity alignment under adversarial attacks. This paper proposes an adversarial attack model with two novel attack techniques to perturb the KG structure and degrade the quality of deep cross-lingual entity alignment. First, an entity density maximization method is employed to hide the attacked entities in dense regions in two KGs, such that the derived perturbations are unnoticeable. Second, an attack signal amplification method is developed to reduce the gradient vanishing issues in the process of adversarial attacks for further improving the attack effectiveness.

cross-lingual knowledge graph knowledge graph alignment الرسم البياني المعرفة عبر اللغات المعرفة الرسم البياني محاذاة صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Learning Cross-lingual Representations for Event Coreference Resolution with Multi-view Alignment and Optimal Transport

تعلم تمثيلات عبر اللغات لتحليل Aquerence الحدث مع محاذاة متعددة الرؤية والنقل الأمثل

Ask ChatGPT about the research

Read More

suggested questions