New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Exophoric Pronoun Resolution in Dialogues with Topic Regularization

قرار الضمير exophoric في الحوارات مع تنظيم موضوع

104 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Resolving pronouns to their referents has long been studied as a fundamental natural language understanding problem. Previous works on pronoun coreference resolution (PCR) mostly focus on resolving pronouns to mentions in text while ignoring the exophoric scenario. Exophoric pronouns are common in daily communications, where speakers may directly use pronouns to refer to some objects present in the environment without introducing the objects first. Although such objects are not mentioned in the dialogue text, they can often be disambiguated by the general topics of the dialogue. Motivated by this, we propose to jointly leverage the local context and global topics of dialogues to solve the out-of-text PCR problem. Extensive experiments demonstrate the effectiveness of adding topic regularization for resolving exophoric pronouns.

References used

https://aclanthology.org/

rate research

Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution

87 - Association for Computation Linguistics 2021 مقالة

Masked language models (MLMs) have contributed to drastic performance improvements with regard to zero anaphora resolution (ZAR). To further improve this approach, in this study, we made two proposals. The first is a new pretraining task that trains MLMs on anaphoric relations with explicit supervision, and the second proposal is a new finetuning method that remedies a notorious issue, the pretrain-finetune discrepancy. Our experiments on Japanese ZAR demonstrated that our two proposals boost the state-of-the-art performance, and our detailed analysis provides new insights on the remaining challenges.

pronoun resolution improves pseudo zero pronoun pronoun resolution قرار الضمير يحسن ضمير زائف صفر قرار الضمير صناعة حمض الفوسفور المزيد..

Coupling Context Modeling with Zero Pronoun Recovering for Document-Level Natural Language Generation

445 - Association for Computation Linguistics 2021 مقالة

Natural language generation (NLG) tasks on pro-drop languages are known to suffer from zero pronoun (ZP) problems, and the problems remain challenging due to the scarcity of ZP-annotated NLG corpora. In this case, we propose a highly adaptive two-sta ge approach to couple context modeling with ZP recovering to mitigate the ZP problem in NLG tasks. Notably, we frame the recovery process in a task-supervised fashion where the ZP representation recovering capability is learned during the NLG task learning process, thus our method does not require NLG corpora annotated with ZPs. For system enhancement, we learn an adversarial bot to adjust our model outputs to alleviate the error propagation caused by mis-recovered ZPs. Experiments on three document-level NLG tasks, i.e., machine translation, question answering, and summarization, show that our approach can improve the performance to a great extent, and the improvement on pronoun translation is very impressive.

تلخيص الجملة coupling context modeling نموذج سياق اقتران صناعة حمض الفوسفور

Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring

140 - Association for Computation Linguistics 2021 مقالة

Dialogue topic segmentation is critical in several dialogue modeling problems. However, popular unsupervised approaches only exploit surface features in assessing topical coherence among utterances. In this work, we address this limitation by leverag ing supervisory signals from the utterance-pair coherence scoring task. First, we present a simple yet effective strategy to generate a training corpus for utterance-pair coherence scoring. Then, we train a BERT-based neural utterance-pair coherence model with the obtained training corpus. Finally, such model is used to measure the topical relevance between utterances, acting as the basis of the segmentation inference. Experiments on three public datasets in English and Chinese demonstrate that our proposal outperforms the state-of-the-art baselines.

dialogue topic segmentation unsupervised dialogue topic improving unsupervised dialogue تجزئة موضوع الحوار موضوع الحوار غير المزعوم تحسين الحوار غير المنشور صناعة حمض الفوسفور المزيد..

Multi-view Subword Regularization

270 - Association for Computation Linguistics 2021 مقالة

Multilingual pretrained representations generally rely on subword segmentation algorithms to create a shared multilingual vocabulary. However, standard heuristic algorithms often lead to sub-optimal segmentation, especially for languages with limited amounts of data. In this paper, we take two major steps towards alleviating this problem. First, we demonstrate empirically that applying existing subword regularization methods (Kudo, 2018; Provilkov et al., 2020) during fine-tuning of pre-trained multilingual representations improves the effectiveness of cross-lingual transfer. Second, to take full advantage of different possible input segmentations, we propose Multi-view Subword Regularization (MVR), a method that enforces the consistency of predictors between using inputs tokenized by the standard and probabilistic segmentations. Results on the XTREME multilingual benchmark (Hu et al., 2020) show that MVR brings consistent improvements of up to 2.5 points over using standard segmentation algorithms.

multi-view subword regularization subword regularization multi-view subword تنظيم الكلمات الفرعية متعددة المنظر تنظيم الكلمات الفرعية كلمة فرعية متعددة صناعة حمض الفوسفور المزيد..

Proxy Indicators for the Quality of Open-domain Dialogues

69 - Association for Computation Linguistics 2021 مقالة

The automatic evaluation of open-domain dialogues remains a largely unsolved challenge. Despite the abundance of work done in the field, human judges have to evaluate dialogues' quality. As a consequence, performing such evaluations at scale is usual ly expensive. This work investigates using a deep-learning model trained on the General Language Understanding Evaluation (GLUE) benchmark to serve as a quality indication of open-domain dialogues. The aim is to use the various GLUE tasks as different perspectives on judging the quality of conversation, thus reducing the need for additional training data or responses that serve as quality references. Due to this nature, the method can infer various quality metrics and can derive a component-based overall score. We achieve statistically significant correlation coefficients of up to 0.7.

proxy indicators open-domain dialogues open-domain dialogues remains مؤشرات الوكيل الحوارات مفتوحة المجال يبقى حوارات النطاق المفتوح صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Exophoric Pronoun Resolution in Dialogues with Topic Regularization

قرار الضمير exophoric في الحوارات مع تنظيم موضوع

Ask ChatGPT about the research

Read More

suggested questions