Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics

التعميم في NLI: طرق (لا) لتجاوز الاستدلال البسيط

809 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

simple heuristics generalization in nli learning dataset-specific heuristics الاستدلال البسيطة التعميم في NLI. التعلم الاستدلال محددات البيانات صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Much of recent progress in NLU was shown to be due to models' learning dataset-specific heuristics. We conduct a case study of generalization in NLI (from MNLI to the adversarially constructed HANS dataset) in a range of BERT-based architectures (adapters, Siamese Transformers, HEX debiasing), as well as with subsampling the data and increasing the model size. We report 2 successful and 3 unsuccessful strategies, all providing insights into how Transformer-based models learn to generalize.

References used

https://aclanthology.org/

rate research

Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLI

607 - Association for Computation Linguistics 2021 مقالة

Although neural models have shown strong performance in datasets such as SNLI, they lack the ability to generalize out-of-distribution (OOD). In this work, we formulate a few-shot learning setup and examine the effects of natural language explanation s on OOD generalization. We leverage the templates in the HANS dataset and construct templated natural language explanations for each template. Although generated explanations show competitive BLEU scores against ground truth explanations, they fail to improve prediction performance. We further show that generated explanations often hallucinate information and miss key elements that indicate the label.

التعدين الرسم البياني few-shot nli قليل النار nli صناعة حمض الفوسفور

Generalization in Instruction Following Systems

556 - Association for Computation Linguistics 2021 مقالة

Understanding and executing natural language instructions in a grounded domain is one of the hallmarks of artificial intelligence. In this paper, we focus on instruction understanding in the blocks world domain and investigate the language understand ing abilities of two top-performing systems for the task. We aim to understand if the test performance of these models indicates an understanding of the spatial domain and of the natural language instructions relative to it, or whether they merely over-fit spurious signals in the dataset. We formulate a set of expectations one might have from an instruction following model and concretely characterize the different dimensions of robustness such a model should possess. Despite decent test performance, we find that state-of-the-art models fall short of these expectations and are extremely brittle. We then propose a learning strategy that involves data augmentation and show through extensive experiments that the proposed learning strategy yields models that are competitive on the original test set while satisfying our expectations much better.

natural language instructions language instructions تعليمات اللغة الطبيعية تعليمات اللغة صناعة حمض الفوسفور

Clustering Monolingual Vocabularies to Improve Cross-Lingual Generalization

1042 - Association for Computation Linguistics 2021 مقالة

Multilingual language models exhibit better performance for some languages than for others (Singh et al., 2019), and many languages do not seem to benefit from multilingual sharing at all, presumably as a result of poor multilingual segmentation (Pyy sal o et al., 2020). This work explores the idea of learning multilingual language models based on clustering of monolingual segments. We show significant improvements over standard multilingual segmentation and training across nine languages on a question answering task, both in a small model regime and for a model of the size of BERT-base.

improve cross-lingual generalization vocabularies to improve clustering monolingual vocabularies تحسين التعميم عبر اللغات المفردات لتحسين تجميع المفردات أحادية الأونلينغ صناعة حمض الفوسفور المزيد..

Can NLI Models Verify QA Systems' Predictions?

923 - Association for Computation Linguistics 2021 مقالة

To build robust question answering systems, we need the ability to verify whether answers to questions are truly correct, not just good enough'' in the context of imperfect QA datasets. We explore the use of natural language inference (NLI) as a way to achieve this goal, as NLI inherently requires the premise (document context) to contain all necessary information to support the hypothesis (proposed answer to the question). We leverage large pre-trained models and recent prior datasets to construct powerful question conversion and decontextualization modules, which can reformulate QA instances as premise-hypothesis pairs with very high reliability. Then, by combining standard NLI datasets with NLI examples automatically derived from QA training data, we can train NLI models to evaluate QA models' proposed answers. We show that our approach improves the confidence estimation of a QA model across different domains, evaluated in a selective QA setting. Careful manual analysis over the predictions of our NLI model shows that it can further identify cases where the QA model produces the right answer for the wrong reason, i.e., when the answer sentence cannot address all aspects of the question.

nli models verify systems' predictions verify qa systems' نماذج nli تحقق تنبؤات النظم تحقق من أنظمة ضمان الجودة صناعة حمض الفوسفور المزيد..

Compositional Generalization via Semantic Tagging

928 - Association for Computation Linguistics 2021 مقالة

Although neural sequence-to-sequence models have been successfully applied to semantic parsing, they fail at compositional generalization, i.e., they are unable to systematically generalize to unseen compositions of seen components. Motivated by trad itional semantic parsing where compositionality is explicitly accounted for by symbolic grammars, we propose a new decoding framework that preserves the expressivity and generality of sequence-to-sequence models while featuring lexicon-style alignments and disentangled information processing. Specifically, we decompose decoding into two phases where an input utterance is first tagged with semantic symbols representing the meaning of individual words, and then a sequence-to-sequence model is used to predict the final meaning representation conditioning on the utterance and the predicted tag sequence. Experimental results on three semantic parsing datasets show that the proposed approach consistently improves compositional generalization across model architectures, domains, and semantic formalisms.

compositional generalization semantic tagging التعميم التركيبي العلامة الدلالية صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics

التعميم في NLI: طرق (لا) لتجاوز الاستدلال البسيط

Ask ChatGPT about the research

Read More

suggested questions