Research papers, master and doctoral theses about تحسين

Improving and Simplifying Pattern Exploiting Training

124 - Association for Computation Linguistics 2021 مقالة

Recently, pre-trained language models (LMs) have achieved strong performance when fine-tuned on difficult benchmarks like SuperGLUE. However, performance can suffer when there are very few labeled examples available for fine-tuning. Pattern Exploitin g Training (PET) is a recent approach that leverages patterns for few-shot learning. However, PET uses task-specific unlabeled data. In this paper, we focus on few-shot learning without any unlabeled data and introduce ADAPET, which modifies PET's objective to provide denser supervision during fine-tuning. As a result, ADAPET outperforms PET on SuperGLUE without any task-specific unlabeled data.

simplifying pattern exploiting pattern exploiting training improving and simplifying تبسيط استغلال نمط نمط استغلال التدريب تحسين وتبسيطها صناعة حمض الفوسفور المزيد..

Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding

463 - Association for Computation Linguistics 2021 مقالة

Large-scale, pre-trained language models (LMs) have achieved human-level performance on a breadth of language understanding tasks. However, evaluations only based on end task performance shed little light on machines' true ability in language underst anding and reasoning. In this paper, we highlight the importance of evaluating the underlying reasoning process in addition to end performance. Toward this goal, we introduce Tiered Reasoning for Intuitive Physics (TRIP), a novel commonsense reasoning dataset with dense annotations that enable multi-tiered evaluation of machines' reasoning process. Our empirical results show that while large LMs can achieve high end performance, they struggle to support their predictions with valid supporting evidence. The TRIP dataset and our baseline results will motivate verifiable evaluation of commonsense reasoning and facilitate future research toward developing better language understanding and reasoning models.

تحسين nlu. intuitive physics commonsense language understanding الفيزياء بديهية فهم لغة المنطقية صناعة حمض الفوسفور

Improving Stance Detection with Multi-Dataset Learning and Knowledge Distillation

228 - Association for Computation Linguistics 2021 مقالة

Stance detection determines whether the author of a text is in favor of, against or neutral to a specific target and provides valuable insights into important events such as legalization of abortion. Despite significant progress on this task, one of the remaining challenges is the scarcity of annotations. Besides, most previous works focused on a hard-label training in which meaningful similarities among categories are discarded during training. To address these challenges, first, we evaluate a multi-target and a multi-dataset training settings by training one model on each dataset and datasets of different domains, respectively. We show that models can learn more universal representations with respect to targets in these settings. Second, we investigate the knowledge distillation in stance detection and observe that transferring knowledge from a teacher model to a student model can be beneficial in our proposed training settings. Moreover, we propose an Adaptive Knowledge Distillation (AKD) method that applies instance-specific temperature scaling to the teacher and student predictions. Results show that the multi-dataset model performs best on all datasets and it can be further improved by the proposed AKD, outperforming the state-of-the-art by a large margin. We publicly release our code.

improving stance detection stance detection determines تحسين اكتشاف الموقف يحدد الكشف عن الموقف صناعة حمض الفوسفور

Translate \& Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic Data

155 - Association for Computation Linguistics 2021 مقالة

While multilingual pretrained language models (LMs) fine-tuned on a single language have shown substantial cross-lingual task transfer capabilities, there is still a wide performance gap in semantic parsing tasks when target language supervision is a vailable. In this paper, we propose a novel Translate-and-Fill (TaF) method to produce silver training data for a multilingual semantic parser. This method simplifies the popular Translate-Align-Project (TAP) pipeline and consists of a sequence-to-sequence filler model that constructs a full parse conditioned on an utterance and a view of the same parse. Our filler is trained on English data only but can accurately complete instances in other languages (i.e., translations of the English training utterances), in a zero-shot fashion. Experimental results on three multilingual semantic parsing datasets show that data augmentation with TaF reaches accuracies competitive with similar systems which rely on traditional alignment techniques.

multilingual semantic parsing improving zero-shot multilingual تحليل الدلالي متعدد اللغات تحسين صفر النار متعددة اللغات صناعة حمض الفوسفور

``Be nice to your wife! The restaurants are closed'': Can Gender Stereotype Detection Improve Sexism Classification?

170 - Association for Computation Linguistics 2021 مقالة

In this paper, we focus on the detection of sexist hate speech against women in tweets studying for the first time the impact of gender stereotype detection on sexism classification. We propose: (1) the first dataset annotated for gender stereotype d etection, (2) a new method for data augmentation based on sentence similarity with multilingual external datasets, and (3) a set of deep learning experiments first to detect gender stereotypes and then, to use this auxiliary task for sexism detection. Although the presence of stereotypes does not necessarily entail hateful content, our results show that sexism classification can definitively benefit from gender stereotype detection.

gender stereotype detection gender stereotype improve sexism classification كشف النمط الجنساني الصورة النمطية بين الجنسين تحسين تصنيف الجنس صناعة حمض الفوسفور المزيد..

Paraphrase Generation: A Survey of the State of the Art

257 - Association for Computation Linguistics 2021 مقالة

This paper focuses on paraphrase generation,which is a widely studied natural language generation task in NLP. With the development of neural models, paraphrase generation research has exhibited a gradual shift to neural methods in the recent years. This has provided architectures for contextualized representation of an input text and generating fluent, diverseand human-like paraphrases. This paper surveys various approaches to paraphrase generation with a main focus on neural methods.

تحسين المفهوم حالة صناعة حمض الفوسفور

MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

216 - Association for Computation Linguistics 2021 مقالة

Entity retrieval, which aims at disambiguating mentions to canonical entities from massive KBs, is essential for many tasks in natural language processing. Recent progress in entity retrieval shows that the dual-encoder structure is a powerful and ef ficient framework to nominate candidates if entities are only identified by descriptions. However, they ignore the property that meanings of entity mentions diverge in different contexts and are related to various portions of descriptions, which are treated equally in previous works. In this work, we propose Multi-View Entity Representations (MuVER), a novel approach for entity retrieval that constructs multi-view representations for entity descriptions and approximates the optimal view for mentions via a heuristic searching method. Our method achieves the state-of-the-art performance on ZESHEL and improves the quality of candidates on three standard Entity Linking datasets.

improving first-stage entity improving first-stage first-stage entity retrieval تحسين كيان المرحلة الأولى تحسين المرحلة الأولى استرجاع كيان المرحلة الأولى صناعة حمض الفوسفور المزيد..

Reconsidering the Past: Optimizing Hidden States in Language Models

244 - Association for Computation Linguistics 2021 مقالة

We present Hidden-State Optimization (HSO), a gradient-based method for improving the performance of transformer language models at inference time. Similar to dynamic evaluation (Krause et al., 2018), HSO computes the gradient of the log-probability the language model assigns to an evaluation text, but uses it to update the cached hidden states rather than the model parameters. We test HSO with pretrained Transformer-XL and GPT-2 language models, finding improvement on the WikiText-103 and PG-19 datasets in terms of perplexity, especially when evaluating a model outside of its training distribution. We also demonstrate downstream applicability by showing gains in the recently developed prompt-based few-shot evaluation setting, again with no extra parameters or training data.

optimizing hidden states reconsidering the past optimizing hidden تحسين الدول المخفية إعادة النظر في الماضي تحسين مخفي صناعة حمض الفوسفور المزيد..

Improving Knowledge Graph Embedding Using Affine Transformations of Entities Corresponding to Each Relation

115 - Association for Computation Linguistics 2021 مقالة

To find a suitable embedding for a knowledge graph remains a big challenge nowadays. By using previous knowledge graph embedding methods, every entity in a knowledge graph is usually represented as a k-dimensional vector. As we know, an affine transf ormation can be expressed in the form of a matrix multiplication followed by a translation vector. In this paper, we firstly utilize a set of affine transformations related to each relation to operate on entity vectors, and then these transformed vectors are used for performing embedding with previous methods. The main advantage of using affine transformations is their good geometry properties with interpretability. Our experimental results demonstrate that the proposed intuitive design with affine transformations provides a statistically significant increase in performance with adding a few extra processing steps or adding a limited number of additional variables. Taking TransE as an example, we employ the scale transformation (the special case of an affine transformation), and only introduce k additional variables for each relation. Surprisingly, it even outperforms RotatE to some extent on various data sets. We also introduce affine transformations into RotatE, Distmult and ComplEx, respectively, and each one outperforms its original method.

طرق استخراج العلاقة improving knowledge graph تحسين الرسم البياني المعرفة صناعة حمض الفوسفور

Clustering Monolingual Vocabularies to Improve Cross-Lingual Generalization

509 - Association for Computation Linguistics 2021 مقالة

Multilingual language models exhibit better performance for some languages than for others (Singh et al., 2019), and many languages do not seem to benefit from multilingual sharing at all, presumably as a result of poor multilingual segmentation (Pyy sal o et al., 2020). This work explores the idea of learning multilingual language models based on clustering of monolingual segments. We show significant improvements over standard multilingual segmentation and training across nine languages on a question answering task, both in a small model regime and for a model of the size of BERT-base.

improve cross-lingual generalization vocabularies to improve clustering monolingual vocabularies تحسين التعميم عبر اللغات المفردات لتحسين تجميع المفردات أحادية الأونلينغ صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد