
Human-Model Divergence in the Handling of Vagueness


Publication date: 2021
Research language: English
Created by Shamra Editor





While aggregate performance metrics can generate valuable insights at a large scale, their dominance means more complex and nuanced language phenomena, such as vagueness, may be overlooked. Focusing on vague terms (e.g. sunny, cloudy, young, etc.) we inspect the behavior of visually grounded and text-only models, finding systematic divergences from human judgments even when a model's overall performance is high. To help explain this disparity, we identify two assumptions made by the datasets and models examined and, guided by the philosophy of vagueness, isolate cases where they do not hold.
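To make the reported divergence concrete, here is a minimal sketch (illustrative only; the predicate, items, and all numbers are hypothetical, not data from the paper) that compares graded human judgements for a vague term such as "sunny" against a model's probabilities for the same items, then looks separately at the borderline region where aggregate metrics tend to hide disagreement.

```python
# Minimal sketch: compare graded human judgements for a vague predicate
# against a model's probabilities. All values below are hypothetical.
from scipy.stats import spearmanr

# Proportion of annotators who called each scene "sunny" (hypothetical).
human_rates = [0.95, 0.80, 0.55, 0.30, 0.10]

# Model probability of the label "sunny" for the same scenes (hypothetical).
model_probs = [0.99, 0.97, 0.91, 0.20, 0.05]

# Aggregate rank correlation can look fine even when the model treats
# borderline cases (human rates near 0.5) as near-certain.
rho, p = spearmanr(human_rates, model_probs)
print(f"Spearman rho = {rho:.2f} (p = {p:.3f})")

# Inspect borderline items separately, where vagueness matters most.
borderline = [(h, m) for h, m in zip(human_rates, model_probs) if 0.3 <= h <= 0.7]
print("borderline (human, model):", borderline)
```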



References used: https://aclanthology.org/

Related research

We address the problem of enhancing model robustness through regularization. Specifically, we focus on methods that regularize the model posterior difference between clean and noisy inputs. Theoretically, we provide a connection between two recent methods, Jacobian Regularization and Virtual Adversarial Training, under this framework. Additionally, we generalize the posterior differential regularization to the family of f-divergences and characterize the overall framework in terms of the Jacobian matrix. Empirically, we compare those regularizations and standard BERT training on a diverse set of tasks to provide a comprehensive profile of their effect on model generalization. For both fully supervised and semi-supervised settings, we show that regularizing the posterior difference with an f-divergence can substantially improve model robustness. In particular, with a proper f-divergence, a BERT-base model can achieve generalization comparable to its BERT-large counterpart for in-domain, adversarial, and domain-shift scenarios, indicating the great potential of the proposed framework for enhancing NLP model robustness.
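As a rough illustration of the posterior differential idea (a sketch under assumptions, not the authors' implementation), the snippet below adds a KL penalty between a model's posteriors on clean and noise-perturbed inputs; KL is one member of the f-divergence family mentioned above, and `model`, `inputs`, and `labels` are placeholders for a network operating on continuous representations such as embeddings.

```python
# Sketch of posterior differential regularization with a KL penalty.
# `model` is assumed to map continuous inputs (e.g. embeddings) to logits.
import torch
import torch.nn.functional as F

def posterior_differential_loss(model, inputs, labels, noise_std=1e-3, alpha=1.0):
    """Cross-entropy on clean inputs plus KL(p_clean || p_noisy)."""
    logits_clean = model(inputs)
    ce = F.cross_entropy(logits_clean, labels)

    # Perturb the continuous input representation with Gaussian noise.
    noisy_inputs = inputs + noise_std * torch.randn_like(inputs)
    logits_noisy = model(noisy_inputs)

    # KL divergence between the clean and noisy posteriors; other
    # f-divergences can be substituted for this term.
    kl = F.kl_div(
        F.log_softmax(logits_noisy, dim=-1),
        F.softmax(logits_clean, dim=-1),
        reduction="batchmean",
    )
    return ce + alpha * kl
```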
The paper reports on an effort to reconsider the representation of some cases of derivational paradigm patterns in Bulgarian. The new treatment implemented within BulTreeBank-WordNet (BTB-WN), a wordnet for Bulgarian, is the grouping together of related words that have a common main meaning in the same synset, while the nuances in sense are encoded within the synset as modification functions over the main meaning. In this way, we can solve the following challenges: (1) to avoid the influence of English Wordnet (EWN) synset distinctions over Bulgarian that resulted from the translation of some of the synsets from Core WordNet; (2) to represent the common meaning of such derivation patterns just once and to improve the management of BTB-WN; and (3) to encode idiosyncratic usages locally to the corresponding synsets instead of introducing new semantic relations.
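A minimal sketch of this representation idea (the structure and the example are hypothetical, not BTB-WN's actual format): a synset stores one main meaning, and each lemma's nuance is a modification function applied to that meaning.

```python
# Hypothetical synset structure: one main meaning plus per-lemma
# modification functions that encode nuances of sense.
from dataclasses import dataclass, field
from typing import Callable, Dict

@dataclass
class Synset:
    main_meaning: str
    lemmas: Dict[str, Callable[[str], str]] = field(default_factory=dict)

    def sense_of(self, lemma: str) -> str:
        # Apply the lemma's modification function, or fall back to the main meaning.
        modify = self.lemmas.get(lemma, lambda meaning: meaning)
        return modify(self.main_meaning)

# Hypothetical derivational pattern: a base noun and its diminutive share
# one synset instead of being split into two.
s = Synset(
    main_meaning="house: a building for human habitation",
    lemmas={
        "house": lambda m: m,
        "little house": lambda m: m + " (diminutive nuance)",
    },
)
print(s.sense_of("little house"))
```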
The quality of fully automated text simplification systems is not good enough for use in real-world settings; instead, human simplifications are used. In this paper, we examine how to improve the cost and quality of human simplifications by leveraging crowdsourcing. We introduce a graph-based sentence fusion approach to augment human simplifications and a reranking approach to both select high-quality simplifications and allow for targeting simplifications with varying levels of simplicity. Using the Newsela dataset (Xu et al., 2015), we show consistent improvements over experts at varying simplification levels and find that the additional sentence fusion simplifications allow for simpler output than the human simplifications alone.
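The reranking step could look roughly like the toy sketch below (illustrative only, not the paper's system): candidate simplifications are scored with a simplicity proxy, and the one closest to a requested level is selected; the words-per-sentence proxy stands in for the learned quality and simplicity models.

```python
# Toy reranker: pick the candidate simplification whose simplicity score
# is closest to a requested target level.
def simplicity(text: str) -> float:
    """Crude proxy: fewer words per sentence counts as simpler (higher score)."""
    sentences = [s for s in text.split(".") if s.strip()]
    words = text.split()
    return -len(words) / max(len(sentences), 1)

def rerank(candidates, target_level):
    """Return the candidate whose simplicity is closest to target_level."""
    return min(candidates, key=lambda c: abs(simplicity(c) - target_level))

candidates = [
    "The committee deliberated at length before reaching its final determination.",
    "The committee talked for a long time. Then it made a decision.",
    "The committee decided.",
]
print(rerank(candidates, target_level=-5.0))
```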
We ask subjects whether they perceive a set of texts as human-produced; some of the texts are actually human-written, while others are automatically generated. We use this data to fine-tune a GPT-2 model to push it to generate more human-like texts, and observe that this fine-tuned model produces texts that are indeed perceived as more human-like than those of the original model. Contextually, we show that our automatic evaluation strategy correlates well with human judgements. We also run a linguistic analysis to unveil the characteristics of human- vs machine-perceived language.
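A minimal fine-tuning sketch along these lines (the data and hyperparameters are placeholders, not the authors' setup) continues training GPT-2 with Hugging Face transformers on texts that raters judged human-like.

```python
# Sketch: fine-tune GPT-2 on texts judged human-like by annotators.
# The two example strings are placeholders for the filtered training data.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

human_like_texts = [
    "An example sentence that annotators perceived as human-written.",
    "Another text kept because raters judged it human-like.",
]

model.train()
for text in human_like_texts:
    batch = tokenizer(text, return_tensors="pt")
    # Standard language-modeling objective: labels are the input ids.
    outputs = model(**batch, labels=batch["input_ids"])
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```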
We outline the Great Misalignment Problem in natural language processing research: the problem definition is not in line with the proposed method, and the human evaluation is in line with neither the definition nor the method. We study this misalignment problem by surveying 10 randomly sampled papers published in ACL 2020 that report results with human evaluation. Our results show that only one paper was fully in line in terms of problem definition, method, and evaluation. Only two papers presented a human evaluation that was in line with what was modeled in the method. These results highlight that the Great Misalignment Problem is a major one that affects the validity and reproducibility of results obtained through human evaluation.


