New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Foreseeing the Benefits of Incidental Supervision

يتوقع فوائد الإشراف العرضي

303 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

غالبا ما تتطلب تطبيقات العالم الواقعي نماذج محسنة عن طريق الاستفادة * مجموعة من إشارات الإشراف العرضي الرخيص . يمكن أن تشمل هذه ملصقات جزئية، ملصقات صاخبة، قيود قائمة على المعرفة، والشروح عبر المجال أو التعليق الشرح - جميعها وجود ارتباطات إحصائية مع شروح ذهبية ولكن ليس نفسها بالضبط. ومع ذلك، فإننا نفتقر حاليا إلى طريقة مبدئية لقياس فوائد هذه الإشارات إلى مهمة مستهدفة معينة، والممارسة المشتركة لتقييم هذه الفوائد هي من خلال تجارب شاملة مع نماذج مختلفة وليفرتات. تدرس هذه الورقة ما إذا كان بإمكاننا ذلك، في إطار واحد، حدد فوائد أنواع مختلفة من الإشارات العرضية لمهمة مستهدحة معينة دون ممارسة التجارب بين التجديف *. نقترح نقلا عن تدبير المعلومات الدوافع PAC-Bayesian الموحدة، PABI، الذي يميز الحد من عدم اليقين المنصوص عليه من إشارات الإشراف العرضي. نوضح فعالية PABI عن طريق تحديد القيمة المضافة من قبل أنواع مختلفة من الإشارات العرضية إلى مهام علامات التسلسل. تشير التجارب على التعرف على الكيان المسمى (NER) وإجابة السؤال (QA) أن تنبؤات Pabi ترتبط بشكل جيد مع أداء التعلم، مما يوفر طريقة واعدة لتحديد، قبل التعلم، التي ستكون إشارات الإشراف مفيدة.

Real-world applications often require improved models by leveraging a range of cheap incidental supervision signals. These could include partial labels, noisy labels, knowledge-based constraints, and cross-domain or cross-task annotations -- all having statistical associations with gold annotations but not exactly the same. However, we currently lack a principled way to measure the benefits of these signals to a given target task, and the common practice of evaluating these benefits is through exhaustive experiments with various models and hyperparameters. This paper studies whether we can, in a single framework, quantify the benefits of various types of incidental signals for a given target task without going through combinatorial experiments. We propose a unified PAC-Bayesian motivated informativeness measure, PABI, that characterizes the uncertainty reduction provided by incidental supervision signals. We demonstrate PABI's effectiveness by quantifying the value added by various types of incidental signals to sequence tagging tasks. Experiments on named entity recognition (NER) and question answering (QA) show that PABI's predictions correlate well with learning performance, providing a promising way to determine, ahead of learning, which supervision signals would be beneficial.

References used

https://aclanthology.org/

rate research

Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge

270 - Association for Computation Linguistics 2021 مقالة

The limits of applicability of vision-and language models are defined by the coverage of their training data. Tasks like vision question answering (VQA) often require commonsense and factual information beyond what can be learned from task-specific d atasets. This paper investigates the injection of knowledge from general-purpose knowledge bases (KBs) into vision-and-language transformers. We use an auxiliary training objective that encourages the learned representations to align with graph embeddings of matching entities in a KB. We empirically study the relevance of various KBs to multiple tasks and benchmarks. The technique brings clear benefits to knowledge-demanding question answering tasks (OK-VQA, FVQA) by capturing semantic and relational knowledge absent from existing models. More surprisingly, the technique also benefits visual reasoning tasks (NLVR2, SNLI-VE). We perform probing experiments and show that the injection of additional knowledge regularizes the space of embeddings, which improves the representation of lexical and semantic similarities. The technique is model-agnostic and can expand the applicability of any vision-and-language transformer with minimal computational overhead.

supplemental knowledge vision-and language models exploring المعرفة الإضافية نماذج الرؤية واللغة استكشاف صناعة حمض الفوسفور المزيد..

Avengers, Ensemble! Benefits of ensembling in grapheme-to-phoneme prediction

581 - Association for Computation Linguistics 2021 مقالة

We describe three baseline beating systems for the high-resource English-only sub-task of the SIGMORPHON 2021 Shared Task 1: a small ensemble that Dialpad's speech recognition team uses internally, a well-known off-the-shelf model, and a larger ensem ble model comprising these and others. We additionally discuss the challenges related to the provided data, along with the processing steps we took.

المعرفة اللغوية avengers المنتقمون فرقة صناعة حمض الفوسفور

Self-Training with Weak Supervision

323 - Association for Computation Linguistics 2021 مقالة

State-of-the-art deep neural networks require large-scale labeled training data that is often expensive to obtain or not available for many tasks. Weak supervision in the form of domain-specific rules has been shown to be useful in such settings to a utomatically generate weakly labeled training data. However, learning with weak rules is challenging due to their inherent heuristic and noisy nature. An additional challenge is rule coverage and overlap, where prior work on weak supervision only considers instances that are covered by weak rules, thus leaving valuable unlabeled data behind. In this work, we develop a weak supervision framework (ASTRA) that leverages all the available data for a given task. To this end, we leverage task-specific unlabeled data through self-training with a model (student) that considers contextualized representations and predicts pseudo-labels for instances that may not be covered by weak rules. We further develop a rule attention network (teacher) that learns how to aggregate student pseudo-labels with weak rule labels, conditioned on their fidelity and the underlying context of an instance. Finally, we construct a semi-supervised learning objective for end-to-end training with unlabeled data, domain-specific rules, and a small amount of labeled data. Extensive experiments on six benchmark datasets for text classification demonstrate the effectiveness of our approach with significant improvements over state-of-the-art baselines.

weak supervision weak إشراف ضعيف ضعيف صناعة حمض الفوسفور

Relevance-guided Supervision for OpenQA with ColBERT

280 - Association for Computation Linguistics 2021 مقالة

Abstract Systems for Open-Domain Question Answering (OpenQA) generally depend on a retriever for finding candidate passages in a large corpus and a reader for extracting answers from those passages. In much recent work, the retriever is a learned com ponent that uses coarse-grained vector representations of questions and passages. We argue that this modeling choice is insufficiently expressive for dealing with the complexity of natural language questions. To address this, we define ColBERT-QA, which adapts the scalable neural retrieval model ColBERT to OpenQA. ColBERT creates fine-grained interactions between questions and passages. We propose an efficient weak supervision strategy that iteratively uses ColBERT to create its own training data. This greatly improves OpenQA retrieval on Natural Questions, SQuAD, and TriviaQA, and the resulting system attains state-of-the-art extractive OpenQA performance on all three datasets.

open-domain question answering relevance-guided supervision الإجابة على سؤال المجال المفتوح الأهمية المرشد الإشراف صناعة حمض الفوسفور

Bootstrapping a Music Voice Assistant with Weak Supervision

573 - Association for Computation Linguistics 2021 مقالة

One of the first building blocks to create a voice assistant relates to the task of tagging entities or attributes in user queries. This can be particularly challenging when entities are in the tenth of millions, as is the case of e.g. music catalogs . Training slot tagging models at an industrial scale requires large quantities of accurately labeled user queries, which are often hard and costly to gather. On the other hand, voice assistants typically collect plenty of unlabeled queries that often remain unexploited. This paper presents a weakly-supervised methodology to label large amounts of voice query logs, enhanced with a manual filtering step. Our experimental evaluations show that slot tagging models trained on weakly-supervised data outperform models trained on hand-annotated or synthetic data, at a lower cost. Further, manual filtering of weakly-supervised data leads to a very significant reduction in Sentence Error Rate, while allowing us to drastically reduce human curation efforts from weeks to hours, with respect to hand-annotation of queries. The method is applied to successfully bootstrap a slot tagging system for a major music streaming service that currently serves several tens of thousands of daily voice queries.

تحسين nlu reranking. music voice assistant مساعد صوت الموسيقى مساعد الصوت صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Foreseeing the Benefits of Incidental Supervision

يتوقع فوائد الإشراف العرضي

Ask ChatGPT about the research

Read More

suggested questions