
CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization



Added by Association for Computational Linguistics (article)

Publication date: 2021

Field: Artificial Intelligence

Language: English

Created by Shamra Editor

Keywords: scalably generating benchmarks, testing VQA generalization, systematically testing VQA


One challenge in evaluating visual question answering (VQA) models in the cross-dataset adaptation setting is that the distribution shifts are multi-modal, making it difficult to identify whether shifts in visual or language features play the key role. In this paper, we propose a semi-automatic framework for generating disentangled shifts by introducing a controllable visual question-answer generation (VQAG) module that is capable of generating highly relevant and diverse question-answer pairs with the desired dataset style. We use it to create CrossVQA, a collection of test splits for assessing VQA generalization based on the VQA2, VizWiz, and Open Images datasets. We provide an analysis of our generated datasets and demonstrate their utility by using them to evaluate several state-of-the-art VQA systems. One important finding is that the visual shifts in cross-dataset VQA matter more than the language shifts. More broadly, we present a scalable framework for systematically evaluating the machine with little human intervention.

References used

https://aclanthology.org/


EaSe: A Diagnostic Tool for VQA based on Answer Diversity

Association for Computational Linguistics, 2021, article

We propose EASE, a simple diagnostic tool for Visual Question Answering (VQA) which quantifies the difficulty of an (image, question) sample. EASE is based on the pattern of answers provided by multiple annotators to a given question. In particular, it considers two aspects of the answers: (i) their Entropy; (ii) their Semantic content. First, we prove the validity of our diagnostic to identify samples that are easy/hard for state-of-the-art VQA models. Second, we show that EASE can be successfully used to select the most informative samples for training/fine-tuning. Crucially, only information that is readily available in any VQA dataset is used to compute its scores.
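To make the entropy aspect concrete, here is a minimal sketch, assuming plain Shannon entropy over the empirical distribution of annotator answers. The abstract does not state the exact formula EASE uses, and the semantic-content aspect is not covered here; the function name `answer_entropy` and the example annotations are hypothetical.

```python
import math
from collections import Counter


def answer_entropy(answers):
    """Shannon entropy (in bits) of the empirical answer distribution.

    Assumption: plain Shannon entropy, used only to illustrate the idea;
    this is not claimed to be the exact EASE score.
    """
    if not answers:
        raise ValueError("answers must be non-empty")
    # Normalize lightly so "Yes" and "yes " count as the same answer.
    counts = Counter(a.strip().lower() for a in answers)
    total = sum(counts.values())
    # H = sum_i p_i * log2(1 / p_i)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Hypothetical annotations for two (image, question) samples.
agreed = ["yes"] * 10                  # all annotators give the same answer
split = ["red"] * 5 + ["orange"] * 5   # annotators split evenly

print(answer_entropy(agreed))  # 0.0
print(answer_entropy(split))   # 1.0
```

Under this assumption, a sample on which annotators fully agree scores 0 bits, while disagreement raises the score, which matches the intuition that low-agreement samples are harder.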

Keywords: visual question answering, diagnostic tool, answer diversity

MiniVQA - A resource to build your tailored VQA competition

Association for Computational Linguistics, 2021, article

MiniVQA is a Jupyter notebook for building a tailored VQA competition for your students. The notebook creates everything needed to run a classroom competition that engages and inspires your students on the free, self-service Kaggle platform. InClass competitions make machine learning fun!

Keywords: tailored VQA competition, tailored VQA, VQA competition

Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation

Association for Computational Linguistics, 2021, article

Although pre-training models have achieved great success in dialogue generation, their performance drops dramatically when the input contains an entity that does not appear in the pre-training and fine-tuning datasets (an unseen entity). To address this issue, existing methods leverage an external knowledge base to generate appropriate responses. In real-world practice, the entity may not be included in the knowledge base, or knowledge retrieval may suffer from limited precision. To deal with this problem, instead of introducing the knowledge base as input, we force the model to learn a better semantic representation by predicting the information in the knowledge base based only on the input context. Specifically, with the help of a knowledge base, we introduce two auxiliary training objectives: 1) Interpret Masked Word, which conjectures the meaning of the masked entity given the context; 2) Hypernym Generation, which predicts the hypernym of the entity based on the context. Experimental results on two dialogue corpora verify the effectiveness of our methods under both knowledge-available and knowledge-unavailable settings.

Keywords: handling unseen entities, knowledge enhanced fine-tuning, handling unseen

Developing an Interactive Website That Contributes Systematically to Behavioral Psychotherapy for Children with Autism

Syrian Virtual University, 2014, master's thesis

Autism is defined as one of the pervasive developmental disorders, characterized by a qualitative deficit in communication skills, attention, and social interaction skills.

Keywords: autism, website development, behavioral psychotherapy

Generating Hypothetical Events for Abductive Inference

Association for Computational Linguistics, 2021, article

Abductive reasoning starts from some observations and aims at finding the most plausible explanation for these observations. To perform abduction, humans often make use of temporal and causal inferences, and knowledge about how some hypothetical situation can result in different outcomes. This work offers the first study of how such knowledge impacts the Abductive NLI task, which consists in choosing the more likely explanation for given observations. We train a specialized language model LMI that is tasked to generate what could happen next from a hypothetical scenario that evolves from a given event. We then propose a multi-task model MTL to solve the Abductive NLI task, which predicts a plausible explanation by a) considering different possible events emerging from candidate hypotheses (events generated by LMI) and b) selecting the one that is most similar to the observed outcome. We show that our MTL model improves over prior vanilla pre-trained LMs fine-tuned on Abductive NLI. Our manual evaluation and analysis suggest that learning about possible next events from different hypothetical scenarios supports abductive inference.

Keywords: abductive NLI task, abductive NLI, abductive inference
