New community

Subscribe to the gold package and get unlimited access to Shamra Academy

EaSe: A Diagnostic Tool for VQA based on Answer Diversity

سهولة: أداة تشخيصية ل VQA بناء على تنوع الإجابة

311 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visual question answering diagnostic tool answer diversity السؤال المرئي الرد أداة التشخيص الإجابة التنوع صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We propose EASE, a simple diagnostic tool for Visual Question Answering (VQA) which quantifies the difficulty of an image, question sample. EASE is based on the pattern of answers provided by multiple annotators to a given question. In particular, it considers two aspects of the answers: (i) their Entropy; (ii) their Semantic content. First, we prove the validity of our diagnostic to identify samples that are easy/hard for state-of-art VQA models. Second, we show that EASE can be successfully used to select the most-informative samples for training/fine-tuning. Crucially, only information that is readily available in any VQA dataset is used to compute its scores.

References used

https://aclanthology.org/

rate research

Generating Answer Candidates for Quizzes and Answer-Aware Question Generators

344 - Association for Computation Linguistics 2021 مقالة

In education, quiz questions have become an important tool for assessing the knowledge of students. Yet, manually preparing such questions is a tedious task, and thus automatic question generation has been proposed as a possible alternative. So far, the vast majority of research has focused on generating the question text, relying on question answering datasets with readily picked answers, and the problem of how to come up with answer candidates in the first place has been largely ignored. Here, we aim to bridge this gap. In particular, we propose a model that can generate a specified number of answer candidates for a given passage of text, which can then be used by instructors to write questions manually or can be passed as an input to automatic answer-aware question generators. Our experiments show that our proposed answer candidate generation model outperforms several baselines.

answer-aware question generators answer candidates candidates for quizzes مجلدات استجواب الإجابة أجب على المرشحين المرشحين للاطلاع على الاختبارات صناعة حمض الفوسفور المزيد..

MiniVQA - A resource to build your tailored VQA competition

559 - Association for Computation Linguistics 2021 مقالة

MiniVQA is a Jupyter notebook to build a tailored VQA competition for your students. The resource creates all the needed resources to create a classroom competition that engages and inspires your students on the free, self-service Kaggle platform. InClass competitions make machine learning fun!.

tailored vqa competition tailored vqa vqa competition منافسة VQA مصممة مخصصة VQA. مسابقة VQA صناعة حمض الفوسفور المزيد..

Semantic Answer Similarity for Evaluating Question Answering Models

338 - Association for Computation Linguistics 2021 مقالة

The evaluation of question answering models compares ground-truth annotations with model predictions. However, as of today, this comparison is mostly lexical-based and therefore misses out on answers that have no lexical overlap but are still semanti cally similar, thus treating correct answers as false. This underestimation of the true performance of models hinders user acceptance in applications and complicates a fair comparison of different models. Therefore, there is a need for an evaluation metric that is based on semantics instead of pure string similarity. In this short paper, we present SAS, a cross-encoder-based metric for the estimation of semantic answer similarity, and compare it to seven existing metrics. To this end, we create an English and a German three-way annotated evaluation dataset containing pairs of answers along with human judgment of their semantic similarity, which we release along with an implementation of the SAS metric and the experiments. We find that semantic similarity metrics based on recent transformer models correlate much better with human judgment than traditional lexical similarity metrics on our two newly created datasets and one dataset from related work.

evaluating question answering evaluating question تقييم الإجابة على السؤال تقييم السؤال صناعة حمض الفوسفور

Diversity and Consistency: Exploring Visual Question-Answer Pair Generation

249 - Association for Computation Linguistics 2021 مقالة

Although showing promising values to downstream applications, generating question and answer together is under-explored. In this paper, we introduce a novel task that targets question-answer pair generation from visual images. It requires not only ge nerating diverse question-answer pairs but also keeping the consistency of them. We study different generation paradigms for this task and propose three models: the pipeline model, the joint model, and the sequential model. We integrate variational inference into these models to achieve diversity and consistency. We also propose region representation scaling and attention alignment to improve the consistency further. We finally devise an evaluator as a quantitative metric for consistency. We validate our approach on two benchmarks, VQA2.0 and Visual-7w, by automatically and manually evaluating diversity and consistency. Experimental results show the effectiveness of our models: they can generate diverse or consistent pairs. Moreover, this task can be used to improve visual question generation and visual question answering.

exploring visual question-answer exploring visual question-answer pair generation استكشاف إجابة السؤال المرئي استكشاف البصرية جيل زوج الإجابة الإجابة صناعة حمض الفوسفور المزيد..

CombAlign: a Tool for Obtaining High-Quality Word Alignments

624 - Association for Computation Linguistics 2021 مقالة

Being able to generate accurate word alignments is useful for a variety of tasks. While statistical word aligners can work well, especially when parallel training data are plentiful, multilingual embedding models have recently been shown to give good results in unsupervised scenarios. We evaluate an ensemble method for word alignment on four language pairs and demonstrate that by combining multiple tools, taking advantage of their different approaches, substantial gains can be made. This holds for settings ranging from very low-resource to high-resource. Furthermore, we introduce a new gold alignment test set for Icelandic and a new easy-to-use tool for creating manual word alignments.

obtaining high-quality word obtaining high-quality high-quality word alignments الحصول على كلمة عالية الجودة الحصول على جودة عالية محاذاة كلمة عالية الجودة صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

EaSe: A Diagnostic Tool for VQA based on Answer Diversity

سهولة: أداة تشخيصية ل VQA بناء على تنوع الإجابة

Ask ChatGPT about the research

Read More

suggested questions