Subscribe to the gold package and get unlimited access to Shamra Academy

COSMic: A Coherence-Aware Generation Metric for Image Descriptions

الكونية: متري تدوين التماسك من أوصاف الصورة

733 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Developers of text generation models rely on automated evaluation metrics as a stand-in for slow and expensive manual evaluations. However, image captioning metrics have struggled to give accurate learned estimates of the semantic and pragmatic success of output text. We address this weakness by introducing the first discourse-aware learned generation metric for evaluating image descriptions. Our approach is inspired by computational theories of discourse for capturing information goals using coherence. We present a dataset of image--description pairs annotated with coherence relations. We then train a coherence-aware metric on a subset of the Conceptual Captions dataset and measure its effectiveness---its ability to predict human ratings of output captions---on a test set composed of out-of-domain images. We demonstrate a higher Kendall Correlation Coefficient for our proposed metric with the human judgments for the results of a number of state-of-the-art coherence-aware caption generation models when compared to several other metrics including recently proposed learned metrics such as BLEURT and BERTScore.

References used

https://aclanthology.org/

rate research

Superenergetic Cosmic Particles and Electromagnetic Waves

1202 - Aِl-Baath University 2017 ورقة بحثية

This work aims to explane and analysise some cosmic particles that reach to the earth with super high energy using the hypothesis of increasing the speed of light dated to ancient past time ,based on the energy conservation law and the mechanism of transformation between particle state and wave state.As example ,the speed of neutrin has been taken for studying.

neutrino كوني النترينو الجسيم فائق الطاقة فيزياء الفلك Cosmic super energetic particle Astro physics المزيد..

Evaluating Document Coherence Modeling

1072 - Association for Computation Linguistics 2021 مقالة

Abstract While pretrained language models (LMs) have driven impressive gains over morpho-syntactic and semantic tasks, their ability to model discourse and pragmatic phenomena is less clear. As a step towards a better understanding of their discourse modeling capabilities, we propose a sentence intrusion detection task. We examine the performance of a broad range of pretrained LMs on this detection task for English. Lacking a dataset for the task, we introduce INSteD, a novel intruder sentence detection dataset, containing 170,000+ documents constructed from English Wikipedia and CNN news articles. Our experiments show that pretrained LMs perform impressively in in-domain evaluation, but experience a substantial drop in the cross-domain setting, indicating limited generalization capacity. Further results over a novel linguistic probe dataset show that there is substantial room for improvement, especially in the cross- domain setting.

evaluating document coherence document coherence modeling coherence modeling تقييم التماسك المستند نمذجة التماسك المستند نمذجة التماسك صناعة حمض الفوسفور المزيد..

Leveraging Slot Descriptions for Zero-Shot Cross-Domain Dialogue StateTracking

1039 - Association for Computation Linguistics 2021 مقالة

Zero-shot cross-domain dialogue state tracking (DST) enables us to handle unseen domains without the expense of collecting in-domain data. In this paper, we propose a slot descriptions enhanced generative approach for zero-shot cross-domain DST. Spec ifically, our model first encodes a dialogue context and a slot with a pre-trained self-attentive encoder, and generates slot value in auto-regressive manner. In addition, we incorporate Slot Type Informed Descriptions that capture the shared information of different slots to facilitates the cross-domain knowledge transfer. Experimental results on MultiWOZ shows that our model significantly improve existing state-of-the-art results in zero-shot cross-domain setting.

zero-shot cross-domain dialogue cross-domain dialogue statetracking leveraging slot descriptions Zero-Shot الحوار عبر المجال الحوار عبر المجال الحوار الاستفادة من الأوصاف الفتحة صناعة حمض الفوسفور المزيد..

Generating Diverse Descriptions from Semantic Graphs

824 - Association for Computation Linguistics 2021 مقالة

Text generation from semantic graphs is traditionally performed with deterministic methods, which generate a unique description given an input graph. However, the generation problem admits a range of acceptable textual outputs, exhibiting lexical, sy ntactic and semantic variation. To address this disconnect, we present two main contributions. First, we propose a stochastic graph-to-text model, incorporating a latent variable in an encoder-decoder model, and its use in an ensemble. Second, to assess the diversity of the generated sentences, we propose a new automatic evaluation metric which jointly evaluates output diversity and quality in a multi-reference setting. We evaluate the models on WebNLG datasets in English and Russian, and show an ensemble of stochastic models produces diverse sets of generated sentences while, retaining similar quality to state-of-the-art models.

generating diverse descriptions semantic graphs generating diverse توليد أوصاف متنوعة الرسوم البيانية الدلالية توليد متنوع صناعة حمض الفوسفور المزيد..

Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions

713 - Association for Computation Linguistics 2021 مقالة

We study the impact of using rich and diverse textual descriptions of classes for zero-shot learning (ZSL) on ImageNet. We create a new dataset ImageNet-Wiki that matches each ImageNet class to its corresponding Wikipedia article. We show that merely employing these Wikipedia articles as class descriptions yields much higher ZSL performance than prior works. Even a simple model using this type of auxiliary data outperforms state-of-the-art models that rely on standard features of word embedding encodings of class names. These results highlight the usefulness and importance of textual descriptions for ZSL, as well as the relative importance of auxiliary data type compared to the algorithmic progress. Our experimental results also show that standard zero-shot learning approaches generalize poorly across categories of classes.

zero-shot image classification large-scale zero-shot image تصنيف صورة صفرية صورة صفرية واسعة النطاق تصنيف الصورة صناعة حمض الفوسفور

COSMic: A Coherence-Aware Generation Metric for Image Descriptions

الكونية: متري تدوين التماسك من أوصاف الصورة

Ask ChatGPT about the research

Read More

suggested questions