New community

Subscribe to the gold package and get unlimited access to Shamra Academy

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents

multionedoc2dial: حوارات النمذجة في مستندات متعددة

254 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

dialogues grounded multiple documents grounded in multiple وضع الحوارات وثائق متعددة تستند إلى عدة صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We propose MultiDoc2Dial, a new task and dataset on modeling goal-oriented dialogues grounded in multiple documents. Most previous works treat document-grounded dialogue modeling as machine reading comprehension task based on a single given document or passage. In this work, we aim to address more realistic scenarios where a goal-oriented information-seeking conversation involves multiple topics, and hence is grounded on different documents. To facilitate such task, we introduce a new dataset that contains dialogues grounded in multiple documents from four different domains. We also explore modeling the dialogue-based and document-based contexts in the dataset. We present strong baseline approaches and various experimental results, aiming to support further research efforts on such a task.

References used

https://aclanthology.org/

rate research

PLATO-KAG: Unsupervised Knowledge-Grounded Conversation via Joint Modeling

354 - Association for Computation Linguistics 2021 مقالة

Large-scale conversation models are turning to leveraging external knowledge to improve the factual accuracy in response generation. Considering the infeasibility to annotate the external knowledge for large-scale dialogue corpora, it is desirable to learn the knowledge selection and response generation in an unsupervised manner. In this paper, we propose PLATO-KAG (Knowledge-Augmented Generation), an unsupervised learning approach for end-to-end knowledge-grounded conversation modeling. For each dialogue context, the top-k relevant knowledge elements are selected and then employed in knowledge-grounded response generation. The two components of knowledge selection and response generation are optimized jointly and effectively under a balanced objective. Experimental results on two publicly available datasets validate the superiority of PLATO-KAG.

احتج knowledge-grounded conversation modeling نمذجة المحادثة المحادثة المعرفة صناعة حمض الفوسفور

DialogSum Challenge: Summarizing Real-Life Scenario Dialogues

280 - Association for Computation Linguistics 2021 مقالة

We propose a shared task on summarizing real-life scenario dialogues, DialogSum Challenge, to encourage researchers to address challenges in dialogue summarization, which has been less studied by the summarization community. Real-life scenario dialog ue summarization has a wide potential application prospect in chat-bot and personal assistant. It contains unique challenges such as special discourse structure, coreference, pragmatics, and social common sense, which require specific representation learning technologies to deal with. We carefully annotate a large-scale dialogue summarization dataset based on multiple public dialogue corpus, opening the door to all kinds of summarization models.

summarizing real-life scenario real-life scenario dialogues real-life scenario تلخيص سيناريو الحياة الحقيقية حوارات سيناريو الحياة الحقيقية سيناريو الحياة الحقيقية صناعة حمض الفوسفور المزيد..

ReadTwice: Reading Very Large Documents with Memories

293 - Association for Computation Linguistics 2021 مقالة

Knowledge-intensive tasks such as question answering often require assimilating information from different sections of large inputs such as books or article collections. We propose ReadTwice, a simple and effective technique that combines several str engths of prior approaches to model long-range dependencies with Transformers. The main idea is to read text in small segments, in parallel, summarizing each segment into a memory table to be used in a second read of the text. We show that the method outperforms models of comparable size on several question answering (QA) datasets and sets a new state of the art on the challenging NarrativeQA task, with questions about entire books.

documents with memories large documents reading very large وثائق ذات الذكريات وثائق كبيرة قراءة كبيرة جدا صناعة حمض الفوسفور المزيد..

Modeling Endorsement for Multi-Document Abstractive Summarization

542 - Association for Computation Linguistics 2021 مقالة

A crucial difference between single- and multi-document summarization is how salient content manifests itself in the document(s). While such content may appear at the beginning of a single document, essential information is frequently reiterated in a set of documents related to a particular topic, resulting in an endorsement effect that increases information salience. In this paper, we model the cross-document endorsement effect and its utilization in multiple document summarization. Our method generates a synopsis from each document, which serves as an endorser to identify salient content from other documents. Strongly endorsed text segments are used to enrich a neural encoder-decoder model to consolidate them into an abstractive summary. The method has a great potential to learn from fewer examples to identify salient content, which alleviates the need for costly retraining when the set of documents is dynamically adjusted. Through extensive experiments on benchmark multi-document summarization datasets, we demonstrate the effectiveness of our proposed method over strong published baselines. Finally, we shed light on future research directions and discuss broader challenges of this task using a case study.

modeling endorsement multi-document abstractive summarization تأييد النمذجة تلخيص مبادرة متعددة الوثائق صناعة حمض الفوسفور

Commonsense-Focused Dialogues for Response Generation: An Empirical Study

288 - Association for Computation Linguistics 2021 مقالة

Smooth and effective communication requires the ability to perform latent or explicit commonsense inference. Prior commonsense reasoning benchmarks (such as SocialIQA and CommonsenseQA) mainly focus on the discriminative task of choosing the right an swer from a set of candidates, and do not involve interactive language generation as in dialogue. Moreover, existing dialogue datasets do not explicitly focus on exhibiting commonsense as a facet. In this paper, we present an empirical study of commonsense in dialogue response generation. We first auto-extract commonsensical dialogues from existing dialogue datasets by leveraging ConceptNet, a commonsense knowledge graph. Furthermore, building on social contexts/situations in SocialIQA, we collect a new dialogue dataset with 25K dialogues aimed at exhibiting social commonsense in an interactive setting. We evaluate response generation models trained using these datasets and find that models trained on both extracted and our collected data produce responses that consistently exhibit more commonsense than baselines. Finally we propose an approach for automatic evaluation of commonsense that relies on features derived from ConceptNet and pre-trained language and dialog models, and show reasonable correlation with human evaluation of responses' commonsense quality.

empirical study commonsense response generation الفطرة السليمة توليد الاستجابة صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents

multionedoc2dial: حوارات النمذجة في مستندات متعددة

Ask ChatGPT about the research

Read More

suggested questions