Nutri-bullets Hybrid: Consensual Multi-document Summarization

Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor

consensual multi-document summarization, consensual multi-document

Abstract

We present a method for generating comparative summaries that highlight similarities and contradictions in input documents. The key challenge in creating such summaries is the lack of the large parallel training data required to train typical summarization systems. To this end, we introduce a hybrid generation approach inspired by traditional concept-to-text systems. To enable accurate comparison between different sources, the model first learns to extract pertinent relations from input documents. The content planning component identifies a subset of these relations for inclusion in the summary and then aggregates them using deterministic operators. The surface realization component lexicalizes this information using a text-infilling language model. By modeling content selection and realization separately, we can train them effectively with limited annotations. We implemented and tested the model in the domain of nutrition and health, which is rife with inconsistencies. Compared to conventional methods, our framework leads to more faithful, relevant and aggregation-sensitive summarization while being equally fluent.
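The content-planning step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Relation` fields, the "increases"/"decreases" label set, the three aggregation labels and the `<blank>` placeholder are all assumptions made for illustration. The sketch only shows how a deterministic operator might group extracted relations by what they talk about, mark agreement or contradiction across sources, and hand a skeleton to a text-infilling model.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    """One extracted relation (hypothetical schema, not from the paper)."""
    subject: str  # e.g. a food
    effect: str   # assumed label set: "increases" / "decreases"
    target: str   # e.g. a health condition
    source: str   # identifier of the document the relation came from


def plan_content(relations):
    """Group relations by (subject, target) and label each group deterministically."""
    groups = defaultdict(list)
    for rel in relations:
        groups[(rel.subject.lower(), rel.target.lower())].append(rel)

    plan = []
    for (subject, target), members in groups.items():
        effects = sorted({m.effect for m in members})
        sources = sorted({m.source for m in members})
        if len(effects) > 1:
            label = "contradiction"   # sources disagree on the effect
        elif len(sources) > 1:
            label = "agreement"       # several sources report the same effect
        else:
            label = "single-source"   # nothing to compare against
        plan.append({"subject": subject, "target": target,
                     "effects": effects, "sources": sources, "label": label})
    return plan


def infilling_template(item, blank="<blank>"):
    """Build a skeleton whose blanks a text-infilling model would fill in."""
    effects = " or ".join(item["effects"])
    return f"{blank} {item['subject']} {blank} {effects} {blank} {item['target']} {blank}"


if __name__ == "__main__":
    demo = [
        Relation("Coffee", "increases", "blood pressure", "doc1"),
        Relation("coffee", "decreases", "blood pressure", "doc2"),
        Relation("Fiber", "decreases", "cholesterol", "doc1"),
        Relation("fiber", "decreases", "cholesterol", "doc3"),
    ]
    for item in plan_content(demo):
        print(item["label"], "->", infilling_template(item))
```

Keeping the aggregation rule-based means only the relation extractor and the infilling model need training data, which is the motivation the abstract gives for modeling the two stages separately.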

References used

https://aclanthology.org/

Topic-Guided Abstractive Multi-Document Summarization

Association for Computational Linguistics, 2021 (article)

A critical point of multi-document summarization (MDS) is to learn the relations among various documents. In this paper, we propose a novel abstractive MDS model in which we represent multiple documents as a heterogeneous graph, taking semantic nodes of different granularities into account, and then apply a graph-to-sequence framework to generate summaries. Moreover, we employ a neural topic model to jointly discover latent topics that can act as cross-document semantic units, bridging different documents and providing global information to guide summary generation. Since topic extraction can be viewed as a special type of summarization that "summarizes" texts into a more abstract format, i.e., a topic distribution, we adopt a multi-task learning strategy to jointly train the topic and summarization modules so that each improves the other. Experimental results on the Multi-News dataset demonstrate that our model outperforms previous state-of-the-art MDS models on both ROUGE scores and human evaluation, while also learning high-quality topics.

topic-guided abstractive multi-document, abstractive multi-document summarization, abstractive MDS model

Extending Multi-Document Summarization Evaluation to the Interactive Setting

Association for Computational Linguistics, 2021 (article)

Allowing users to interact with multi-document summarizers is a promising direction towards improving and customizing summary results. Different ideas for interactive summarization have been proposed in previous work, but these solutions are highly divergent and incomparable. In this paper, we develop an end-to-end evaluation framework for interactive summarization, focusing on expansion-based interaction, which considers the information accumulated along a user session. Our framework includes a procedure for collecting real user sessions, as well as evaluation measures that rely on summarization standards but are adapted to reflect interaction. All of our solutions and resources are publicly available as a benchmark, allowing comparison of future developments in interactive summarization and spurring progress in its methodological evaluation. We demonstrate the use of our framework by evaluating and comparing baseline implementations that we developed for this purpose, which will serve as part of our benchmark. Our extensive experimentation and analysis motivate the proposed evaluation framework design and support its viability.

interactive setting, extending multi-document summarization, interactive summarization

Unsupervised Multi-document Summarization for News Corpus with Key Synonyms and Contextual Embeddings

Association for Computational Linguistics, 2021 (article)

Information overload is one of the challenges of consuming information from the Internet. The problem is no longer access to information; instead, the focus has shifted towards the quality of the retrieved data. Particularly in the news domain, multiple outlets report on the same news events but may differ in details. This work considers that different news outlets are more likely to differ in their writing styles and choice of words, and proposes a method to extract sentences based on their key information by focusing on the shared synonyms in each sentence. Our method also attempts to reduce redundancy through hierarchical clustering and arranges the selected sentences using the proposed orderBERT. The results show that the proposed unsupervised framework improves coverage and coherence while reducing redundancy in the generated summary. Moreover, because the dataset was obtained by automatic scraping, we also propose a data refinement method to alleviate the problem of undesirable texts that scraping produces.

unsupervised multi-document summarization, contextual embeddings, multi-document summarization

Error Analysis of using BART for Multi-Document Summarization: A Study for English and German Language

Association for Computational Linguistics, 2021 (article)

Recent research using pre-trained language models for the multi-document summarization task lacks deep investigation of potential erroneous cases and of their possible application to other languages. In this work, we apply a pre-trained language model (BART) to the multi-document summarization (MDS) task both with and without fine-tuning. We use two English datasets and one German dataset for this study. First, we reproduce the multi-document summaries for English by following one of the recent studies. Next, we show the applicability of the model to German by achieving state-of-the-art performance on German MDS. We perform an in-depth error analysis of the followed approach for both languages, which leads us to identify the most notable errors, ranging from made-up facts to topic delimitation, and to quantify the amount of extractiveness.
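One common way to quantify extractiveness is the fraction of summary n-grams copied verbatim from the source. The sketch below assumes that definition, whitespace tokenization and bigrams by default; the abstract does not say which measure the study actually uses, so the function name and parameters are illustrative only.

```python
def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def extractive_fraction(source, summary, n=2):
    """Share of summary n-grams that also occur in the source (0.0 to 1.0).

    Assumes lowercased whitespace tokenization; a real study would likely
    use a proper tokenizer and report several values of n.
    """
    source_grams = set(ngrams(source.lower().split(), n))
    summary_grams = ngrams(summary.lower().split(), n)
    if not summary_grams:
        return 0.0  # summary shorter than n tokens: nothing to measure
    copied = sum(1 for gram in summary_grams if gram in source_grams)
    return copied / len(summary_grams)


if __name__ == "__main__":
    src = "the cat sat on the mat"
    print(extractive_fraction(src, "the cat sat quietly"))
```

A value near 1.0 means the summary is mostly copied from the input; a value near 0.0 means it is mostly rephrased.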

native language models, multi-document summarization task, German language

Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters

Association for Computational Linguistics, 2021 (article)

This paper presents an efficient graph-enhanced approach to multi-document summarization (MDS) with an encoder-decoder Transformer model. This model is based on recent advances in pre-training both encoder and decoder on very large text data (Lewis et al., 2019), and it incorporates an efficient encoding mechanism (Beltagy et al., 2020) that avoids the quadratic memory growth typical of traditional Transformers. We show that this powerful combination not only scales to the large input documents commonly found when summarizing news clusters; it also enables us to process additional input in the form of auxiliary graph representations, which we derive from the multi-document clusters. We present a mechanism to incorporate such graph information into the encoder-decoder model that was pre-trained on text only. Our approach leads to significant improvements on the Multi-News dataset, with an average improvement of 1.8 ROUGE points over previous work (Li et al., 2020). We also show improvements in a transfer-only setup on the DUC-2004 dataset. The graph encodings lead to summaries that are more abstractive. Human evaluation shows that they are also more informative and factually more consistent with their input documents.
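The memory argument can be made concrete with a back-of-the-envelope count. The sketch below assumes the efficient encoder uses a sliding-window (local) attention pattern in which each token attends to a fixed number of neighbours; the window size of 512 is an illustrative choice, and global tokens and sequence-edge effects are ignored.

```python
def full_attention_entries(seq_len):
    """Attention scores per head when every token attends to every token."""
    return seq_len * seq_len


def windowed_attention_entries(seq_len, window):
    """Attention scores per head when each token attends to `window` neighbours.

    Simplified count: ignores truncated windows at the sequence edges
    and any additional global-attention tokens.
    """
    return seq_len * min(window, seq_len)


if __name__ == "__main__":
    for n in (1024, 4096, 16384):
        full = full_attention_entries(n)
        local = windowed_attention_entries(n, window=512)
        print(f"n={n}: full={full}, windowed={local}, ratio={full / local:.0f}x")
```

Doubling the input length quadruples the full-attention count but only doubles the windowed count, which is why the windowed variant scales to long multi-document inputs.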

efficiently summarizing text, multi-document clusters, efficiently summarizing
