Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Semantic Similarity Based Evaluation for Abstractive News Summarization

التقييم القائم على التشابه الدلالي لتلخيص الأخبار الجماعية

333 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

similarity based evaluation semantic similarity based based evaluation التقييم القائم على التشابه التشابه الدلالي مقرها تقييم مقرها صناعة حمض الفوسفور

visit our facebook page

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

ROUGE is a widely used evaluation metric in text summarization. However, it is not suitable for the evaluation of abstractive summarization systems as it relies on lexical overlap between the gold standard and the generated summaries. This limitation becomes more apparent for agglutinative languages with very large vocabularies and high type/token ratios. In this paper, we present semantic similarity models for Turkish and apply them as evaluation metrics for an abstractive summarization task. To achieve this, we translated the English STSb dataset into Turkish and presented the first semantic textual similarity dataset for Turkish as well. We showed that our best similarity models have better alignment with average human judgments compared to ROUGE in both Pearson and Spearman correlations.

References used

https://aclanthology.org/

rate research

Integrating Semantic Scenario and Word Relations for Abstractive Sentence Summarization

526 - Association for Computation Linguistics 2021 مقالة

Recently graph-based methods have been adopted for Abstractive Text Summarization. However, existing graph-based methods only consider either word relations or structure information, which neglect the correlation between them. To simultaneously captu re the word relations and structure information from sentences, we propose a novel Dual Graph network for Abstractive Sentence Summarization. Specifically, we first construct semantic scenario graph and semantic word relation graph based on FrameNet, and subsequently learn their representations and design graph fusion method to enhance their correlation and obtain better semantic representation for summary generation. Experimental results show our model outperforms existing state-of-the-art methods on two popular benchmark datasets, i.e., Gigaword and DUC 2004.

النص غير المدلل abstractive text summarization sentence summarization تلخيص النص المبشري تلخيص الجملة صناعة حمض الفوسفور

Evaluation Datasets for Cross-lingual Semantic Textual Similarity

449 - Association for Computation Linguistics 2021 مقالة

Semantic textual similarity (STS) systems estimate the degree of the meaning similarity between two sentences. Cross-lingual STS systems estimate the degree of the meaning similarity between two sentences, each in a different language. State-of-the-a rt algorithms usually employ a strongly supervised, resource-rich approach difficult to use for poorly-resourced languages. However, any approach needs to have evaluation data to confirm the results. In order to simplify the evaluation process for poorly-resourced languages (in terms of STS evaluation datasets), we present new datasets for cross-lingual and monolingual STS for languages without this evaluation data. We also present the results of several state-of-the-art methods on these data which can be used as a baseline for further research. We believe that this article will not only extend the current STS research to other languages, but will also encourage competition on this new evaluation data.

semantic textual similarity cross-lingual semantic textual semantic textual التشابه الدلالي النصي النص الدلالي عبر اللغات نص الدلالي صناعة حمض الفوسفور المزيد..

QuestEval: Summarization Asks for Fact-based Evaluation

658 - Association for Computation Linguistics 2021 مقالة

Summarization evaluation remains an open research problem: current metrics such as ROUGE are known to be limited and to correlate poorly with human judgments. To alleviate this issue, recent work has proposed evaluation metrics which rely on question answering models to assess whether a summary contains all the relevant information in its source document. Though promising, the proposed approaches have so far failed to correlate better than ROUGE with human judgments. In this paper, we extend previous approaches and propose a unified framework, named QuestEval. In contrast to established metrics such as ROUGE or BERTScore, QuestEval does not require any ground-truth reference. Nonetheless, QuestEval substantially improves the correlation with human judgments over four evaluation dimensions (consistency, coherence, fluency, and relevance), as shown in extensive experiments.

fact-based evaluation summarization evaluation remains human judgments تقييم الحقائق تقييم التلخصات الأحكام الإنسانية صناعة حمض الفوسفور المزيد..

Gradient-Based Adversarial Factual Consistency Evaluation for Abstractive Summarization

347 - Association for Computation Linguistics 2021 مقالة

Neural abstractive summarization systems have gained significant progress in recent years. However, abstractive summarization often produce inconsisitent statements or false facts. How to automatically generate highly abstract yet factually correct s ummaries? In this paper, we proposed an efficient weak-supervised adversarial data augmentation approach to form the factual consistency dataset. Based on the artificial dataset, we train an evaluation model that can not only make accurate and robust factual consistency discrimination but is also capable of making interpretable factual errors tracing by backpropagated gradient distribution on token embeddings. Experiments and analysis conduct on public annotated summarization and factual consistency datasets demonstrate our approach effective and reasonable.

شبكة النسخ المصقول gradient-based adversarial factual neural abstractive summarization الواقعي المقصود المستند إلى التدرج تلخيص المبشور العصبي صناعة حمض الفوسفور

A Finer-grain Universal Dialogue Semantic Structures based Model For Abstractive Dialogue Summarization

311 - Association for Computation Linguistics 2021 مقالة

Although abstractive summarization models have achieved impressive results on document summarization tasks, their performance on dialogue modeling is much less satisfactory due to the crude and straight methods for dialogue encoding. To address this question, we propose a novel end-to-end Transformer-based model FinDS for abstractive dialogue summarization that leverages Finer-grain universal Dialogue semantic Structures to model dialogue and generates better summaries. Experiments on the SAMsum dataset show that FinDS outperforms various dialogue summarization approaches and achieves new state-of-the-art (SOTA) ROUGE results. Finally, we apply FinDS to a more complex scenario, showing the robustness of our model. We also release our source code.

finer-grain universal dialogue dialogue semantic structures universal dialogue semantic الحوار العالمي غرين الحوار الهياكل الدلالية الحوار العالمي الدلالي صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Semantic Similarity Based Evaluation for Abstractive News Summarization

التقييم القائم على التشابه الدلالي لتلخيص الأخبار الجماعية

Ask ChatGPT about the research

Read More

suggested questions