Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering

نحو توليد إعادة صياغة صياغة المستندات مع إعادة كتابة الجملة وإعادة ترتيبها

514 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

paraphrase generation document-level paraphrase generation rewriting and reordering إعادة صياغة إعادة صياغة إعادة صياغة صياغة مستوى المستند إعادة كتابة وإعادة ترتيبها صناعة حمض الفوسفور

visit our facebook page

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Paraphrase generation is an important task in natural language processing. Previous works focus on sentence-level paraphrase generation, while ignoring document-level paraphrase generation, which is a more challenging and valuable task. In this paper, we explore the task of document-level paraphrase generation for the first time and focus on the inter-sentence diversity by considering sentence rewriting and reordering. We propose CoRPG (Coherence Relationship guided Paraphrase Generation), which leverages graph GRU to encode the coherence relationship graph and get the coherence-aware representation for each sentence, which can be used for re-arranging the multiple (possibly modified) input sentences. We create a pseudo document-level paraphrase dataset for training CoRPG. Automatic evaluation results show CoRPG outperforms several strong baseline models on the BERTScore and diversity scores. Human evaluation also shows our model can generate document paraphrase with more diversity and semantic preservation.

References used

https://aclanthology.org/

rate research

AESOP: Paraphrase Generation with Adaptive Syntactic Control

380 - Association for Computation Linguistics 2021 مقالة

We propose to control paraphrase generation through carefully chosen target syntactic structures to generate more proper and higher quality paraphrases. Our model, AESOP, leverages a pretrained language model and adds deliberately chosen syntactical control via a retrieval-based selection module to generate fluent paraphrases. Experiments show that AESOP achieves state-of-the-art performances on semantic preservation and syntactic conformation on two benchmark datasets with ground-truth syntactic control from human-annotated exemplars. Moreover, with the retrieval-based target syntax selection module, AESOP generates paraphrases with even better qualities than the current best model using human-annotated target syntactic parses according to human evaluation. We further demonstrate the effectiveness of AESOP to improve classification models' robustness to syntactic perturbation by data augmentation on two GLUE tasks.

adaptive syntactic control generation with adaptive control paraphrase generation السيطرة على النحوية التكيفية جيل مع التكيف التحكم في إعادة صياغة النص صناعة حمض الفوسفور المزيد..

Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation

378 - Association for Computation Linguistics 2021 مقالة

Exemplar-Guided Paraphrase Generation (EGPG) aims to generate a target sentence which conforms to the style of the given exemplar while encapsulating the content information of the source sentence. In this paper, we propose a new method with the goal of learning a better representation of the style and the content. This method is mainly motivated by the recent success of contrastive learning which has demonstrated its power in unsupervised feature extraction tasks. The idea is to design two contrastive losses with respect to the content and the style by considering two problem characteristics during training. One characteristic is that the target sentence shares the same content with the source sentence, and the second characteristic is that the target sentence shares the same style with the exemplar. These two contrastive losses are incorporated into the general encoder-decoder paradigm. Experiments on two datasets, namely QQP-Pos and ParaNMT, demonstrate the effectiveness of our proposed constrastive losses.

exemplar-guided paraphrase generation exemplar-guided paraphrase إعادة صياغة نص صياغة التركيب إعادة صياغة الترشيح صناعة حمض الفوسفور

Towards Domain-Generalizable Paraphrase Identification by Avoiding the Shortcut Learning

625 - Association for Computation Linguistics 2021 مقالة

In this paper, we investigate the Domain Generalization (DG) problem for supervised Paraphrase Identification (PI). We observe that the performance of existing PI models deteriorates dramatically when tested in an out-of-distribution (OOD) domain. We conjecture that it is caused by shortcut learning, i.e., these models tend to utilize the cue words that are unique for a particular dataset or domain. To alleviate this issue and enhance the DG ability, we propose a PI framework based on Optimal Transport (OT). Our method forces the network to learn the necessary features for all the words in the input, which alleviates the shortcut learning problem. Experimental results show that our method improves the DG ability for the PI models.

domain-generalizable paraphrase identification supervised paraphrase identification paraphrase identification التعرف على إعادة صياغة المجال تحديد الصياغة الإشراف إعادة صياغة التعريف صناعة حمض الفوسفور المزيد..

Paraphrase Generation: A Survey of the State of the Art

407 - Association for Computation Linguistics 2021 مقالة

This paper focuses on paraphrase generation,which is a widely studied natural language generation task in NLP. With the development of neural models, paraphrase generation research has exhibited a gradual shift to neural methods in the recent years. This has provided architectures for contextualized representation of an input text and generating fluent, diverseand human-like paraphrases. This paper surveys various approaches to paraphrase generation with a main focus on neural methods.

تحسين المفهوم حالة صناعة حمض الفوسفور

Knowledge-Guided Paraphrase Identification

579 - Association for Computation Linguistics 2021 مقالة

Paraphrase identification (PI), a fundamental task in natural language processing, is to identify whether two sentences express the same or similar meaning, which is a binary classification problem. Recently, BERT-like pre-trained language models hav e been a popular choice for the frameworks of various PI models, but almost all existing methods consider general domain text. When these approaches are applied to a specific domain, existing models cannot make accurate predictions due to the lack of professional knowledge. In light of this challenge, we propose a novel framework, namely , which can leverage the external unstructured Wikipedia knowledge to accurately identify paraphrases. We propose to mine outline knowledge of concepts related to given sentences from Wikipedia via BM25 model. After retrieving related outline knowledge, makes predictions based on both the semantic information of two sentences and the outline knowledge. Besides, we propose a gating mechanism to aggregate the semantic information-based prediction and the knowledge-based prediction. Extensive experiments are conducted on two public datasets: PARADE (a computer science domain dataset) and clinicalSTS2019 (a biomedical domain dataset). The results show that the proposed outperforms state-of-the-art methods.

knowledge-guided paraphrase identification knowledge-guided paraphrase تعريف إعادة صياغة المعرفة إعادة صياغة تحويل المعرفة صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering

نحو توليد إعادة صياغة صياغة المستندات مع إعادة كتابة الجملة وإعادة ترتيبها

Ask ChatGPT about the research

Read More

suggested questions