Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Supertagging-based Parsing with Linear Context-free Rewriting Systems

التحليل القائم على السوبر مع أنظمة إعادة كتابة سياق خالية من السياق

495 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

context-free rewriting systems linear context-free rewriting rewriting systems أنظمة إعادة كتابة الخالية من السياق إعادة كتابة خالية من السياق الخطي إعادة كتابة الأنظمة صناعة حمض الفوسفور

visit our facebook page

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We present the first supertagging-based parser for linear context-free rewriting systems (LCFRS). It utilizes neural classifiers and outperforms previous LCFRS-based parsers in both accuracy and parsing speed by a wide margin. Our results keep up with the best (general) discontinuous parsers, particularly the scores for discontinuous constituents establish a new state of the art. The heart of our approach is an efficient lexicalization procedure which induces a lexical LCFRS from any discontinuous treebank. We describe a modification to usual chart-based LCFRS parsing that accounts for supertagging and introduce a procedure that transforms lexical LCFRS derivations into equivalent parse trees of the original treebank. Our approach is evaluated on the English Discontinuous Penn Treebank and the German treebanks Negra and Tiger.

References used

https://aclanthology.org/

rate research

Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering

543 - Association for Computation Linguistics 2021 مقالة

Paraphrase generation is an important task in natural language processing. Previous works focus on sentence-level paraphrase generation, while ignoring document-level paraphrase generation, which is a more challenging and valuable task. In this paper , we explore the task of document-level paraphrase generation for the first time and focus on the inter-sentence diversity by considering sentence rewriting and reordering. We propose CoRPG (Coherence Relationship guided Paraphrase Generation), which leverages graph GRU to encode the coherence relationship graph and get the coherence-aware representation for each sentence, which can be used for re-arranging the multiple (possibly modified) input sentences. We create a pseudo document-level paraphrase dataset for training CoRPG. Automatic evaluation results show CoRPG outperforms several strong baseline models on the BERTScore and diversity scores. Human evaluation also shows our model can generate document paraphrase with more diversity and semantic preservation.

paraphrase generation document-level paraphrase generation rewriting and reordering إعادة صياغة إعادة صياغة إعادة صياغة صياغة مستوى المستند إعادة كتابة وإعادة ترتيبها صناعة حمض الفوسفور المزيد..

Combining Context-Free and Contextualized Representations for Arabic Sarcasm Detection and Sentiment Identification

609 - Association for Computation Linguistics 2021 مقالة

Since their inception, transformer-based language models have led to impressive performance gains across multiple natural language processing tasks. For Arabic, the current state-of-the-art results on most datasets are achieved by the AraBERT languag e model. Notwithstanding these recent advancements, sarcasm and sentiment detection persist to be challenging tasks in Arabic, given the language's rich morphology, linguistic disparity and dialectal variations. This paper proffers team SPPU-AASM's submission for the WANLP ArSarcasm shared-task 2021, which centers around the sarcasm and sentiment polarity detection of Arabic tweets. The study proposes a hybrid model, combining sentence representations from AraBERT with static word vectors trained on Arabic social media corpora. The proposed system achieves a F1-sarcastic score of 0.62 and a F-PN score of 0.715 for the sarcasm and sentiment detection tasks, respectively. Simulation results show that the proposed system outperforms multiple existing approaches for both the tasks, suggesting that the amalgamation of context-free and context-dependent text representations can help capture complementary facets of word meaning in Arabic. The system ranked second and tenth in the respective sub-tasks of sarcasm detection and sentiment identification.

أرابيرت عائلية النموذج contextualized representations context-free and contextualized تمثيلات السياق خالية من السياق والسياق صناعة حمض الفوسفور

Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting

357 - Association for Computation Linguistics 2021 مقالة

This paper details experiments we performed on the Universal Dependencies 2.7 corpora in order to investigate the dominant word order in the available languages. For this purpose, we used a graph rewriting tool, GREW, which allowed us to go beyond th e surface annotations and identify the implicit subjects. We first measured the distribution of the six different word orders (SVO, SOV, VSO, VOS, OVS, OSV) in the corpora and investigated when there was a significant difference in the corpora within a given language. Then, we compared the obtained results with information provided in the WALS database (Dryer and Haspelmath, 2013) and in ( ̈Ostling, 2015). Finally, we examined the impact of using a graph rewriting tool for this task. The tools and resources used for this research are all freely available.

dominant word order investigating dominant word universal dependencies كلمة مهيمنة أمر التحقيق في كلمة مهيمنة التبعيات العالمية صناعة حمض الفوسفور المزيد..

PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols

384 - Association for Computation Linguistics 2021 مقالة

Probabilistic context-free grammars (PCFGs) with neural parameterization have been shown to be effective in unsupervised phrase-structure grammar induction. However, due to the cubic computational complexity of PCFG representation and parsing, previo us approaches cannot scale up to a relatively large number of (nonterminal and preterminal) symbols. In this work, we present a new parameterization form of PCFGs based on tensor decomposition, which has at most quadratic computational complexity in the symbol number and therefore allows us to use a much larger number of symbols. We further use neural parameterization for the new form to improve unsupervised parsing performance. We evaluate our model across ten languages and empirically demonstrate the effectiveness of using more symbols.

inducing probabilistic context-free probabilistic context-free grammars inducing probabilistic حث خالية من السياق الاحتمالية قواعد النحوية الخالية من السياق حث الاحتمالية صناعة حمض الفوسفور المزيد..

Graph Rewriting for Enhanced Universal Dependencies

684 - Association for Computation Linguistics 2021 مقالة

This paper describes a system proposed for the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies (EUD). We propose a Graph Rewriting based system for computing Enhanced Universal Dependencies, given the Basic Universal Dependencies (UD).

المهام المشتركة EUD basic universal dependencies التبعيات العالمية الأساسية صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Supertagging-based Parsing with Linear Context-free Rewriting Systems

التحليل القائم على السوبر مع أنظمة إعادة كتابة سياق خالية من السياق

Ask ChatGPT about the research

Read More

suggested questions