New community

Subscribe to the gold package and get unlimited access to Shamra Academy

On the Evolution of Word Order

على تطور أمر كلمة

206 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Most natural languages have a predominant or fixed word order. For example in English the word order is usually Subject-Verb-Object. This work attempts to explain this phenomenon as well as other typological findings regarding word order from a functional perspective. In particular, we examine whether fixed word order provides a functional advantage, explaining why these languages are prevalent. To this end, we consider an evolutionary model of language and demonstrate, both theoretically and using genetic algorithms, that a language with a fixed word order is optimal. We also show that adding information to the sentence, such as case markers and noun-verb distinction, reduces the need for fixed word order, in accordance with the typological findings.

References used

https://aclanthology.org/

rate research

Modeling the Evolution of Word Senses with Force-Directed Layouts of Co-occurrence Networks

342 - Association for Computation Linguistics 2021 مقالة

Languages evolve over time and the meaning of words can shift. Furthermore, individual words can have multiple senses. However, existing language models often only reflect one word sense per word and do not reflect semantic changes over time. While t here are language models that can either model semantic change of words or multiple word senses, none of them cover both aspects simultaneously. We propose a novel force-directed graph layout algorithm to draw a network of frequently co-occurring words. In this way, we are able to use the drawn graph to visualize the evolution of word senses. In addition, we hope that jointly modeling semantic change and multiple senses of words results in improvements for the individual tasks.

co-occurrence networks word senses layouts of co-occurrence شبكات حدوث مشتركة حواس كلمة تخطيطات التعاون صناعة حمض الفوسفور المزيد..

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little

291 - Association for Computation Linguistics 2021 مقالة

A possible explanation for the impressive performance of masked language model (MLM) pre-training is that such models have learned to represent the syntactic structures prevalent in classical NLP pipelines. In this paper, we propose a different expla nation: MLMs succeed on downstream tasks almost entirely due to their ability to model higher-order word co-occurrence statistics. To demonstrate this, we pre-train MLMs on sentences with randomly shuffled word order, and show that these models still achieve high accuracy after fine-tuning on many downstream tasks---including tasks specifically designed to be challenging for models that ignore word order. Our models perform surprisingly well according to some parametric syntactic probes, indicating possible deficiencies in how we test representations for syntactic information. Overall, our results show that purely distributional information largely explains the success of pre-training, and underscore the importance of curating challenging evaluation datasets that require deeper linguistic knowledge.

معلومات بايزي المتبادلة word matters pre-training order word matters كلمة الأمور قبل التدريب طلب كلمة الأمور صناعة حمض الفوسفور

Overcoming Poor Word Embeddings with Word Definitions

457 - Association for Computation Linguistics 2021 مقالة

Modern natural language understanding models depend on pretrained subword embeddings, but applications may need to reason about words that were never or rarely seen during pretraining. We show that examples that depend critically on a rarer word are more challenging for natural language inference models. Then we explore how a model could learn to use definitions, provided in natural text, to overcome this handicap. Our model's understanding of a definition is usually weaker than a well-modeled word embedding, but it recovers most of the performance gap from using a completely untrained word.

overcoming poor word poor word embeddings overcoming poor التغلب على كلمة سيئة سوء الكلمة embeddings. التغلب على الفقراء صناعة حمض الفوسفور المزيد..

Learning grounded word meaning representations on similarity graphs

567 - Association for Computation Linguistics 2021 مقالة

This paper introduces a novel approach to learn visually grounded meaning representations of words as low-dimensional node embeddings on an underlying graph hierarchy. The lower level of the hierarchy models modality-specific word representations, co nditioned to another modality, through dedicated but communicating graphs, while the higher level puts these representations together on a single graph to learn a representation jointly from both modalities. The topology of each graph models similarity relations among words, and is estimated jointly with the graph embedding. The assumption underlying this model is that words sharing similar meaning correspond to communities in an underlying graph in a low-dimensional space. We named this model Hierarchical Multi-Modal Similarity Graph Embedding (HM-SGE). Experimental results validate the ability of HM-SGE to simulate human similarity judgments and concept categorization, outperforming the state of the art.

learning grounded word grounded meaning representations learning grounded تعلم الكلمة المحددة تعني المعنى المحدد تعلم الأساس صناعة حمض الفوسفور المزيد..

Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting

303 - Association for Computation Linguistics 2021 مقالة

This paper details experiments we performed on the Universal Dependencies 2.7 corpora in order to investigate the dominant word order in the available languages. For this purpose, we used a graph rewriting tool, GREW, which allowed us to go beyond th e surface annotations and identify the implicit subjects. We first measured the distribution of the six different word orders (SVO, SOV, VSO, VOS, OVS, OSV) in the corpora and investigated when there was a significant difference in the corpora within a given language. Then, we compared the obtained results with information provided in the WALS database (Dryer and Haspelmath, 2013) and in ( ̈Ostling, 2015). Finally, we examined the impact of using a graph rewriting tool for this task. The tools and resources used for this research are all freely available.

dominant word order investigating dominant word universal dependencies كلمة مهيمنة أمر التحقيق في كلمة مهيمنة التبعيات العالمية صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

On the Evolution of Word Order

على تطور أمر كلمة

Ask ChatGPT about the research

Read More

suggested questions