Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Automatic Discrimination between Inherited and Borrowed Latin Words in Romance Languages

التمييز التلقائي بين الكلمات اللاتينية الموروثة والمتوسطة في اللغات الرومانسية

260 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

In this paper, we address the problem of automatically discriminating between inherited and borrowed Latin words. We introduce a new dataset and investigate the case of Romance languages (Romanian, Italian, French, Spanish, Portuguese and Catalan), where words directly inherited from Latin coexist with words borrowed from Latin, and explore whether automatic discrimination between them is possible. Having entered the language at a later stage, borrowed words are no longer subject to historical sound shift rules, hence they are presumably less eroded, which is why we expect them to have a different intrinsic structure distinguishable by computational means. We employ several machine learning models to automatically discriminate between inherited and borrowed words and compare their performance with various feature sets. We analyze the models' predictive power on two versions of the datasets, orthographic and phonetic. We also investigate whether prior knowledge of the etymon provides better results, employing n-gram character features extracted from the word-etymon pairs and from their alignment.

References used

https://aclanthology.org/

rate research

Cultural and Geographical Influences on Image Translatability of Words across Languages

307 - Association for Computation Linguistics 2021 مقالة

Neural Machine Translation (NMT) models have been observed to produce poor translations when there are few/no parallel sentences to train the models. In the absence of parallel data, several approaches have turned to the use of images to learn transl ations. Since images of words, e.g., horse may be unchanged across languages, translations can be identified via images associated with words in different languages that have a high degree of visual similarity. However, translating via images has been shown to improve upon text-only models only marginally. To better understand when images are useful for translation, we study image translatability of words, which we define as the translatability of words via images, by measuring intra- and inter-cluster similarities of image representations of words that are translations of each other. We find that images of words are not always invariant across languages, and that language pairs with shared culture, meaning having either a common language family, ethnicity or religion, have improved image translatability (i.e., have more similar images for similar words) compared to its converse, regardless of their geographic proximity. In addition, in line with previous works that show images help more in translating concrete words, we found that concrete words have improved image translatability compared to abstract ones.

geographical influences cultural and geographical التأثيرات الجغرافية الثقافية والجغرافية كلمات صناعة حمض الفوسفور

Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions

257 - Association for Computation Linguistics 2021 مقالة

This work proposes an extensive analysis of the Transformer architecture in the Neural Machine Translation (NMT) setting. Focusing on the encoder-decoder attention mechanism, we prove that attention weights systematically make alignment errors by rel ying mainly on uninformative tokens from the source sequence. However, we observe that NMT models assign attention to these tokens to regulate the contribution in the prediction of the two contexts, the source and the prefix of the target sequence. We provide evidence about the influence of wrong alignments on the model behavior, demonstrating that the encoder-decoder attention mechanism is well suited as an interpretability method for NMT. Finally, based on our analysis, we propose methods that largely reduce the word alignment error rate compared to standard induced alignments from attention weights.

nmt fail aligning fail aligning words transformer nmt fail NMT تفشل محاذاة تفشل محاذاة الكلمات محول NMT فشل صناعة حمض الفوسفور المزيد..

Tracking Semantic Change in Cognate Sets for English and Romance Languages

453 - Association for Computation Linguistics 2021 مقالة

Semantic divergence in related languages is a key concern of historical linguistics. We cross-linguistically investigate the semantic divergence of cognate pairs in English and Romance languages, by means of word embeddings. To this end, we introduce a new curated dataset of cognates in all pairs of those languages. We describe the types of errors that occurred during the automated cognate identification process and manually correct them. Additionally, we label the English cognates according to their etymology, separating them into two groups: old borrowings and recent borrowings. On this curated dataset, we analyse word properties such as frequency and polysemy, and the distribution of similarity scores between cognate sets in different languages. We automatically identify different clusters of English cognates, setting a new direction of research in cognates, borrowings and possibly false friends analysis in related languages.

tracking semantic change romance languages تتبع التغيير الدلالي لغات الرومانسية صناعة حمض الفوسفور

The Effectiveness Discriminating of the Part One of the Adaptive Behavior Scale in Discrimination Between the Ages - Field Study One Sample of Children in Damascus Governorate Schools

1129 - Aِl-Baath University 2017 ورقة بحثية

This research aimed to identify the discrimination capacity of the part one of AAMR adaptive behavior scale in discrimination between children from different ages, through studying the differences between children's performance who including in th e research, where the research sample consisted of (490) children aged (11-15) year, they are students in the second stage basic teaching students in Damascus governorate schools.

الفاعلية التحليل التمييزي السلوك التكيفي Effective Discriminating Analysis Adaptive Behavior

The Effectiveness Discriminating of the Part One of the Adaptive Behavior Scale in Discrimination Between the Ages - Field Study One Sample of Children in Damascus Governorate Schools

1260 - Aِl-Baath University 2017 ورقة بحثية

This research aimed to identify the discrimination capacity of the part one of AAMR adaptive behavior scale in discrimination between children from different ages, through studying the differences between children's performance who including in t he research, where the research sample consisted of (490) children aged (11-15) year, they are students in the second stage basic teaching students in Damascus governorate schools.

الفاعلية التحليل التمييزي السلوك التكيفي Effective Discriminating Analysis Adaptive Behavior

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Automatic Discrimination between Inherited and Borrowed Latin Words in Romance Languages

التمييز التلقائي بين الكلمات اللاتينية الموروثة والمتوسطة في اللغات الرومانسية

Ask ChatGPT about the research

Read More

suggested questions