Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Neural Borrowing Detection with Monolingual Lexical Models

اكتشاف الاقتراض العصبي مع نماذج معجمية أحادية

801 1 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

lexical borrowings lexical borrowing detection القروض المعجمية lexical. اكتشاف الاقتراض صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Identification of lexical borrowings, transfer of words between languages, is an essential practice of historical linguistics and a vital tool in analysis of language contact and cultural events in general. We seek to improve tools for automatic detection of lexical borrowings, focusing here on detecting borrowed words from monolingual wordlists. Starting with a recurrent neural lexical language model and competing entropies approach, we incorporate a more current Transformer based lexical model. From there we experiment with several different models and approaches including a lexical donor model with augmented wordlist. The Transformer model reduces execution time and minimally improves borrowing detection. The augmented donor model shows some promise. A substantive change in approach or model is needed to make significant gains in identification of lexical borrowings.

References used

https://aclanthology.org/

rate research

Neural Metaphor Detection with Visibility Embeddings

580 - Association for Computation Linguistics 2021 مقالة

We present new results for the problem of sequence metaphor labeling, using the recently developed Visibility Embeddings. We show that concatenating such embeddings to the input of a BiLSTM obtains consistent and significant improvements at almost no cost, and we present further improved results when visibility embeddings are combined with BERT.

neural metaphor detection metaphor detection visibility embeddings اكتشاف الاستعارة العصبي اكتشاف الاستعارة وظائف الرؤية صناعة حمض الفوسفور المزيد..

Tribrid: Stance Classification with Neural Inconsistency Detection

951 - Association for Computation Linguistics 2021 مقالة

We study the problem of performing automatic stance classification on social media with neural architectures such as BERT. Although these architectures deliver impressive results, their level is not yet comparable to the one of humans and they might produce errors that have a significant impact on the downstream task (e.g., fact-checking). To improve the performance, we present a new neural architecture where the input also includes automatically generated negated perspectives over a given claim. The model is jointly learned to make simultaneously multiple predictions, which can be used either to improve the classification of the original perspective or to filter out doubtful predictions. In the first case, we propose a weakly supervised method for combining the predictions into a final one. In the second case, we show that using the confidence scores to remove doubtful predictions allows our method to achieve human-like performance over the retained information, which is still a sizable part of the original input.

neural inconsistency detection inconsistency detection neural inconsistency كشف عدم التناقض العصبي الكشف عن التناقض عدم التناقض العصبي صناعة حمض الفوسفور المزيد..

Automatically Exposing Problems with Neural Dialog Models

586 - Association for Computation Linguistics 2021 مقالة

Neural dialog models are known to suffer from problems such as generating unsafe and inconsistent responses. Even though these problems are crucial and prevalent, they are mostly manually identified by model designers through interactions. Recently, some research instructs crowdworkers to goad the bots into triggering such problems. However, humans leverage superficial clues such as hate speech, while leaving systematic problems undercover. In this paper, we propose two methods including reinforcement learning to automatically trigger a dialog model into generating problematic responses. We show the effect of our methods in exposing safety and contradiction issues with state-of-the-art dialog models.

neural dialog models neural dialog dialog models نماذج الحوار العصبي مربع الحوار العصبي نماذج الحوار صناعة حمض الفوسفور المزيد..

Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus

557 - Association for Computation Linguistics 2021 مقالة

The development of automated approaches to linguistic acceptability has been greatly fostered by the availability of the English CoLA corpus, which has also been included in the widely used GLUE benchmark. However, this kind of research for languages other than English, as well as the analysis of cross-lingual approaches, has been hindered by the lack of resources with a comparable size in other languages. We have therefore developed the ItaCoLA corpus, containing almost 10,000 sentences with acceptability judgments, which has been created following the same approach and the same steps as the English one. In this paper we describe the corpus creation, we detail its content, and we present the first experiments on this new resource. We compare in-domain and out-of-domain classification, and perform a specific evaluation of nine linguistic phenomena. We also present the first cross-lingual experiments, aimed at assessing whether multilingual transformer-based approaches can benefit from using sentences in two languages during fine-tuning.

italian cola corpus english cola corpus italian cola الكولا الإيطالي كولبا كولا سوربوس الكولا الإيطالي صناعة حمض الفوسفور المزيد..

Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead?

564 - Association for Computation Linguistics 2021 مقالة

Most work in NLP makes the assumption that it is desirable to develop solutions in the native language in question. There is consequently a strong trend towards building native language models even for low-resource languages. This paper questions thi s development, and explores the idea of simply translating the data into English, thereby enabling the use of pretrained, and large-scale, English language models. We demonstrate empirically that a large English language model coupled with modern machine translation outperforms native language models in most Scandinavian languages. The exception to this is Finnish, which we assume is due to inferior translation quality. Our results suggest that machine translation is a mature technology, which raises a serious counter-argument for training native language models for low-resource languages. This paper therefore strives to make a provocative but important point. As English language models are improving at an unprecedented pace, which in turn improves machine translation, it is from an empirical and environmental stand-point more effective to translate data from low-resource languages into English, than to build language models for such languages.

english language models native language models نماذج اللغة الإنجليزية نماذج اللغة الأم صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Neural Borrowing Detection with Monolingual Lexical Models

اكتشاف الاقتراض العصبي مع نماذج معجمية أحادية

Ask ChatGPT about the research

Read More

suggested questions