This paper presents a pilot study on the automatic linguistic preprocessing of Ancient and Byzantine Greek, and morphological analysis more specifically. To this end, a novel subword-based BERT language model was trained on a varied corpus of Modern, Ancient and Post-classical Greek texts. Subsequently, the obtained BERT embeddings were used to train a fine-grained Part-of-Speech tagger for Ancient and Byzantine Greek. In addition, a corpus of Greek Epigrams was manually annotated and the resulting gold standard was used to evaluate the performance of the morphological analyser on Byzantine Greek. The experimental results show very good perplexity scores (4.9) for the BERT language model and state-of-the-art performance for the fine-grained Part-of-Speech tagger on in-domain data (treebanks containing a mixture of Classical and Medieval Greek), as well as on the newly created Byzantine Greek gold-standard data set. The language models and associated code are made available for use at https://github.com/pranaydeeps/Ancient-Greek-BERT
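As a minimal illustration of how such subword embeddings might be loaded and handed to a downstream tagger, the sketch below uses the Hugging Face transformers library; the hub identifier is an assumption based on the linked repository, and the exact checkpoint name should be taken from there.

```python
# Minimal sketch: load an (assumed) hub checkpoint of the Ancient Greek BERT
# and extract subword embeddings that a downstream PoS tagger could consume.
import torch
from transformers import AutoTokenizer, AutoModel

MODEL_ID = "pranaydeeps/Ancient-Greek-BERT"  # assumed hub name; see the linked repository

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModel.from_pretrained(MODEL_ID)

sentence = "μῆνιν ἄειδε θεὰ Πηληϊάδεω Ἀχιλῆος"  # toy input (Iliad 1.1)
inputs = tokenizer(sentence, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# One contextual embedding per subword token; a fine-grained PoS tagger would
# typically add a classification layer on top of these vectors.
embeddings = outputs.last_hidden_state  # shape: (1, num_subwords, hidden_size)
print(embeddings.shape)
```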
Definition modelling is the task of automatically generating a dictionary-style definition given a target word. In this paper, we consider cross-lingual definition generation. Specifically, we generate English definitions for Wolastoqey (Malecite-Passamaquoddy) words. Wolastoqey is an endangered, low-resource polysynthetic language. We hypothesize that sub-word representations based on byte pair encoding (Sennrich et al., 2016) can be leveraged to represent morphologically complex Wolastoqey words and overcome the challenge of not having large corpora available for training. Our experimental results demonstrate that this approach outperforms baseline methods in terms of BLEU score.
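To make the byte pair encoding idea concrete, here is a small sketch using the Hugging Face tokenizers library; the paper itself follows Sennrich et al.'s (2016) formulation, and the toy corpus, vocabulary size, and word forms below are placeholders rather than the authors' actual data or toolkit.

```python
# Illustrative byte pair encoding (BPE) training and segmentation with `tokenizers`;
# all word forms below are placeholders standing in for Wolastoqey data.
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.trainers import BpeTrainer
from tokenizers.pre_tokenizers import Whitespace

# Tiny in-memory stand-in for a training corpus (real data would be far larger).
toy_corpus = [
    "wikuwam wikuwamok wikuwamol",
    "wikuwamok wikuwamol wikuwam",
]

tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()
trainer = BpeTrainer(vocab_size=60, special_tokens=["[UNK]"])
tokenizer.train_from_iterator(toy_corpus, trainer=trainer)

# A long, morphologically complex word is split into reusable subword units,
# so it can still be represented even if the full form never occurred in training.
print(tokenizer.encode("wikuwamol").tokens)
```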
There is a shortage of high-quality corpora for South-Slavic languages. Such corpora are useful to computer scientists and researchers in the social sciences and humanities alike, for numerous linguistic, content analysis, and natural language processing applications. This paper presents a workflow for mining Wikipedia content and processing it into linguistically processed corpora, applied to the Bosnian, Bulgarian, Croatian, Macedonian, Serbian, Serbo-Croatian and Slovenian Wikipedias. We make the resulting seven corpora publicly available. We showcase these corpora by comparing the content of the underlying Wikipedias, our assumption being that the content of the Wikipedias broadly reflects the interests in various topics in these Balkan nations. We perform the content comparison using topic modelling algorithms and various distribution comparisons. The results show that all Wikipedias are topically rather similar, with all of them covering art, culture, and literature, whereas they differ in geography, politics, history and science.
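The abstract does not name a specific topic-modelling toolkit; as one possible illustration of the comparison step, the sketch below fits an LDA model with gensim on toy documents and reads off per-document topic distributions, which could then be aggregated per Wikipedia and compared with a distributional distance measure.

```python
# Sketch of the topic-modelling comparison step with gensim LDA; gensim is only an
# illustrative choice, and the toy documents stand in for tokenised Wikipedia articles.
from gensim.corpora import Dictionary
from gensim.models import LdaModel

# Toy stand-ins for two tokenised Wikipedia corpora (e.g. Slovenian and Croatian).
corpus_a = [["art", "culture", "literature"], ["history", "politics"]]
corpus_b = [["geography", "science"], ["art", "music"]]

dictionary = Dictionary(corpus_a + corpus_b)
bows = [dictionary.doc2bow(doc) for doc in corpus_a + corpus_b]

lda = LdaModel(bows, id2word=dictionary, num_topics=2, passes=10, random_state=0)

# Per-document topic distributions can be aggregated per Wikipedia and then compared.
for doc in bows:
    print(lda.get_document_topics(doc, minimum_probability=0.0))
```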
People use online forums either to look for information or to contribute it. Because of their growing popularity, certain online forums have been created specifically to provide support, assistance, and opinions for people suffering from mental illness. Depression is one of the most frequent psychological illnesses worldwide, and people increasingly turn to online forums to find answers about their condition. However, there is no mechanism to measure the severity of depression in each post and to give higher importance to those who are diagnosed as more severely depressed. Although numerous studies based on online forum data have addressed the identification of depression, the severity of depression is rarely explored. In addition, the absence of datasets will stymie the development of novel diagnostic procedures for practitioners. In this study, we offer a dataset to support research on depression severity evaluation. The computational approach presented here, which automatically identifies the severity of depression, is quite novel. This detailed measurement of depression severity in online forum posts is nonetheless needed to ensure that the measurement scales used in our research meet the expected norms of scientific research.
The present work aims at assigning a complexity score between 0 and 1 to a target word or phrase in a given sentence. For each Single Word Target, a Random Forest Regressor is trained on a feature set consisting of lexical, semantic, and syntactic information about the target. For each Multiword Target, a set of individual word features is taken along with the single-word complexities in the feature space. The system yielded Pearson correlations of 0.7402 and 0.8244 on the test set for the Single and Multiword Targets, respectively.
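A minimal sketch of the Single Word Target setup follows, using scikit-learn's RandomForestRegressor on a toy feature matrix; the actual lexical, semantic, and syntactic features and the data are of course richer than the placeholders shown here.

```python
# Minimal sketch: a Random Forest Regressor over a small toy feature vector,
# evaluated with the paper's metric (Pearson correlation on a held-out set).
import numpy as np
from scipy.stats import pearsonr
from sklearn.ensemble import RandomForestRegressor

# Toy feature rows: [word length, log frequency, syllable count] -- placeholders only.
X_train = np.array([[4, 5.2, 1], [11, 1.3, 4], [7, 3.8, 2], [13, 0.9, 5], [6, 4.4, 2]])
y_train = np.array([0.10, 0.80, 0.35, 0.90, 0.20])  # gold complexity scores in [0, 1]
X_test = np.array([[5, 4.9, 2], [12, 1.1, 4], [8, 3.0, 3]])
y_test = np.array([0.15, 0.85, 0.40])

model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
preds = model.predict(X_test)

print(pearsonr(y_test, preds)[0])
```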
Scripts capture commonsense knowledge about everyday activities and their participants. Script knowledge has proved useful in a number of NLP tasks, such as referent prediction, discourse classification, and story generation. A crucial step for the exploitation of script knowledge is script parsing, the task of tagging a text with the events and participants from a certain activity. This task is challenging: it requires information both about the ways events and participants are usually expressed in surface language and about the order in which they occur in the world. We show how to do accurate script parsing with a hierarchical sequence model and transfer learning. Our model improves the state of the art of event parsing by over 16 F-score points and, for the first time, accurately tags script participants.
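To make the tagging formulation concrete, the sketch below frames script parsing as token classification over an illustrative event/participant tag set; this flat setup is only an approximation and does not reproduce the hierarchical sequence model or the transfer-learning scheme described in the paper.

```python
# Sketch: script parsing framed as token classification (events and participants).
# The tag set and base model are illustrative choices, not the paper's exact setup.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

labels = ["O", "B-EVENT", "I-EVENT", "B-PARTICIPANT", "I-PARTICIPANT"]

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-uncased", num_labels=len(labels)
)

text = "She boarded the train and showed her ticket to the conductor."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Before fine-tuning the predictions are meaningless; with script-annotated training
# data, each subword would receive an event or participant tag.
predicted = logits.argmax(dim=-1)[0].tolist()
print([labels[i] for i in predicted])
```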
We present the ongoing NorLM initiative to support the creation and use of very large contextualised language models for Norwegian (and in principle other Nordic languages), including a ready-to-use software environment, as well as an experience report for data preparation and training. This paper introduces the first large-scale monolingual language models for Norwegian, based on both the ELMo and BERT frameworks. In addition to detailing the training process, we present contrastive benchmark results on a suite of NLP tasks for Norwegian. For additional background and access to the data, models, and software, please see: http://norlm.nlpl.eu
This research discusses road rehabilitation using foamed bitumen, which is a relatively new method that has not yet been used in Syria due to the lack of the necessary laboratory equipment. The main aim of the research is therefore to develop a model to simulate the laboratory experiments needed to evaluate the performance of the rehabilitated pavement (surface deflection and strain). The model was developed using the finite element method with the help of the ABAQUS modelling software.
This study concentrates on driven piles in sandy soils, studying and inspecting this type of pile using scaled-down laboratory models under conditions similar to field conditions, and comparing the research results with actual load tests.
There are many formal methods for testing whether security protocols are safe or not, including Avispa, Casper, ProVerif, and Scyther. A previous comparison used two of these methods (ProVerif and Scyther). In this research, all four methods are compared in terms of the same parameters used in the previous comparison: working style, modelling language, user interface, input, and output. As a result, the user is provided with options for choosing the appropriate method depending on the desired parameter. Six different security protocols have been tested and the results compared; these protocols are the Kao Chow Authentication Protocol, the 3-D Secure Protocol, the Needham-Schroeder Public Key Protocol, Diffie–Hellman key exchange, the Andrew Secure RPC Protocol, and the Challenge Handshake Authentication Protocol.