تقدم هذه الورقة النظام الذي قدمناه إلى المهمة المشتركة التي نقدمها لأول المهمة المشتركة (LCP) 2021. توفر المهمة المشتركة للمشاركين مع مجموعة بيانات جديدة باللغة الإنجليزية تتضمن سياق الكلمة المستهدفة.نحن نشارك في المهمة الفرعية للتنبؤ بكلمة واحدة والتركيز على هندسة الميزة.يتم تدريب أفضل نظامنا على الميزات اللغوية و Adgeddings Word (درجة بيرسون من 0.7942).ومع ذلك، نوضح أن مجموعة ميزة أبسط تحقق نتائج مماثلة وتقديم نموذج تدرب على 36 ميزات لغوية (درجة بيرسون من 0.7925).
This paper presents the system we submitted to the first Lexical Complexity Prediction (LCP) Shared Task 2021. The Shared Task provides participants with a new English dataset that includes context of the target word. We participate in the single-word complexity prediction sub-task and focus on feature engineering. Our best system is trained on linguistic features and word embeddings (Pearson's score of 0.7942). We demonstrate, however, that a simpler feature set achieves comparable results and submit a model trained on 36 linguistic features (Pearson's score of 0.7925).
References used
https://aclanthology.org/
This paper describes our contribution to SemEval 2021 Task 1 (Shardlow et al., 2021): Lexical Complexity Prediction. In our approach, we leverage the ELECTRA model and attempt to mirror the data annotation scheme. Although the task is a regression ta
Lexical complexity plays an important role in reading comprehension. lexical complexity prediction (LCP) can not only be used as a part of Lexical Simplification systems, but also as a stand-alone application to help people better reading. This paper
In this contribution, we describe the system presented by the PolyU CBS-Comp Team at the Task 1 of SemEval 2021, where the goal was the estimation of the complexity of words in a given sentence context. Our top system, based on a combination of lexic
This paper describes a system submitted by team BigGreen to LCP 2021 for predicting the lexical complexity of English words in a given context. We assemble a feature engineering-based model with a deep neural network model founded on BERT. While BERT
Evaluating the complexity of a target word in a sentential context is the aim of the Lexical Complexity Prediction task at SemEval-2021. This paper presents the system created to assess single words lexical complexity, combining linguistic and psycho