نقدم نهجنا في التنبؤ بالتعقيد المعجمي للكلمات في سياقات محددة، على النحو الذي أدخلته المهمة المشتركة LCP 1 في Semeval 2021. يتكون النهج من الجمل الفاصلة إلى قطع أصغر، وتضمينها مع SENT2VEC، وتقليل المدينات إلى متجه أبسط يستخدم كمدخلإلى شبكة عصبية، هذا الأخير للتنبؤ بعقد الكلمات والتعبيرات.تشير النتائج إلى أن تضيير الجملة المدربة مسبقا غير قادرة على التقاط التعقيد المعجمي من اللغة عند تطبيقها في تطبيقات عبر المجال.
We present our approach to predicting lexical complexity of words in specific contexts, as entered LCP Shared Task 1 at SemEval 2021. The approach consists of separating sentences into smaller chunks, embedding them with Sent2Vec, and reducing the embeddings into a simpler vector used as input to a neural network, the latter for predicting the complexity of words and expressions. Results show that the pre-trained sentence embeddings are not able to capture lexical complexity from the language when applied in cross-domain applications.
References used
https://aclanthology.org/
In this paper, we present three supervised systems for English lexical complexity prediction of single and multiword expressions for SemEval-2021 Task 1. We explore the use of statistical baseline features, masked language models, and character-level
This paper revisits feature engineering approaches for predicting the complexity level of English words in a particular context using regression techniques. Our best submission to the Lexical Complexity Prediction (LCP) shared task was ranked 3rd out
This paper describes a system submitted by team BigGreen to LCP 2021 for predicting the lexical complexity of English words in a given context. We assemble a feature engineering-based model with a deep neural network model founded on BERT. While BERT
In this paper, we propose a method of fusing sentence information and word frequency information for the SemEval 2021 Task 1-Lexical Complexity Prediction (LCP) shared task. In our system, the sentence information comes from the RoBERTa model, and th
In this paper we describe our participation in the Lexical Complexity Prediction (LCP) shared task of SemEval 2021, which involved predicting subjective ratings of complexity for English single words and multi-word expressions, presented in context.