
IITK@LCP at SemEval-2021 Task 1: Classification for Lexical Complexity Regression Task



Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor


This paper describes our contribution to SemEval 2021 Task 1 (Shardlow et al., 2021): Lexical Complexity Prediction. In our approach, we leverage the ELECTRA model and attempt to mirror the data annotation scheme. Although the task is a regression task, we show that it can be treated as an aggregation of several classification and regression models. This somewhat counter-intuitive approach achieved an MAE of 0.0654 on Sub-Task 1 and 0.0811 on Sub-Task 2. Additionally, we used weak supervision signals from Gloss-BERT, which significantly improved the MAE score on Sub-Task 1.
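The idea of treating a regression target as an aggregation of classification and regression outputs can be illustrated with a minimal sketch. This is not the authors' implementation: the mapping of five Likert classes to evenly spaced scores in [0, 1], the simple averaging step, and all function names below are assumptions made for illustration only.

```python
from statistics import mean

# Assumption: five Likert classes map to evenly spaced scores in [0, 1].
LIKERT_SCORES = [0.0, 0.25, 0.5, 0.75, 1.0]


def expected_complexity(class_probs):
    """Collapse a 5-way class distribution into one score.

    The score is the probability-weighted mean of the class scores,
    normalised in case the weights do not sum exactly to 1.
    """
    if len(class_probs) != len(LIKERT_SCORES):
        raise ValueError("expected one probability per Likert class")
    total = sum(class_probs)
    if total <= 0:
        raise ValueError("probabilities must sum to a positive value")
    return sum(p * s for p, s in zip(class_probs, LIKERT_SCORES)) / total


def aggregate(classifier_scores, regressor_scores):
    """Hypothetical aggregation: plain average over all model scores."""
    return mean(list(classifier_scores) + list(regressor_scores))


def mean_absolute_error(y_true, y_pred):
    """MAE, the metric quoted in the abstract."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))
```

Under these assumptions, a classifier that puts all of its probability mass on the middle class yields a score of 0.5, and that score can be averaged with direct regression outputs before computing MAE against the gold complexity values.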

References used

https://aclanthology.org/


PolyU CBS-Comp at SemEval-2021 Task 1: Lexical Complexity Prediction (LCP)

Association for Computational Linguistics, 2021 (article)

In this contribution, we describe the system presented by the PolyU CBS-Comp Team at Task 1 of SemEval 2021, where the goal was to estimate the complexity of words in a given sentence context. Our top system, based on a combination of lexical, syntactic, word-embedding, and Transformer-derived features and on a Gradient Boosting Regressor, achieves a top correlation score of 0.754 on subtask 1 for single words and 0.659 on subtask 2 for multiword expressions.


LCP-RIT at SemEval-2021 Task 1: Exploring Linguistic Features for Lexical Complexity Prediction

Association for Computational Linguistics, 2021 (article)

This paper describes team LCP-RIT's submission to SemEval-2021 Task 1: Lexical Complexity Prediction (LCP). The task organizers provided participants with an augmented version of CompLex (Shardlow et al., 2020), an English multi-domain dataset in which words in context were annotated for complexity on a five-point Likert scale. Our system uses logistic regression and a wide range of linguistic features (e.g., psycholinguistic features, n-grams, word frequency, POS tags) to predict the complexity of single words in this dataset. We analyze the impact of different linguistic features on classification performance, and we evaluate the results in terms of mean absolute error, mean squared error, Pearson correlation, and Spearman correlation.


BigGreen at SemEval-2021 Task 1: Lexical Complexity Prediction with Assembly Models

Association for Computational Linguistics, 2021 (article)

This paper describes a system submitted by team BigGreen to LCP 2021 for predicting the lexical complexity of English words in a given context. We assemble a feature engineering-based model with a deep neural network model founded on BERT. While BERT itself performs competitively, our feature engineering-based model helps in extreme cases, e.g., separating instances of easy and neutral difficulty. Our handcrafted features comprise a breadth of lexical, semantic, syntactic, and novel phonological measures. Visualizations of BERT attention maps offer insight into potential features that Transformer models may learn when fine-tuned for lexical complexity prediction. Our ensembled predictions score reasonably well on the single-word subtask, and we demonstrate how they can be harnessed to perform well on the multiword expression subtask too.


JCT at SemEval-2021 Task 1: Context-aware Representation for Lexical Complexity Prediction

Association for Computational Linguistics, 2021 (article)

In this paper, we present our contribution to SemEval-2021 Task 1: Lexical Complexity Prediction, where we integrate linguistic, statistical, and semantic properties of the target word and its context as features within a Machine Learning (ML) framework for predicting lexical complexity. In particular, we use BERT contextualized word embeddings to represent the semantic meaning of the target word and its context. We participated in the sub-task of predicting the complexity score of single words.


archer at SemEval-2021 Task 1: Contextualising Lexical Complexity

Association for Computational Linguistics, 2021 (article)

Evaluating the complexity of a target word in a sentential context is the aim of the Lexical Complexity Prediction task at SemEval-2021. This paper presents the system created to assess the lexical complexity of single words, combining linguistic and psycholinguistic variables in a set of experiments involving random forest and XGBoost regressors. Beyond encoding out-of-context information about the lemma, we implemented features based on pre-trained language models to model the target word's in-context complexity.
