New community

Subscribe to the gold package and get unlimited access to Shamra Academy

S-NLP at SemEval-2021 Task 5: An Analysis of Dual Networks for Sequence Tagging

S-NLP في مهمة Semeval-2021 5: تحليل الشبكات المزدوجة لعلامات التسلسل

340 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

analysis of dual dual networks networks for sequence تحليل المزدوج الشبكات المزدوجة الشبكات للتسلسل صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

The SemEval 2021 task 5: Toxic Spans Detection is a task of identifying considered-toxic spans in text, which provides a valuable, automatic tool for moderating online contents. This paper represents the second-place method for the task, an ensemble of two approaches. While one approach relies on combining different embedding methods to extract diverse semantic and syntactic representations of words in context; the other utilizes extra data with a slightly customized Self-training, a semi-supervised learning technique, for sequence tagging problems. Both of our architectures take advantage of a strong language model, which was fine-tuned on a toxic classification task. Although experimental evidence indicates higher effectiveness of the first approach than the second one, combining them leads to our best results of 70.77 F1-score on the test dataset.

References used

https://aclanthology.org/

rate research

NLP\_UIOWA at Semeval-2021 Task 5: Transferring Toxic Sets to Tag Toxic Spans

397 - Association for Computation Linguistics 2021 مقالة

We leverage a BLSTM with attention to identify toxic spans in texts. We explore different dimensions which affect the model's performance. The first dimension explored is the toxic set the model is trained on. Besides the provided dataset, we explore the transferability of 5 different toxic related sets, including offensive, toxic, abusive, and hate sets. We find that the solely offensive set shows the highest promise of transferability. The second dimension we explore is methodology, including leveraging attention, employing a greedy remove method, using a frequency ratio, and examining hybrid combinations of multiple methods. We conduct an error analysis to examine which types of toxic spans were missed and which were wrongly inferred as toxic along with the main reasons why they occurred. Finally, we extend our method via ensembles, which achieves our highest F1 score of 55.1.

tag toxic spans transferring toxic sets transferring toxic علامة السامة يمتد نقل مجموعات سامة نقل السامة صناعة حمض الفوسفور المزيد..

macech at SemEval-2021 Task 5: Toxic Spans Detection

276 - Association for Computation Linguistics 2021 مقالة

Toxic language is often present in online forums, especially when politics and other polarizing topics arise, and can lead to people becoming discouraged from joining or continuing conversations. In this paper, we use data consisting of comments with the indices of toxic text labelled to train an RNN to deter-mine which parts of the comments make them toxic, which could aid online moderators. We compare results using both the original dataset and an augmented set, as well as GRU versus LSTM RNN models.

حوار SRPOL. صناعة حمض الفوسفور

HLE-UPC at SemEval-2021 Task 5: Multi-Depth DistilBERT for Toxic Spans Detection

458 - Association for Computation Linguistics 2021 مقالة

This paper presents our submission to SemEval-2021 Task 5: Toxic Spans Detection. The purpose of this task is to detect the spans that make a text toxic, which is a complex labour for several reasons. Firstly, because of the intrinsic subjectivity of toxicity, and secondly, due to toxicity not always coming from single words like insults or offends, but sometimes from whole expressions formed by words that may not be toxic individually. Following this idea of focusing on both single words and multi-word expressions, we study the impact of using a multi-depth DistilBERT model, which uses embeddings from different layers to estimate the final per-token toxicity. Our quantitative results show that using information from multiple depths boosts the performance of the model. Finally, we also analyze our best model qualitatively.

انتباه مقرها صناعة حمض الفوسفور

Manchester Metropolitan at SemEval-2021 Task 1: Convolutional Networks for Complex Word Identification

501 - Association for Computation Linguistics 2021 مقالة

We present two convolutional neural networks for predicting the complexity of words and phrases in context on a continuous scale. Both models utilize word and character embeddings alongside lexical features as inputs. Our system displays reasonable r esults with a Pearson correlation of 0.7754 on the task as a whole. We highlight the limitations of this method in properly assessing the context of the target text, and explore the effectiveness of both systems across a range of genres. Both models were submitted as part of LCP 2021, which focuses on the identification of complex words and phrases as a context dependent, regression based task.

manchester metropolitan convolutional networks مانشستر متروبوليتان الشبكات العصبية التنافسية الشبكات التفافية صناعة حمض الفوسفور

SemEval-2021 Task 5: Toxic Spans Detection

280 - Association for Computation Linguistics 2021 مقالة

The Toxic Spans Detection task of SemEval-2021 required participants to predict the spans of toxic posts that were responsible for the toxic label of the posts. The task could be addressed as supervised sequence labeling, using training data with gol d toxic spans provided by the organisers. It could also be treated as rationale extraction, using classifiers trained on potentially larger external datasets of posts manually annotated as toxic or not, without toxic span annotations. For the supervised sequence labeling approach and evaluation purposes, posts previously labeled as toxic were crowd-annotated for toxic spans. Participants submitted their predicted spans for a held-out test set and were scored using character-based F1. This overview summarises the work of the 36 teams that provided system descriptions.

toxic spans detection spans detection task spans detection يمتد يمتد السامة يمتد مهمة الكشف عنها يمتد الكشف صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

S-NLP at SemEval-2021 Task 5: An Analysis of Dual Networks for Sequence Tagging

S-NLP في مهمة Semeval-2021 5: تحليل الشبكات المزدوجة لعلامات التسلسل

Ask ChatGPT about the research

Read More

suggested questions