Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

fBERT: A Neural Transformer for Identifying Offensive Content

Fbert: محول عصبي لتحديد المحتوى الهجومي

577 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

neural transformer identifying offensive content transformer for identifying محول العصبي تحديد المحتوى الهجومي محول لتحديد صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Transformer-based models such as BERT, XLNET, and XLM-R have achieved state-of-the-art performance across various NLP tasks including the identification of offensive language and hate speech, an important problem in social media. In this paper, we present fBERT, a BERT model retrained on SOLID, the largest English offensive language identification corpus available with over 1.4 million offensive instances. We evaluate fBERT's performance on identifying offensive content on multiple English datasets and we test several thresholds for selecting instances from SOLID. The fBERT model will be made freely available to the community.

References used

https://aclanthology.org/

rate research

WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for Detecting Toxic Spans

838 - Association for Computation Linguistics 2021 مقالة

In recent years, the widespread use of social media has led to an increase in the generation of toxic and offensive content on online platforms. In response, social media platforms have worked on developing automatic detection methods and employing h uman moderators to cope with this deluge of offensive content. While various state-of-the-art statistical models have been applied to detect toxic posts, there are only a few studies that focus on detecting the words or expressions that make a post offensive. This motivates the organization of the SemEval-2021 Task 5: Toxic Spans Detection competition, which has provided participants with a dataset containing toxic spans annotation in English posts. In this paper, we present the WLV-RIT entry for the SemEval-2021 Task 5. Our best performing neural transformer model achieves an 0.68 F1-Score. Furthermore, we develop an open-source framework for multilingual detection of offensive spans, i.e., MUDES, based on neural transformers that detect toxic spans in texts.

مطابقة الفهم القراءة toxic سامة صناعة حمض الفوسفور

Identifying the Importance of Content Overlap for Better Cross-lingual Embedding Mappings

608 - Association for Computation Linguistics 2021 مقالة

In this work, we analyze the performance and properties of cross-lingual word embedding models created by mapping-based alignment methods. We use several measures of corpus and embedding similarity to predict BLI scores of cross-lingual embedding map pings over three types of corpora, three embedding methods and 55 language pairs. Our experimental results corroborate that instead of mere size, the amount of common content in the training corpora is essential. This phenomenon manifests in that i) despite of the smaller corpus sizes, using only the comparable parts of Wikipedia for training the monolingual embedding spaces to be mapped is often more efficient than relying on all the contents of Wikipedia, ii) the smaller, in return less diversified Spanish Wikipedia works almost always much better as a training corpus for bilingual mappings than the ubiquitously used English Wikipedia.

cross-lingual embedding mappings identifying the importance content overlap تعريض تضمين التضمين تحديد الأهمية تداخل المحتوى صناعة حمض الفوسفور المزيد..

A Globally Normalized Neural Model for Semantic Parsing

788 - Association for Computation Linguistics 2021 مقالة

In this paper, we propose a globally normalized model for context-free grammar (CFG)-based semantic parsing. Instead of predicting a probability, our model predicts a real-valued score at each step and does not suffer from the label bias problem. Exp eriments show that our approach outperforms locally normalized models on small datasets, but it does not yield improvement on a large dataset.

globally normalized neural normalized neural model based semantic parsing طبقة عالميا العصبية نموذج عصبي طبيعي التحليل الدلالي القائم صناعة حمض الفوسفور المزيد..

DA-Transformer: Distance-aware Transformer

634 - Association for Computation Linguistics 2021 مقالة

Transformer has achieved great success in the NLP field by composing various advanced models like BERT and GPT. However, Transformer and its existing variants may not be optimal in capturing token distances because the position or distance embeddings used by these methods usually cannot keep the precise information of real distances, which may not be beneficial for modeling the orders and relations of contexts. In this paper, we propose DA-Transformer, which is a distance-aware Transformer that can exploit the real distance. We propose to incorporate the real distances between tokens to re-scale the raw self-attention weights, which are computed by the relevance between attention query and key. Concretely, in different self-attention heads the relative distance between each pair of tokens is weighted by different learnable parameters, which control the different preferences on long- or short-term information of these heads. Since the raw weighted real distances may not be optimal for adjusting self-attention weights, we propose a learnable sigmoid function to map them into re-scaled coefficients that have proper ranges. We first clip the raw self-attention weights via the ReLU function to keep non-negativity and introduce sparsity, and then multiply them with the re-scaled coefficients to encode real distance information into self-attention. Extensive experiments on five benchmark datasets show that DA-Transformer can effectively improve the performance of many tasks and outperform the vanilla Transformer and its several variants.

distance-aware transformer bert and gpt محول عن بعد بيرت و GPT. صناعة حمض الفوسفور

A Study for Identifying Characteristics of Reconstruction Projects Management in Syria

2295 - Aِl-Baath University 2017 ورقة بحثية

This paper aims to identify key characteristics projects in reconstruction stage in order to assist decision makers to produce appropriate approach to manage those projects effectively. A list of characteristics that may exist in reconstruction projects were identified through intensive literature review and pilot study with various stakeholders involved in in reconstruction stage.

خصائص مشاريع إعادة الإعمار إدارة مشاريع إعادة الإعمار صنع القرار في مشاريع إعادة الإعمار إعادة إعمار سوريا Characteristics of Reconstruction projects Post – disaster Reconstruction Management of reconstruction projects Decision Making in Reconstruction projects Rebuild Syria المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

fBERT: A Neural Transformer for Identifying Offensive Content

Fbert: محول عصبي لتحديد المحتوى الهجومي

Ask ChatGPT about the research

Read More

suggested questions