BERT meets Shapley: Extending SHAP Explanations to Transformer-based Classifiers

Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor

Abstract

Transformer-based neural networks offer very good classification performance across a wide range of domains, but do not provide explanations of their predictions. While several explanation methods, including SHAP, address the problem of interpreting deep learning models, they are not adapted to operate on state-of-the-art transformer-based neural networks such as BERT. Another shortcoming of these methods is that their visualization of explanations in the form of lists of most relevant words does not take into account the sequential and structurally dependent nature of text. This paper proposes the TransSHAP method that adapts SHAP to transformer models including BERT-based text classifiers. It advances SHAP visualizations by showing explanations in a sequential manner, assessed by human evaluators as competitive to state-of-the-art solutions.
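The abstract does not describe how TransSHAP works internally, so the following is not the authors' implementation. It is a minimal Python sketch of the idea SHAP builds on: exact Shapley values computed per word, followed by a rendering that keeps the words in their original order instead of a ranked list. A toy scoring function stands in for a BERT classifier, and the names `TOY_WEIGHTS`, `toy_score`, `shapley_values`, and `sequential_view` are all hypothetical.

```python
from itertools import combinations
from math import factorial

# Assumption: hand-picked word weights standing in for a trained classifier.
TOY_WEIGHTS = {"good": 2.0, "great": 3.0, "bad": -2.5, "not": -1.0}


def toy_score(words):
    """Toy sentiment score: sum of word weights plus a crude negation effect."""
    score = sum(TOY_WEIGHTS.get(w, 0.0) for w in words)
    if "not" in words and "good" in words:
        score -= 4.0  # interaction term: "not" together with "good" reads negative
    return score


def shapley_values(words, score_fn):
    """Exact Shapley value of each word position (exponential cost, short inputs only)."""
    n = len(words)
    values = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Weight of a coalition of this size in the Shapley formula.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                # "Masked" words are simply dropped; kept words stay in order.
                without_i = [words[j] for j in subset]
                with_i = [words[j] for j in sorted(subset + (i,))]
                values[i] += weight * (score_fn(with_i) - score_fn(without_i))
    return values


def sequential_view(words, values):
    """Render attributions in sentence order rather than sorted by magnitude."""
    return " ".join(f"{w}({v:+.2f})" for w, v in zip(words, values))


if __name__ == "__main__":
    sentence = "the movie was not good".split()
    print(sequential_view(sentence, shapley_values(sentence, toy_score)))
```

The per-word values sum to the difference between the score of the full sentence and the score of the empty input, which is the property that makes Shapley attributions additive. Practical SHAP variants approximate these values by sampling, because the exact computation grows exponentially with sentence length.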

References used

https://aclanthology.org/

Using BERT for choosing classifiers in Mandarin

Association for Computational Linguistics, 2021 (article)

Choosing the most suitable classifier in a linguistic context is a well-known problem in the production of Mandarin and many other languages. The present paper proposes a solution based on BERT, compares this solution to previous neural and rule-based models, and argues that the BERT model performs particularly well on those difficult cases where the classifier adds information to the text.

Compressing Large-Scale Transformer-Based Models: A Case Study on BERT

Association for Computational Linguistics, 2021 (article)

Pre-trained Transformer-based models have achieved state-of-the-art performance for various Natural Language Processing (NLP) tasks. However, these models often have billions of parameters, and thus are too resource-hungry and computation-intensive to suit low-capability devices or applications with strict latency requirements. One potential remedy for this is model compression, which has attracted considerable research attention. Here, we summarize the research in compressing Transformers, focusing on the especially popular BERT model. In particular, we survey the state of the art in compression for BERT, we clarify the current best practices for compressing large-scale Transformer models, and we provide insights into the workings of various methods. Our categorization and analysis also shed light on promising future research directions for achieving lightweight, accurate, and generic NLP models.

Progressive Transformer-Based Generation of Radiology Reports

Association for Computational Linguistics, 2021 (article)

Inspired by Curriculum Learning, we propose a consecutive (i.e., image-to-text-to-text) generation framework where we divide the problem of radiology report generation into two steps. Contrary to generating the full radiology report from the image at once, the model generates global concepts from the image in the first step and then reforms them into finer and coherent texts using a transformer-based architecture. We follow the transformer-based sequence-to-sequence paradigm at each step. We improve upon the state of the art on two benchmark datasets.

BERT based Adverse Drug Effect Tweet Classification

Association for Computational Linguistics, 2021 (article)

This paper describes models developed for the Social Media Mining for Health (SMM4H) 2021 shared tasks. Our team participated in the first subtask that classifies tweets with Adverse Drug Effect (ADE) mentions. Our best performing model utilizes BERTweet followed by a single layer of BiLSTM. The system achieves an F-score of 0.45 on the test set without the use of any auxiliary resources such as Part-of-Speech tags, dependency tags, or knowledge from medical dictionaries.

Zero-shot Sequence Labeling for Transformer-based Sentence Classifiers

Association for Computational Linguistics, 2021 (article)

We investigate how sentence-level transformers can be modified into effective sequence labelers at the token level without any direct supervision. Existing approaches to zero-shot sequence labeling do not perform well when applied to transformer-based architectures. As transformers contain multiple layers of multi-head self-attention, information in the sentence gets distributed between many tokens, negatively affecting zero-shot token-level performance. We find that a soft attention module which explicitly encourages sharpness of attention weights can significantly outperform existing methods.
