Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

NICE: Neural Image Commenting with Empathy

لطيفة: الصورة العصبية التعليق مع التعاطف

491 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

العاطفة والتعاطف هي أمثلة على الصفات البشرية التي تفتقر إلى العديد من التفاعلات البشرية. الهدف من عملنا هو توليد حوار جذاب في صورة مشتركة من المستخدمين مع زيادة العاطفة والتعاطف مع تقليل النواتج غير اللائق أو الهجومية الاجتماعية. ونحن نفرج عن الصورة العصبية التعليق مع مجموعة بيانات التعاطف (لطيفة) تتكون من ما يقرب من مليوني صورة وتعليقات مقابلة للإنسان، ومجموعة من التعليقات الشروحية البشرية والأداء الأساسي في مجموعة من النماذج. في الموقف عن الاعتماد على المشاعر المسمى يدويا، نستخدم أيضا تمثيل اللغوي الذي تم إنشاؤه تلقائيا كمصدر للملصقات الخاضعة للإشراف. بناء على هذه التعليقات التوضيحية، نحدد مهامين مختلفة لمجموعة البيانات الجميلة. بعد ذلك، نقترح نموذجا روايا قبل التدريب - النمذجة تؤثر على جيل للحصول على تعليقات الصورة (السحر) - والتي تهدف إلى توليد تعليقات للصور، مشروطة على التمثيل اللغوي الذي التقاط النمط والتأثير، والمساعدة في توليد أكثر تعاطفا وعاطفيا وجذابا و تعليقات اجتماعية مناسبة. باستخدام هذا النموذج، نحقق الأداء الحديث في واحدة من مهامنا الجميلة. تظهر التجارب أن النهج يمكن أن يولد المزيد من التعليقات التي تشبه الإنسان وإشراكها للإشراك.

Emotion and empathy are examples of human qualities lacking in many human-machine interactions. The goal of our work is to generate engaging dialogue grounded in a user-shared image with increased emotion and empathy while minimizing socially inappropriate or offensive outputs. We release the Neural Image Commenting with Empathy (NICE) dataset consisting of almost two million images and the corresponding human-generated comments, a set of human annotations, and baseline performance on a range of models. In-stead of relying on manually labeled emotions, we also use automatically generated linguistic representations as a source of weakly supervised labels. Based on these annotations, we define two different tasks for the NICE dataset. Then, we propose a novel pre-training model - Modeling Affect Generation for Image Comments (MAGIC) - which aims to generate comments for images, conditioned on linguistic representations that capture style and affect, and to help generate more empathetic, emotional, engaging and socially appropriate comments. Using this model we achieve state-of-the-art performance on one of our NICE tasks. The experiments show that the approach can generate more human-like and engaging image comments.

References used

https://aclanthology.org/

rate research

Multilingual Image Corpus: Annotation Protocol

782 - Association for Computation Linguistics 2021 مقالة

In this paper, we present work in progress aimed at the development of a new image dataset with annotated objects. The Multilingual Image Corpus consists of an ontology of visual objects (based on WordNet) and a collection of thematically related ima ges annotated with segmentation masks and object classes. We identified 277 dominant classes and 1,037 parent and attribute classes, and grouped them into 10 thematic domains such as sport, medicine, education, food, security, etc. For the selected classes a large-scale web image search is being conducted in order to compile a substantial collection of high-quality copyright free images. The focus of the paper is the annotation protocol which we established to facilitate the annotation process: the Ontology of visual objects and the conventions for image selection and for object segmentation. The dataset is designed both for image classification and object detection and for semantic segmentation. In addition, the object annotations will be supplied with multilingual descriptions by using freely available wordnets.

multilingual image corpus image corpus image corpus consists صورة متعددة اللغات Corpus. صورة Corpus. الصورة Corpus تتكون صناعة حمض الفوسفور المزيد..

Distilling Knowledge for Empathy Detection

581 - Association for Computation Linguistics 2021 مقالة

Empathy is the link between self and others. Detecting and understanding empathy is a key element for improving human-machine interaction. However, annotating data for detecting empathy at a large scale is a challenging task. This paper employs multi -task training with knowledge distillation to incorporate knowledge from available resources (emotion and sentiment) to detect empathy from the natural language in different domains. This approach yields better results on an existing news-related empathy dataset compared to strong baselines. In addition, we build a new dataset for empathy prediction with fine-grained empathy direction, seeking or providing empathy, from Twitter. We release our dataset for research purposes.

empathy detection الكشف عن التعاطف صناعة حمض الفوسفور

Image classification with Deep Convolutional Neural Network Using Tensorflow and Transfer of Learning

3086 - جامعة بغداد 2020 مقالة

The deep learning algorithm has recently achieved a lot of success, especially in the field of computer vision. This research aims to describe the classification method applied to the dataset of multiple types of images (Synthetic Aperture Radar (SAR ) images and non-SAR images). In such a classification, transfer learning was used followed by fine-tuning methods. Besides, pre-trained architectures were used on the known image database ImageNet. The model VGG16 was indeed used as a feature extractor and a new classifier was trained based on extracted features.The input data mainly focused on the dataset consist of five classes including the SAR images class (houses) and the non-SAR images classes (Cats, Dogs, Horses, and Humans). The Convolutional Neural Network (CNN) has been chosen as a better option for the training process because it produces a high accuracy. The final accuracy has reached 91.18% in five different classes. The results are discussed in terms of the probability of accuracy for each class in the image classification in percentage. Cats class got 99.6 %, while houses class got 100 %.Other types of classes were with an average score of 90 % and above.

CNN الشبكات العصبونية الالتفافية convolutional neural network Synthetic Aperture Radar Transfer learning TensorFlow Tensor flow Visual Geometry Group المزيد..

Neural Machine Translation with Inflected Lexicon

730 - Association for Computation Linguistics 2021 مقالة

The paper presents experiments in neural machine translation with lexical constraints into a morphologically rich language. In particular and we introduce a method and based on constrained decoding and which handles the inflected forms of lexical ent ries and does not require any modification to the training data or model architecture. To evaluate its effectiveness and we carry out experiments in two different scenarios: general and domain-specific. We compare our method with baseline translation and i.e. translation without lexical constraints and in terms of translation speed and translation quality. To evaluate how well the method handles the constraints and we propose new evaluation metrics which take into account the presence and placement and duplication and inflectional correctness of lexical terms in the output sentence.

الآلة العصبية التفاعلية inflected lexicon نسخ معجم صناعة حمض الفوسفور

Improving Neural Language Processing with Named Entities

765 - Association for Computation Linguistics 2021 مقالة

Pretraining-based neural network models have demonstrated state-of-the-art (SOTA) performances on natural language processing (NLP) tasks. The most frequently used sentence representation for neural-based NLP methods is a sequence of subwords that is different from the sentence representation of non-neural methods that are created using basic NLP technologies, such as part-of-speech (POS) tagging, named entity (NE) recognition, and parsing. Most neural-based NLP models receive only vectors encoded from a sequence of subwords obtained from an input text. However, basic NLP information, such as POS tags, NEs, parsing results, etc, cannot be obtained explicitly from only the large unlabeled text used in pretraining-based models. This paper explores use of NEs on two Japanese tasks; document classification and headline generation using Transformer-based models, to reveal the effectiveness of basic NLP information. The experimental results with eight basic NEs and approximately 200 extended NEs show that NEs improve accuracy although a large pretraining-based model trained using 70 GB text data was used.

improving neural language neural language processing تحسين اللغة العصبية معالجة اللغة العصبية صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

NICE: Neural Image Commenting with Empathy

لطيفة: الصورة العصبية التعليق مع التعاطف

Ask ChatGPT about the research

Read More

suggested questions