Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

MUSER: MUltimodal Stress detection using Emotion Recognition as an Auxiliary Task

Muser: كشف الإجهاد المتعدد الوسائط باستخدام التعرف على العاطفة كامرأة مساعدة

1295 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

الإطار الدلالي stress detection multimodal stress detection اكتشاف الإجهاد اكتشاف الإجهاد متعددة الوسائط صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

The capability to automatically detect human stress can benefit artificial intelligent agents involved in affective computing and human-computer interaction. Stress and emotion are both human affective states, and stress has proven to have important implications on the regulation and expression of emotion. Although a series of methods have been established for multimodal stress detection, limited steps have been taken to explore the underlying inter-dependence between stress and emotion. In this work, we investigate the value of emotion recognition as an auxiliary task to improve stress detection. We propose MUSER -- a transformer-based model architecture and a novel multi-task learning algorithm with speed-based dynamic sampling strategy. Evaluation on the Multimodal Stressed Emotion (MuSE) dataset shows that our model is effective for stress detection with both internal and external auxiliary tasks, and achieves state-of-the-art results.

References used

https://aclanthology.org/

rate research

MinD at SemEval-2021 Task 6: Propaganda Detection using Transfer Learning and Multimodal Fusion

672 - Association for Computation Linguistics 2021 مقالة

We describe our systems of subtask1 and subtask3 for SemEval-2021 Task 6 on Detection of Persuasion Techniques in Texts and Images. The purpose of subtask1 is to identify propaganda techniques given textual content, and the goal of subtask3 is to det ect them given both textual and visual content. For subtask1, we investigate transfer learning based on pre-trained language models (PLMs) such as BERT, RoBERTa to solve data sparsity problems. For subtask3, we extract heterogeneous visual representations (i.e., face features, OCR features, and multimodal representations) and explore various multimodal fusion strategies to combine the textual and visual representations. The official evaluation shows our ensemble model ranks 1st for subtask1 and 2nd for subtask3.

الكشف عن إقناع صناعة حمض الفوسفور

Emotion-Infused Models for Explainable Psychological Stress Detection

865 - Association for Computation Linguistics 2021 مقالة

The problem of detecting psychological stress in online posts, and more broadly, of detecting people in distress or in need of help, is a sensitive application for which the ability to interpret models is vital. Here, we present work exploring the us e of a semantically related task, emotion detection, for equally competent but more explainable and human-like psychological stress detection as compared to a black-box model. In particular, we explore the use of multi-task learning as well as emotion-based language model fine-tuning. With our emotion-infused models, we see comparable results to state-of-the-art BERT. Our analysis of the words used for prediction show that our emotion-infused models mirror psychological components of stress.

psychological stress detection explainable psychological stress اكتشاف الإجهاد النفسي تمفس الإجهاد النفسي الإجهاد النفسي صناعة حمض الفوسفور

Speech Emotion Recognition Based on CNN+LSTM Model

1275 - Association for Computation Linguistics 2021 مقالة

Due to the popularity of intelligent dialogue assistant services, speech emotion recognition has become more and more important. In the communication between humans and machines, emotion recognition and emotion analysis can enhance the interaction be tween machines and humans. This study uses the CNN+LSTM model to implement speech emotion recognition (SER) processing and prediction. From the experimental results, it is known that using the CNN+LSTM model achieves better performance than using the traditional NN model.

emotion recognition based speech emotion recognition emotion recognition العاطفة الاعتراف مقرها التعرف على العاطفة الكلام العاطفة الاعتراف صناعة حمض الفوسفور المزيد..

Multimodal End-to-End Sparse Model for Emotion Recognition

987 - Association for Computation Linguistics 2021 مقالة

Existing works in multimodal affective computing tasks, such as emotion recognition and personality recognition, generally adopt a two-phase pipeline by first extracting feature representations for each single modality with hand crafted algorithms, a nd then performing end-to-end learning with extracted features. However, the extracted features are fixed and cannot be further fine-tuned on different target tasks, and manually finding feature extracting algorithms does not generalize or scale well to different tasks, which can lead to sub-optimal performance. In this paper, we develop a fully end-to-end model that connects the two phases and optimizes them jointly. In addition, we restructure the current datasets to enable the fully end-to-end training. Furthermore, to reduce the computational overhead brought by the end-to-end model, we introduce a sparse cross-modal attention mechanism for the feature extraction. Experimental results show that our fully end-to-end model significantly surpasses the current state-of-the-art models based on the two-phase pipeline. Moreover, by adding the sparse cross-modal attention, our model can maintain the performance with around half less computation in the feature extraction part of the model.

نهج التعلم متري صناعة حمض الفوسفور

Few-Shot Emotion Recognition in Conversation with Sequential Prototypical Networks

790 - Association for Computation Linguistics 2021 مقالة

Several recent studies on dyadic human-human interactions have been done on conversations without specific business objectives. However, many companies might benefit from studies dedicated to more precise environments such as after sales services or customer satisfaction surveys. In this work, we place ourselves in the scope of a live chat customer service in which we want to detect emotions and their evolution in the conversation flow. This context leads to multiple challenges that range from exploiting restricted, small and mostly unlabeled datasets to finding and adapting methods for such context. We tackle these challenges by using Few-Shot Learning while making the hypothesis it can serve conversational emotion classification for different languages and sparse labels. We contribute by proposing a variation of Prototypical Networks for sequence labeling in conversation that we name ProtoSeq. We test this method on two datasets with different languages: daily conversations in English and customer service chat conversations in French. When applied to emotion classification in conversations, our method proved to be competitive even when compared to other ones.

sequential prototypical networks few-shot emotion recognition sequential prototypical شبكات النماذج النموذجية المتسلسلة عدد قليل من العاطفة الاعتراف النماذج النموذجية المتسلسلة صناعة حمض الفوسفور المزيد..

MUSER: MUltimodal Stress detection using Emotion Recognition as an Auxiliary Task

Muser: كشف الإجهاد المتعدد الوسائط باستخدام التعرف على العاطفة كامرأة مساعدة

Ask ChatGPT about the research

Read More

suggested questions