Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Sentence-Level Arabic Sentiment Analysis

تحليل المشاعر في اللغة العربية على مستوى الجملة

2093 3 0 0.0 ( 0 )

Download Cite

Added by الجامعة الأميركية في القاهرة ورقة بحثية

Publication date 2012

fields Informatics Engineering

and research's language is العربية

Authors أميرة شكري( باحث )

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Arabic sentiment analysis research existing currently is very limited. While sentiment analysis has many applications in English, the Arabic language is still recognizing its early steps in this field. In this paper, we show an application on Arabic sentiment analysis by implementing a sentiment classification for Arabic tweets. The retrieved tweets are analyzed to provide their sentiments polarity (positive, or negative). Since, this data is collected from the social network Twitter; it has its importance for the Middle East region, which mostly speaks Arabic

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تتناول هذه الورقة البحثية تحليل المشاعر في اللغة العربية على مستوى الجملة، مع التركيز على التغريدات العربية على تويتر. يهدف البحث إلى تصنيف التغريدات إلى مشاعر إيجابية أو سلبية باستخدام تقنيات التعلم الآلي، وتحديدًا مصنفات Naive Bayes وSupport Vector Machines. تم جمع 1000 تغريدة (500 إيجابية و500 سلبية) لتدريب المصنفات. تتناول الورقة أيضًا التحديات المرتبطة بتحليل المشاعر في اللغة العربية، مثل قلة الأدوات المتاحة وتعقيد اللغة من حيث البنية والصرف. تم استخدام برنامج Weka Suite لإجراء عمليات التصنيف، وأظهرت النتائج أن مصنف SVM يتفوق على مصنف NB في دقة التصنيف. تناقش الورقة أيضًا تأثير إزالة الكلمات التوقفية على دقة التصنيف وتوصي بمزيد من العمل لتطوير قائمة موثوقة من الكلمات التوقفية لتحسين الأداء.

Critical review

دراسة نقدية: تقدم هذه الورقة مساهمة قيمة في مجال تحليل المشاعر في اللغة العربية، وهو مجال لا يزال في مراحله الأولى مقارنة باللغات الأخرى مثل الإنجليزية. ومع ذلك، هناك بعض النقاط التي يمكن تحسينها. أولاً، حجم العينة المستخدمة في التدريب (1000 تغريدة) قد يكون غير كافٍ للحصول على نتائج دقيقة وقابلة للتعميم. ثانيًا، لم يتم تناول تأثير اللهجات المختلفة في اللغة العربية بشكل كافٍ، حيث أن اللهجات قد تؤثر بشكل كبير على دقة التصنيف. ثالثًا، كان من المفيد تضمين مقارنة مع تقنيات أخرى غير التعلم الآلي، مثل التحليل الدلالي، لتقديم صورة أكثر شمولية. وأخيرًا، كان من الممكن تحسين الورقة بإضافة مزيد من التفاصيل حول كيفية اختيار وتطوير قائمة الكلمات التوقفية المستخدمة.

Questions related to the research

ما هو الهدف الرئيسي من هذه الورقة البحثية؟

الهدف الرئيسي هو تصنيف التغريدات العربية على تويتر إلى مشاعر إيجابية أو سلبية باستخدام تقنيات التعلم الآلي.
ما هي المصنفات المستخدمة في هذه الدراسة؟

تم استخدام مصنفات Naive Bayes وSupport Vector Machines.
ما هي التحديات الرئيسية التي تواجه تحليل المشاعر في اللغة العربية؟

التحديات تشمل قلة الأدوات المتاحة وتعقيد اللغة من حيث البنية والصرف.
ما هي النتائج الرئيسية التي توصلت إليها الدراسة؟

أظهرت النتائج أن مصنف SVM يتفوق على مصنف NB في دقة التصنيف.

Keywords

تحليل المشاعر التغريدات العربية التعلم الآلي Naive Bayes Support Vector Machines الكلمات التوقفية

References used

K. Yessenov, and S. Misailovic, “Sentiment Analysis of Movie Review Comments”, Graduation project. 17th, May, 2009

A. Abbasi, H. Chen, and A. Salem, “Sentiment analysis in multiple languages: Feature selection for opinion classification in web forums” ACM Transactions on Information Systems (TOIS), Vol 26, Issue 3, June 2008

rate research

A Language Model-based Generative Classifier for Sentence-level Discourse Parsing

444 - Association for Computation Linguistics 2021 مقالة

Discourse segmentation and sentence-level discourse parsing play important roles for various NLP tasks to consider textual coherence. Despite recent achievements in both tasks, there is still room for improvement due to the scarcity of labeled data. To solve the problem, we propose a language model-based generative classifier (LMGC) for using more information from labels by treating the labels as an input while enhancing label representations by embedding descriptions for each label. Moreover, since this enables LMGC to make ready the representations for labels, unseen in the pre-training step, we can effectively use a pre-trained language model in LMGC. Experimental results on the RST-DT dataset show that our LMGC achieved the state-of-the-art F1 score of 96.72 in discourse segmentation. It further achieved the state-of-the-art relation F1 scores of 84.69 with gold EDU boundaries and 81.18 with automatically segmented boundaries, respectively, in sentence-level discourse parsing.

sentence-level discourse parsing model-based generative classifier language model-based generative تحليل خطاب مستوى الجملة مصنف المولد النموذجي لغة اللغة القائمة على نموذج صناعة حمض الفوسفور المزيد..

Well-Defined Morphology is Sentence-Level Morphology

437 - Association for Computation Linguistics 2021 مقالة

Morphological tasks have gained decent popularity within the NLP community in the recent years, with large multi-lingual datasets providing morphological analysis of words, either in or out of context. However, the lack of a clear linguistic definiti on for words destines the annotative work to be incomplete and mired in inconsistencies, especially cross-linguistically. In this work we expand morphological inflection of words to inflection of sentences to provide true universality disconnected from orthographic traditions of white-space usage. To allow annotation for sentence-inflection we define a morphological annotation scheme by a fixed set of inflectional features. We present a small cross-linguistic dataset including semi-manually generated simple sentences in 4 typologically diverse languages annotated according to our suggested scheme, and show that the task of reinflection gets substantially more difficult but that the change of scope from words to well-defined sentences allows interface with contextualized language models.

sentence-level morphology morphology is sentence-level well-defined morphology التشكل على مستوى الجملة التشكل هو الجملة على مستوى مورفولوجيا محددة جيدا صناعة حمض الفوسفور المزيد..

Sarcasm and Sentiment Detection in Arabic: investigating the interest of character-level features

464 - Association for Computation Linguistics 2021 مقالة

We present three methods developed for the Shared Task on Sarcasm and Sentiment Detection in Arabic. We present a baseline that uses character n-gram features. We also propose two more sophisticated methods: a recurrent neural network with a word lev el representation and an ensemble classifier relying on word and character-level features. We chose to present results from an ensemble classifier but it was not very successful as compared to the best systems : 22th/37 on sarcasm detection and 15th/22 on sentiment detection. It finally appeared that our baseline could have been improved and beat those results.

كلمة satic word embeddings. detection in arabic investigating the interest الكشف باللغة العربية التحقيق في الفائدة صناعة حمض الفوسفور

Exploring Sentence Community for Document-Level Event Extraction

437 - Association for Computation Linguistics 2021 مقالة

Document-level event extraction is critical to various natural language processing tasks for providing structured information. Existing approaches by sequential modeling neglect the complex logic structures for long texts. In this paper, we leverage the entity interactions and sentence interactions within long documents and transform each document into an undirected unweighted graph by exploiting the relationship between sentences. We introduce the Sentence Community to represent each event as a subgraph. Furthermore, our framework SCDEE maintains the ability to extract multiple events by sentence community detection using graph attention networks and alleviate the role overlapping issue by predicting arguments in terms of roles. Experiments demonstrate that our framework achieves competitive results over state-of-the-art methods on the large-scale document-level event extraction dataset.

الرسم البياني تعلم exploring sentence community استكشاف مجتمع الجملة صناعة حمض الفوسفور

Controlled Neural Sentence-Level Reframing of News Articles

406 - Association for Computation Linguistics 2021 مقالة

Framing a news article means to portray the reported event from a specific perspective, e.g., from an economic or a health perspective. Reframing means to change this perspective. Depending on the audience or the submessage, reframing can become nece ssary to achieve the desired effect on the readers. Reframing is related to adapting style and sentiment, which can be tackled with neural text generation techniques. However, it is more challenging since changing a frame requires rewriting entire sentences rather than single phrases. In this paper, we study how to computationally reframe sentences in news articles while maintaining their coherence to the context. We treat reframing as a sentence-level fill-in-the-blank task for which we train neural models on an existing media frame corpus. To guide the training, we propose three strategies: framed-language pretraining, named-entity preservation, and adversarial learning. We evaluate respective models automatically and manually for topic consistency, coherence, and successful reframing. Our results indicate that generating properly-framed text works well but with tradeoffs.

controlled neural sentence-level reframing controlled neural السمة العصبية التي تسيطر عليها المستوى Reframing. الخيانة العصبية صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Sentence-Level Arabic Sentiment Analysis

تحليل المشاعر في اللغة العربية على مستوى الجملة

Ask ChatGPT about the research

Read More

suggested questions