The proliferation of fake news is a current issue that affects several important areas of society, such as politics, the economy, and health. In Natural Language Processing, recent initiatives have tried to detect fake news in different ways, ranging from language-based approaches to content-based verification. In such approaches, the choice of features for classifying fake and true news is one of the most important parts of the process. This paper presents a study of the impact of readability features on fake news detection for Brazilian Portuguese. The results show that such features are relevant to the task (achieving, on their own, up to 92% classification accuracy) and may improve previous classification results.