عادة ما تحقق الأساليب الخاضعة للإشراف أفضل أداء في مشكلة غموض معنى الكلمة.ومع ذلك، فإن عدم توفر إحساس كبير مشروح بالنسبة للعديد من اللغات منخفضة الموارد يجعل هذه الأساليب غير قابل للتطبيق لها في الممارسة العملية.في هذه الورقة، نقوم بتخفيف هذه المشكلة باللغة الفارسية من خلال اقتراح نهج أوتوماتيكي بالكامل للحصول على فارسية الفارسية (Bredemcor)، ككائن مشروح من كيس الفارسية (القوس).قمنا بتقييم الصرص على حد سواء بشكل جوهري ودخله وأظهر أنه يمكن استخدامه بفعالية كمجموعات تدريبية لأنظمة WSD الإشرافية الفارسية.لتشجيع البحث في المستقبل على الغموض في مجال الإحساس بالكلمة الفارسية، فإننا نطلق الولادة في http://nlp.sbu.ac.ir.
Supervised approaches usually achieve the best performance in the Word Sense Disambiguation problem. However, the unavailability of large sense annotated corpora for many low-resource languages make these approaches inapplicable for them in practice. In this paper, we mitigate this issue for the Persian language by proposing a fully automatic approach for obtaining Persian SemCor (PerSemCor), as a Persian Bag-of-Word (BoW) sense-annotated corpus. We evaluated PerSemCor both intrinsically and extrinsically and showed that it can be effectively used as training sets for Persian supervised WSD systems. To encourage future research on Persian Word Sense Disambiguation, we release the PerSemCor in http://nlp.sbu.ac.ir.
References used
https://aclanthology.org/
As a result of unstructured sentences and some misspellings and errors, finding named entities in a noisy environment such as social media takes much more effort. ParsTwiNER contains about 250k tokens, based on standard instructions like MUC-6 or CoN
Abstract Despite the progress made in recent years in addressing natural language understanding (NLU) challenges, the majority of this progress remains to be concentrated on resource-rich languages like English. This work focuses on Persian language,
This paper presents an attempt at multiword expressions (MWEs) discovery in the Persian language. It focuses on extracting MWEs containing lemmas of a particular group: loanwords in Persian and their equivalents proposed by the Academy of Persian Lan
The wide reach of social media platforms, such as Twitter, have enabled many users to share their thoughts, opinions and emotions on various topics online. The ability to detect these emotions automatically would allow social scientists, as well as,
In parataxis languages like Chinese, word meanings are constructed using specific word-formations, which can help to disambiguate word senses. However, such knowledge is rarely explored in previous word sense disambiguation (WSD) methods. In this pap