يصف هذا العمل تقديم Edinburgh إلى المهمة Sigmorphon 2021 المشتركة 2 على تجميع النموذج المورفولوجي غير المقترح.إعطاء إدخال النص الخام، وكانت المهمة لتعيين كل رمز رمزية إلى كتلة مع الرموز الأخرى من نفس النموذج.نحن نستخدم تجزئة محول القواعد جنبا إلى جنب مع الاستدلال القائم على التردد للتنبؤ مجموعات النماذج.حقق نظامنا أعلى متوسط درجة F1 عبر 9 لغات اختبار، ووضع أولا من 15 رسالة.
This work describes the Edinburgh submission to the SIGMORPHON 2021 Shared Task 2 on unsupervised morphological paradigm clustering. Given raw text input, the task was to assign each token to a cluster with other tokens from the same paradigm. We use Adaptor Grammar segmentations combined with frequency-based heuristics to predict paradigm clusters. Our system achieved the highest average F1 score across 9 test languages, placing first out of 15 submissions.
References used
https://aclanthology.org/
We describe the second SIGMORPHON shared task on unsupervised morphology: the goal of the SIGMORPHON 2021 Shared Task on Unsupervised Morphological Paradigm Clustering is to cluster word types from a raw text corpus into paradigms. To this end, we re
This paper describes the submission of the CU-UBC team for the SIGMORPHON 2021 Shared Task 2: Unsupervised morphological paradigm clustering. Our system generates paradigms using morphological transformation rules which are discovered from raw data.
This paper describes our system for the SIGMORPHON 2021 Shared Task on Unsupervised Morphological Paradigm Clustering, which asks participants to group inflected forms together according their underlying lemma without the aid of annotated training da
This paper presents two different systems for unsupervised clustering of morphological paradigms, in the context of the SIGMORPHON 2021 Shared Task 2. The goal of this task is to correctly cluster words in a given language by their inflectional parad
A reliable clustering algorithm for task-oriented dialogues can help developer analysis and define dialogue tasks efficiently. It is challenging to directly apply prior normal text clustering algorithms for task-oriented dialogues, due to the inheren