تحديد النمذجة هي مهمة توليد تعريف نمط قاموس تلقائيا بالنظر إلى كلمة مستهدفة.في هذه الورقة، نعتبر توليد تعريف عبر اللغات.على وجه التحديد، نولد تعريفات باللغة الإنجليزية لكلمات Wolastoqey (Malecite-Passamaquoddy).WOLASTOQEY هي لغة خلية مهددة بالانقراض والموارد المنخفضة.نحن نفترض أن تمثيل تمثيلات فرعية على أساس ترميز زوج البايت (Sennrich et al.، 2016) يمكن الاستفادة منها لتمثيل كلمات Wolastoqey المعقدة المورفولوجية والتغلب على التحدي المتمثل في عدم وجود كورسا كبيرة متاح للتدريب.توضح نتائجنا التجريبية أن هذا النهج يتفوق على أساليب خط الأساس من حيث النتيجة بلو.
Definition modelling is the task of automatically generating a dictionary-style definition given a target word. In this paper, we consider cross-lingual definition generation. Specifically, we generate English definitions for Wolastoqey (Malecite-Passamaquoddy) words. Wolastoqey is an endangered, low-resource polysynthetic language. We hypothesize that sub-word representations based on byte pair encoding (Sennrich et al., 2016) can be leveraged to represent morphologically-complex Wolastoqey words and overcome the challenge of not having large corpora available for training. Our experimental results demonstrate that this approach outperforms baseline methods in terms of BLEU score.
References used
https://aclanthology.org/
Definition generation techniques aim to generate a definition of a target word or phrase given a context. In previous studies, researchers have faced various issues such as the out-of-vocabulary problem and over/under-specificity problems. Over-speci
This paper presents the University of Edinburgh's constrained submissions of English-German and English-Hausa systems to the WMT 2021 shared task on news translation. We build En-De systems in three stages: corpus filtering, back-translation, and fin
We present the ongoing NorLM initiative to support the creation and use of very large contextualised language models for Norwegian (and in principle other Nordic languages), including a ready-to-use software environment, as well as an experience repo
We are using a semi-automated test suite in order to provide a fine-grained linguistic evaluation for state-of-the-art machine translation systems. The evaluation includes 18 German to English and 18 English to German systems, submitted to the Transl
The research
discusses roads rehabilitation using foamed bitumen,which is a
relatively new method and hasn’t been used in Syria yet due to the
lack of the needed labrotory equipments ,and that’s why the main
aim of the research is to develop a mo