Accurately dealing with any type of ambiguity is a major task in Natural Language Processing, and great advances have recently been made thanks to the development of context-dependent language models and the use of word or sentence embeddings. In this context, our work aimed to determine how the popular language representation model BERT handles ambiguity of nouns in grammatical number and gender in different languages. We show that models trained on one specific language achieve better results in the disambiguation process than multilingual models. In addition, ambiguity is generally handled better in grammatical number than in grammatical gender: in direct comparisons of individual senses, the distance values between senses are greater for number than for gender. The overall results also show that the amount of data needed for training monolingual models, as well as for their application, should not be underestimated.
References used
https://aclanthology.org/