كلمة الويب الأيسلندية الجديدة (IW) هي إعادة تصميم تكنولوجيا لغة تركز على قاعدة بيانات معجمية للإدخالات ذات الصلة الدوالية.تم تنفيذ كيانات IW والعلاقات وخطط البيانات الوصفية والتصنيف من الصفر في أنظمة اثنين، Ontolex and Skos.بعد إجراء بعض التعديلات على نظام التشغيل البيني Ontolex و SKOS، كان من الممكن أيضا تنفيذ ميزات IW المحددة التي، في حين أن المحتمل غير المحتمل، تشكل جزءا لا يتجزأ من الوظيفة المعجمية للإنترنت Word.أيضا في هذا التنفيذ يتم الوصول إلى كمية أكبر من البيانات اللغوية، ومجموعة أكبر من خيارات البحث، وإمكانية معالجة الآلي، والقدرة على إجراء البحوث من خلال Sparql دون امتلاك إتقان أيسلندي.
The new Icelandic Word Web (IW) is a language technology focused redesign of a lexicosemantic database of semantically related entries. The IW's entities, relations, metadata and categorization scheme have all been implemented from scratch in two systems, OntoLex and SKOS. After certain adjustments were made to OntoLex and SKOS interoperability, it was also possible to implement specific IW features that, while potentially nonstandard, form an integral part of the Word Web's lexicosemantic functionality. Also new in this implementation are access to a larger amount of linguistic data, a greater variety of search options, the possibility of automated processing, and the ability to conduct research through SPARQL without possessing a mastery of Icelandic.
References used
https://aclanthology.org/
Word representations empowered with additional linguistic information have been widely studied and proved to outperform traditional embeddings. Current methods mainly focus on learning embeddings for words while embeddings of linguistic information (
This contribution describes a two-course module that seeks to provide humanities majors with a basic understanding of language technology and its applications using Python. The learning materials consist of interactive Jupyter Notebooks and accompanying YouTube videos, which are openly available with a Creative Commons licence.
We introduce a new approach for smoothing and improving the quality of word embeddings. We consider a method of fusing word embeddings that were trained on the same corpus but with different initializations. We project all the models to a shared vect
We present Query2Prod2Vec, a model that grounds lexical representations for product search in product embeddings: in our model, meaning is a mapping between words and a latent space of products in a digital shop. We leverage shopping sessions to lear
In this paper we introduce a vision towards establishing the Malta National Language Technology Platform; an ongoing effort that aims to provide a basis for enhancing Malta's official languages, namely Maltese and English, using Machine Translation.