We present ongoing work on evaluating what is, to our knowledge, the first large generative language model trained to converse in Swedish, using data from the online discussion forum Flashback. We conduct a human evaluation pilot study, which indicates that the model is often able to respond to conversations in a manner that is both human-like and informative, across a diverse set of topics. While data from online forums can be useful for building conversational systems, we reflect on the negative consequences that incautious application might have, and on the need to take active measures to safeguard against them.
References used
https://aclanthology.org/