يعد تطوير آليات تكييف أنظمة الحوار المرنة للمهام والمجالات غير المرئية تحديا كبيرا في أبحاث الحوار.تحفظ النماذج العصبية ضمنيا سياسات الحوار الخاصة بمهام المهام من بيانات التدريب.نؤخر أن هذه الحفظ الضمنية قد حظرت التعلم تحويل الصفر بالرصاص.تحقيقا لهذه الغاية، نستفيد من النموذج الموجه المخطط، حيث يتم توفير سياسة الحوار الخاصة بمهام المهام بشكل صريح للنموذج.نقدم نموذج اهتمام المخطط (SAM) وتحسين تمثيلات المخطط للحصول على ستار كوربوس.يحصل SAM على تحسين كبير في إعدادات طلقة صفرية، مع تحسن درجة +22 F1 على العمل السابق.هذه النتائج التحقق من صحة جدوى عملية التعميم الصفري في مربع الحوار.يتم أيضا تقديم تجارب الاجتثاث لإظهار فعالية SAM.
Developing mechanisms that flexibly adapt dialog systems to unseen tasks and domains is a major challenge in dialog research. Neural models implicitly memorize task-specific dialog policies from the training data. We posit that this implicit memorization has precluded zero-shot transfer learning. To this end, we leverage the schema-guided paradigm, wherein the task-specific dialog policy is explicitly provided to the model. We introduce the Schema Attention Model (SAM) and improved schema representations for the STAR corpus. SAM obtains significant improvement in zero-shot settings, with a +22 F1 score improvement over prior work. These results validate the feasibility of zero-shot generalizability in dialog. Ablation experiments are also presented to demonstrate the efficacy of SAM.
References used
https://aclanthology.org/
Natural Language Generation (NLG) for task-oriented dialogue systems focuses on communicating specific content accurately, fluently, and coherently. While these attributes are crucial for a successful dialogue, it is also desirable to simultaneously
Pretrained transformer-based encoders such as BERT have been demonstrated to achieve state-of-the-art performance on numerous NLP tasks. Despite their success, BERT style encoders are large in size and have high latency during inference (especially o
Humans can distinguish new categories very efficiently with few examples, largely due to the fact that human beings can leverage knowledge obtained from relevant tasks. However, deep learning based text classification model tends to struggle to achie
Zero-shot cross-domain dialogue state tracking (DST) enables us to handle unseen domains without the expense of collecting in-domain data. In this paper, we propose a slot descriptions enhanced generative approach for zero-shot cross-domain DST. Spec
Coupled with the availability of large scale datasets, deep learning architectures have enabled rapid progress on the Question Answering task. However, most of those datasets are in English, and the performances of state-of-the-art multilingual model