تصف هذه الورقة طريقة للتعلم من ردود فعل تصحيحية غير موثوق بها لمعلم المعلم في إعداد مراقبة مهمة تفاعلية.يستخدم النموذج الرسومي خطاب التماسك للتعلم بشكل مشترك عن رمز التأريض ومفاهيم المجال والخطط الصالحة.تبين تجاربنا أن الوكيل يتعلم مهمته على مستوى المجال على الرغم من أخطاء المعلم.
This paper describes a method for learning from a teacher's potentially unreliable corrective feedback in an interactive task learning setting. The graphical model uses discourse coherence to jointly learn symbol grounding, domain concepts and valid plans. Our experiments show that the agent learns its domain-level task in spite of the teacher's mistakes.
References used
https://aclanthology.org/
Linguistic representations derived from text alone have been criticized for their lack of grounding, i.e., connecting words to their meanings in the physical world. Vision-and- Language (VL) models, trained jointly on text and image or video data, ha
We investigate grounded language learning through real-world data, by modelling a teacher-learner dynamics through the natural interactions occurring between users and search engines; in particular, we explore the emergence of semantic generalization
Phrase grounding aims to map textual phrases to their associated image regions, which can be a prerequisite for multimodal reasoning and can benefit tasks requiring identifying objects based on language. With pre-trained vision-and-language models ac
Extracting temporal information is critical to process health-related text. Temporal information extraction is a challenging task for language models because it requires processing both texts and numbers. Moreover, the fundamental challenge is how to
Word vector representations are an essential part of an NLP curriculum. Here, we describe a homework that has students implement a popular method for learning word vectors, word2vec. Students implement the core parts of the method, including text pre