New community

Subscribe to the gold package and get unlimited access to Shamra Academy

MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

Mindcraft: نظرية العقل النمذجة للحوار الموقع في المهام التعاونية

235 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

An ideal integration of autonomous agents in a human world implies that they are able to collaborate on human terms. In particular, theory of mind plays an important role in maintaining common ground during human collaboration and communication. To enable theory of mind modeling in situated interactions, we introduce a fine-grained dataset of collaborative tasks performed by pairs of human subjects in the 3D virtual blocks world of Minecraft. It provides information that captures partners' beliefs of the world and of each other as an interaction unfolds, bringing abundant opportunities to study human collaborative behaviors in situated language communication. As a first step towards our goal of developing embodied AI agents able to infer belief states of collaborative partners in situ, we build and present results on computational models for several theory of mind tasks.

References used

https://aclanthology.org/

rate research

Pretraining the Noisy Channel Model for Task-Oriented Dialogue

490 - Association for Computation Linguistics 2021 مقالة

Abstract Direct decoding for task-oriented dialogue is known to suffer from the explaining-away effect, manifested in models that prefer short and generic responses. Here we argue for the use of Bayes' theorem to factorize the dialogue task into two models, the distribution of the context given the response, and the prior for the response itself. This approach, an instantiation of the noisy channel model, both mitigates the explaining-away effect and allows the principled incorporation of large pretrained models for the response prior. We present extensive experiments showing that a noisy channel model decodes better responses compared to direct decoding and that a two-stage pretraining strategy, employing both open-domain and task-oriented dialogue data, improves over randomly initialized models.

noisy channel model task-oriented dialogue channel model نموذج القناة صاخبة حوار موجه نحو المهام نموذج القناة صناعة حمض الفوسفور المزيد..

Textual Time Travel: A Temporally Informed Approach to Theory of Mind

329 - Association for Computation Linguistics 2021 مقالة

Natural language processing systems such as dialogue agents should be able to reason about other people's beliefs, intentions and desires. This capability, called theory of mind (ToM), is crucial, as it allows a model to predict and interpret the nee ds of users based on their mental states. A recent line of research evaluates the ToM capability of existing memory-augmented neural models through question-answering. These models perform poorly on false belief tasks where beliefs differ from reality, especially when the dataset contains distracting sentences. In this paper, we propose a new temporally informed approach for improving the ToM capability of memory-augmented neural models. Our model incorporates priors about the entities' minds and tracks their mental states as they evolve over time through an extended passage. It then responds to queries through textual time travel--i.e., by accessing the stored memory of an earlier time step. We evaluate our model on ToM datasets and find that this approach improves performance, particularly by correcting the predicted mental states to match the false belief.

temporally informed approach temporally informed textual time travel نهج مستنير مؤقتا أبلغ مؤقتا السفر النص النصي صناعة حمض الفوسفور المزيد..

Speaker Turn Modeling for Dialogue Act Classification

357 - Association for Computation Linguistics 2021 مقالة

Dialogue Act (DA) classification is the task of classifying utterances with respect to the function they serve in a dialogue. Existing approaches to DA classification model utterances without incorporating the turn changes among speakers throughout t he dialogue, therefore treating it no different than non-interactive written text. In this paper, we propose to integrate the turn changes in conversations among speakers when modeling DAs. Specifically, we learn conversation-invariant speaker turn embeddings to represent the speaker turns in a conversation; the learned speaker turn embeddings are then merged with the utterance embeddings for the downstream task of DA classification. With this simple yet effective mechanism, our model is able to capture the semantics from the dialogue content while accounting for different speaker turns in a conversation. Validation on three benchmark public datasets demonstrates superior performance of our model.

dialogue act classification act classification تصنيف قانون الحوار تصنيف التصنيف صناعة حمض الفوسفور

Intention Reasoning Network for Multi-Domain End-to-end Task-Oriented Dialogue

310 - Association for Computation Linguistics 2021 مقالة

Recent years has witnessed the remarkable success in end-to-end task-oriented dialog system, especially when incorporating external knowledge information. However, the quality of most existing models' generated response is still limited, mainly due t o their lack of fine-grained reasoning on deterministic knowledge (w.r.t. conceptual tokens), which makes them difficult to capture the concept shifts and identify user's real intention in cross-task scenarios. To address these issues, we propose a novel intention mechanism to better model deterministic entity knowledge. Based on such a mechanism, we further propose an intention reasoning network (IR-Net), which consists of joint and multi-hop reasoning, to obtain intention-aware representations of conceptual tokens that can be used to capture the concept shifts involved in task-oriented conversations, so as to effectively identify user's intention and generate more accurate responses. Experimental results verify the effectiveness of IR-Net, showing that it achieves the state-of-the-art performance on two representative multi-domain dialog datasets.

حوار المعرفة intention reasoning network reasoning network شبكة بناء الشبكة شبكة التفكير صناعة حمض الفوسفور

Builder, we have done it: Evaluating \& Extending Dialogue-AMR NLU Pipeline for Two Collaborative Domains

172 - Association for Computation Linguistics 2021 مقالة

We adopt, evaluate, and improve upon a two-step natural language understanding (NLU) pipeline that incrementally tames the variation of unconstrained natural language input and maps to executable robot behaviors. The pipeline first leverages Abstract Meaning Representation (AMR) parsing to capture the propositional content of the utterance, and second converts this into Dialogue-AMR,'' which augments standard AMR with information on tense, aspect, and speech acts. Several alternative approaches and training datasets are evaluated for both steps and corresponding components of the pipeline, some of which outperform the original. We extend the Dialogue-AMR annotation schema to cover a different collaborative instruction domain and evaluate on both domains. With very little training data, we achieve promising performance in the new domain, demonstrating the scalability of this approach.

extending dialogue-amr nlu dialogue-amr nlu pipeline تمديد الحوار-عمرو NLU الحوار-عمرو نلو خط أنابيب صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

Mindcraft: نظرية العقل النمذجة للحوار الموقع في المهام التعاونية

Ask ChatGPT about the research

Read More

suggested questions