New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Unsupervised Conversation Disentanglement through Co-Training

محادثة غير مدفوعة من خلال التدريب المشترك

304 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

unsupervised conversation disentanglement conversation disentanglement conversation disentanglement aims محادثة غير محفوظة المحادثة Deventangle. المحادثة Deventangle تداول صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

محادثة Deventangle تهدف إلى فصل الرسائل المتداخلة إلى جلسات منفصلة، وهي مهمة أساسية في فهم المحادثات متعددة الأحزاب. يعتمد العمل الحالي في محادثة DEVENTANGLEMELE بشكل كبير على مجموعات البيانات المشروح البشرية، وهي مكلفة للحصول عليها في الممارسة العملية. في هذا العمل، نستكشف تدريب نموذج محادثة محادثة دون الرجوع إلى أي شروح بشرية. تم بناء طريقتنا على خوارزمية التدريب العميق، والتي تتكون من شبكات اثنين من الشبكات العصبية: مصنف رسالة للزوج وفيديو الجلسة. السابق هو المسؤول عن استرجاع العلاقات المحلية بين رسالتين بينما يقتصر الأخير رسالة إلى جلسة من خلال التقاط معلومات السياق. يتم تهيئة كلتا الشبكتين على التوالي مع بيانات زائفة مبنية من Corpus غير المخلفات. خلال عملية التدريب التعويضي العميق، نستخدم مصنف الجلسة كمكون تعليمي للتعزيز لتعلم جلسة تعيين سياسة من خلال تعظيم المكافآت المحلية التي قدمها مصنف زوج الرسائل. بالنسبة إلى مصنف زوج الرسائل، فإننا نشعر بإثراء بيانات التدريب الخاصة بها عن طريق استرداد أزواج الرسائل بثقة عالية من جلسات DESTANGLED المتوقعة من قبل مصنف الجلسة. النتائج التجريبية على مجموعة بيانات حوار السينما الكبيرة تثبت أن نهجنا المقترح يحقق أداء تنافسي مقارنة بالأساليب الخاضعة للإشراف السابقة. تشير المزيد من التجارب إلى أن محادثات الإعصابات المتوقعة يمكن أن تعزز الأداء على المهمة المصب لمختيار استجابة متعددة الأحزاب.

Conversation disentanglement aims to separate intermingled messages into detached sessions, which is a fundamental task in understanding multi-party conversations. Existing work on conversation disentanglement relies heavily upon human-annotated datasets, which is expensive to obtain in practice. In this work, we explore training a conversation disentanglement model without referencing any human annotations. Our method is built upon the deep co-training algorithm, which consists of two neural networks: a message-pair classifier and a session classifier. The former is responsible of retrieving local relations between two messages while the latter categorizes a message to a session by capturing context-aware information. Both the two networks are initialized respectively with pseudo data built from the unannotated corpus. During the deep co-training process, we use the session classifier as a reinforcement learning component to learn a session assigning policy by maximizing the local rewards given by the message-pair classifier. For the message-pair classifier, we enrich its training data by retrieving message pairs with high confidence from the disentangled sessions predicted by the session classifier. Experimental results on the large Movie Dialogue Dataset demonstrate that our proposed approach achieves competitive performance compared to previous supervised methods. Further experiments show that the predicted disentangled conversations can promote the performance on the downstream task of multi-party response selection.

References used

https://aclanthology.org/

rate research

PLATO-KAG: Unsupervised Knowledge-Grounded Conversation via Joint Modeling

358 - Association for Computation Linguistics 2021 مقالة

Large-scale conversation models are turning to leveraging external knowledge to improve the factual accuracy in response generation. Considering the infeasibility to annotate the external knowledge for large-scale dialogue corpora, it is desirable to learn the knowledge selection and response generation in an unsupervised manner. In this paper, we propose PLATO-KAG (Knowledge-Augmented Generation), an unsupervised learning approach for end-to-end knowledge-grounded conversation modeling. For each dialogue context, the top-k relevant knowledge elements are selected and then employed in knowledge-grounded response generation. The two components of knowledge selection and response generation are optimized jointly and effectively under a balanced objective. Experimental results on two publicly available datasets validate the superiority of PLATO-KAG.

احتج knowledge-grounded conversation modeling نمذجة المحادثة المحادثة المعرفة صناعة حمض الفوسفور

Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios

429 - Association for Computation Linguistics 2021 مقالة

Unsupervised neural machine translation (UNMT) that relies solely on massive monolingual corpora has achieved remarkable results in several translation tasks. However, in real-world scenarios, massive monolingual corpora do not exist for some extreme ly low-resource languages such as Estonian, and UNMT systems usually perform poorly when there is not adequate training corpus for one language. In this paper, we first define and analyze the unbalanced training data scenario for UNMT. Based on this scenario, we propose UNMT self-training mechanisms to train a robust UNMT system and improve its performance in this case. Experimental results on several language pairs show that the proposed methods substantially outperform conventional UNMT systems.

آلة ذات مستوى المستند صناعة حمض الفوسفور

ForumSum: A Multi-Speaker Conversation Summarization Dataset

264 - Association for Computation Linguistics 2021 مقالة

Abstractive summarization quality had large improvements since recent language pretraining techniques. However, currently there is a lack of datasets for the growing needs of conversation summarization applications. Thus we collected ForumSum, a dive rse and high-quality conversation summarization dataset with human written summaries. The conversations in ForumSum dataset are collected from a wide variety of internet forums. To make the dataset easily expandable, we also release the process of dataset creation. Our experiments show that models trained on ForumSum have better zero-shot and few-shot transferability to other datasets than the existing large chat summarization dataset SAMSum. We also show that using a conversational corpus for pre-training improves the quality of the chat summarization model.

multi-speaker conversation summarization conversation summarization dataset تلخيص محادثة متعددة المتكلم محادثة ملخص DataSet. صناعة حمض الفوسفور

Unsupervised Representation Disentanglement of Text: An Evaluation on Synthetic Datasets

604 - Association for Computation Linguistics 2021 مقالة

To highlight the challenges of achieving representation disentanglement for text domain in an unsupervised setting, in this paper we select a representative set of successfully applied models from the image domain. We evaluate these models on 6 disen tanglement metrics, as well as on downstream classification tasks and homotopy. To facilitate the evaluation, we propose two synthetic datasets with known generative factors. Our experiments highlight the existing gap in the text domain and illustrate that certain elements such as representation sparsity (as an inductive bias), or representation coupling with the decoder could impact disentanglement. To the best of our knowledge, our work is the first attempt on the intersection of unsupervised representation disentanglement and text, and provides the experimental framework and datasets for examining future developments in this direction.

unsupervised representation disentanglement representation disentanglement synthetic datasets devent الانسحاب غير المدعوم تمثيل disentanglement مجموعات البيانات الاصطناعية صناعة حمض الفوسفور المزيد..

Co-Teaching Student-Model through Submission Results of Shared Task

228 - Association for Computation Linguistics 2021 مقالة

Shared tasks have a long history and have become the mainstream of NLP research. Most of the shared tasks require participants to submit only system outputs and descriptions. It is uncommon for the shared task to request submission of the system itse lf because of the license issues and implementation differences. Therefore, many systems are abandoned without being used in real applications or contributing to better systems. In this research, we propose a scheme to utilize all those systems which participated in the shared tasks. We use all participated system outputs as task teachers in this scheme and develop a new model as a student aiming to learn the characteristics of each system. We call this scheme Co-Teaching.'' This scheme creates a unified system that performs better than the task's single best system. It only requires the system outputs, and slightly extra effort is needed for the participants and organizers. We apply this scheme to the SHINRA2019-JP'' shared task, which has nine participants with various output accuracies, confirming that the unified system outperforms the best system. Moreover, the code used in our experiments has been released.

submission results نتائج التقديم صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Unsupervised Conversation Disentanglement through Co-Training

محادثة غير مدفوعة من خلال التدريب المشترك

Ask ChatGPT about the research

Read More

suggested questions