Dialogue summarization is a long-standing task in NLP, and several data sets that pair dialogues with human-written summaries of different styles exist. However, it remains unclear which type of summary is most appropriate for which type of dialogue. For this reason, we apply a linguistic model of dialogue types to derive matching summary items and NLP tasks. This allows us to map existing dialogue summarization data sets into this model and to identify gaps and potential directions for future work. As part of this process, we also provide an extensive overview of existing dialogue summarization data sets.
References used
https://aclanthology.org/