Research papers, master and doctoral theses about systems

``It doesn't look good for a date'': Transforming Critiques into Preferences for Conversational Recommendation Systems

368 - Association for Computation Linguistics 2021 مقالة

Conversations aimed at determining good recommendations are iterative in nature. People often express their preferences in terms of a critique of the current recommendation (e.g., It doesn't look good for a date''), requiring some degree of common se nse for a preference to be inferred. In this work, we present a method for transforming a user critique into a positive preference (e.g., I prefer more romantic'') in order to retrieve reviews pertaining to potentially better recommendations (e.g., Perfect for a romantic dinner''). We leverage a large neural language model (LM) in a few-shot setting to perform critique-to-preference transformation, and we test two methods for retrieving recommendations: one that matches embeddings, and another that fine-tunes an LM for the task. We instantiate this approach in the restaurant domain and evaluate it using a new dataset of restaurant critiques. In an ablation study, we show that utilizing critique-to-preference transformation improves recommendations, and that there are at least three general cases that explain this improved performance.

conversational recommendation systems recommendation systems أنظمة توصية المحادثة أنظمة التوصية صناعة حمض الفوسفور

Retrieval, Analogy, and Composition: A framework for Compositional Generalization in Image Captioning

326 - Association for Computation Linguistics 2021 مقالة

Image captioning systems are expected to have the ability to combine individual concepts when describing scenes with concept combinations that are not observed during training. In spite of significant progress in image captioning with the help of the autoregressive generation framework, current approaches fail to generalize well to novel concept combinations. We propose a new framework that revolves around probing several similar image caption training instances (retrieval), performing analogical reasoning over relevant entities in retrieved prototypes (analogy), and enhancing the generation process with reasoning outcomes (composition). Our method augments the generation model by referring to the neighboring instances in the training set to produce novel concept combinations in generated captions. We perform experiments on the widely used image captioning benchmarks. The proposed models achieve substantial improvement over the compared baselines on both composition-related evaluation metrics and conventional image captioning metrics.

نص كوربوس image captioning systems نظم تسمية الصورة صناعة حمض الفوسفور

Multilingual Paraphrase Generation For Bootstrapping New Features in Task-Oriented Dialog Systems

256 - Association for Computation Linguistics 2021 مقالة

The lack of labeled training data for new features is a common problem in rapidly changing real-world dialog systems. As a solution, we propose a multilingual paraphrase generation model that can be used to generate novel utterances for a target feat ure and target language. The generated utterances can be used to augment existing training data to improve intent classification and slot labeling models. We evaluate the quality of generated utterances using intrinsic evaluation metrics and by conducting downstream evaluation experiments with English as the source language and nine different target languages. Our method shows promise across languages, even in a zero-shot setting where no seed data is available.

task-oriented dialog systems dialog systems multilingual paraphrase generation أنظمة الحوار إعادة صياغة إعادة صياغة متعددة اللغات صناعة حمض الفوسفور

Sentence Bottleneck Autoencoders from Transformer Language Models

185 - Association for Computation Linguistics 2021 مقالة

Representation learning for text via pretraining a language model on a large corpus has become a standard starting point for building NLP systems. This approach stands in contrast to autoencoders, also trained on raw text, but with the objective of l earning to encode each input as a vector that allows full reconstruction. Autoencoders are attractive because of their latent space structure and generative properties. We therefore explore the construction of a sentence-level autoencoder from a pretrained, frozen transformer language model. We adapt the masked language modeling objective as a generative, denoising one, while only training a sentence bottleneck and a single-layer modified transformer decoder. We demonstrate that the sentence representations discovered by our model achieve better quality than previous methods that extract representations from pretrained transformers on text similarity tasks, style transfer (an example of controlled generation), and single-sentence classification tasks in the GLUE benchmark, while using fewer parameters than large pretrained models.

building nlp systems بناء أنظمة NLP. صناعة حمض الفوسفور

The NiuTrans Machine Translation Systems for WMT21

203 - Association for Computation Linguistics 2021 مقالة

This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. We made submissions to 9 language directions, including English2Chinese, Japanese, Russian, Icelandic and English2Hausa tasks. Our primary system s are built on several effective variants of Transformer, e.g., Transformer-DLCL, ODE-Transformer. We also utilize back-translation, knowledge distillation, post-ensemble, and iterative fine-tuning techniques to enhance the model performance further.

machine translation systems niutrans machine translation أنظمة الترجمة الآلية niutrans ترجمة آلة صناعة حمض الفوسفور

DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization

244 - Association for Computation Linguistics 2021 مقالة

Identifying relevant knowledge to be used in conversational systems that are grounded in long documents is critical to effective response generation. We introduce a knowledge identification model that leverages the document structure to provide dialo gue-contextualized passage encodings and better locate knowledge relevant to the conversation. An auxiliary loss captures the history of dialogue-document connections. We demonstrate the effectiveness of our model on two document-grounded conversational datasets and provide analyses showing generalization to unseen documents and long dialogue contexts.

conversational systems dialogue-document contextualization knowledge identification نظم المحادثة حوار بوثيقة وثيقة تحديد المعرفة صناعة حمض الفوسفور المزيد..

The LMU Munich Systems for the WMT21 Unsupervised and Very Low-Resource Translation Task

277 - Association for Computation Linguistics 2021 مقالة

We present our submissions to the WMT21 shared task in Unsupervised and Very Low Resource machine translation between German and Upper Sorbian, German and Lower Sorbian, and Russian and Chuvash. Our low-resource systems (German↔Upper Sorbian, Russian ↔Chuvash) are pre-trained on high-resource pairs of related languages. We fine-tune those systems using the available authentic parallel data and improve by iterated back-translation. The unsupervised German↔Lower Sorbian system is initialized by the best Upper Sorbian system and improved by iterated back-translation using monolingual data only.

lmu munich systems lmu munich upper sorbian نظم LMU ميونيخ LMU ميونيخ السامية العليا صناعة حمض الفوسفور المزيد..

CUNI Systems in WMT21: Revisiting Backtranslation Techniques for English-Czech NMT

157 - Association for Computation Linguistics 2021 مقالة

We describe our two NMT systems submitted to the WMT2021 shared task in English-Czech news translation: CUNI-DocTransformer (document-level CUBBITT) and CUNI-Marian-Baselines. We improve the former with a better sentence-segmentation pre-processing a nd a post-processing for fixing errors in numbers and units. We use the latter for experiments with various backtranslation techniques.

revisiting backtranslation techniques cuni systems revisiting backtranslation إعادة النظر في تقنيات الخلفية أنظمة CUNI إعادة النظر وراء الترجمة صناعة حمض الفوسفور المزيد..

Countering the Influence of Essay Length in Neural Essay Scoring

62 - Association for Computation Linguistics 2021 مقالة

Previous work has shown that automated essay scoring systems, in particular machine learning-based systems, are not capable of assessing the quality of essays, but are relying on essay length, a factor irrelevant to writing proficiency. In this work, we first show that state-of-the-art systems, recent neural essay scoring systems, might be also influenced by the correlation between essay length and scores in a standard dataset. In our evaluation, a very simple neural model shows the state-of-the-art performance on the standard dataset. To consider essay content without taking essay length into account, we introduce a simple neural model assessing the similarity of content between an input essay and essays assigned different scores. This neural model achieves performance comparable to the state of the art on a standard dataset as well as on a second dataset. Our findings suggest that neural essay scoring systems should consider the characteristics of datasets to focus on text quality.

neural essay scoring essay scoring systems countering the influence مقال عصبي التسجيل مقال سجل النظم مواجهة التأثير صناعة حمض الفوسفور المزيد..

Simulated Chats for Building Dialog Systems: Learning to Generate Conversations from Instructions

57 - Association for Computation Linguistics 2021 مقالة

Popular dialog datasets such as MultiWOZ are created by providing crowd workers an instruction, expressed in natural language, that describes the task to be accomplished. Crowd workers play the role of a user and an agent to generate dialogs to accom plish tasks involving booking restaurant tables, calling a taxi etc. In this paper, we present a data creation strategy that uses the pre-trained language model, GPT2, to simulate the interaction between crowd workers by creating a user bot and an agent bot. We train the simulators using a smaller percentage of actual crowd-generated conversations and their corresponding instructions. We demonstrate that by using the simulated data, we achieve significant improvements in low-resource settings on two publicly available datasets - MultiWOZ dataset and the Persona chat dataset.

building dialog systems نظم حوار البناء صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد