Augmenting BERT-style Models with Predictive Coding to Improve Discourse-level Representations

Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor

Keywords: augmenting BERT-style models, discourse-level representations, predictive coding

Current language models are usually trained using a self-supervised scheme, where the main focus is learning representations at the word or sentence level. However, there has been limited progress in generating useful discourse-level representations. In this work, we propose to use ideas from predictive coding theory to augment BERT-style language models with a mechanism that allows them to learn suitable discourse-level representations. As a result, our proposed approach is able to predict future sentences using explicit top-down connections that operate at the intermediate layers of the network. By experimenting with benchmarks designed to evaluate discourse-related knowledge using pre-trained sentence representations, we demonstrate that our approach improves performance in 6 out of 11 tasks by excelling in discourse relationship detection.
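The abstract does not spell out how future sentences are predicted or scored, so the following is only a minimal sketch of the general idea under stated assumptions: a linear top-down predictor maps a context vector to a predicted future-sentence vector, and a contrastive (InfoNCE-style) loss scores the true next sentence against distractors. The function names, the linear predictor, and the choice of loss are illustrative assumptions, not the authors' implementation.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def top_down_predict(context, weights):
    """Hypothetical top-down predictor: a linear map from a context
    vector to a predicted representation of a future sentence."""
    return [dot(row, context) for row in weights]


def contrastive_loss(prediction, candidates, positive_index):
    """InfoNCE-style loss (an assumption, not taken from the abstract):
    negative log-probability of the true next sentence among candidates,
    where scores are dot products with the prediction."""
    scores = [dot(prediction, c) for c in candidates]
    top = max(scores)  # subtract the max for numerical stability
    log_norm = top + math.log(sum(math.exp(s - top) for s in scores))
    return log_norm - scores[positive_index]


# Toy usage: the predictor is the identity, so the prediction equals the context.
context = [1.0, 0.0]
weights = [[1.0, 0.0], [0.0, 1.0]]
prediction = top_down_predict(context, weights)
candidates = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
loss_if_first_is_true = contrastive_loss(prediction, candidates, 0)
loss_if_last_is_true = contrastive_loss(prediction, candidates, 2)
```

In this toy setup the loss is lower when the true next sentence is the candidate closest to the prediction, which is the behaviour a predictive objective of this kind is meant to reward.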

References used

https://aclanthology.org/

Text Style Transfer: Leveraging a Style Classifier on Entangled Latent Representations

Association for Computational Linguistics, 2021 (article)

Learning a good latent representation is essential for text style transfer, which generates a new sentence by changing the attributes of a given sentence while preserving its content. Most previous works adopt disentangled latent representation learning to realize style transfer. We propose a novel text style transfer algorithm with entangled latent representation, and introduce a style classifier that can regulate the latent structure and transfer style. Moreover, our algorithm for style transfer applies to both single-attribute and multi-attribute transfer. Extensive experimental results show that our method generally outperforms state-of-the-art approaches.

Keywords: text style transfer, style transfer, latent representation

Probing Contextual Language Models for Common Ground with Visual Representations

Association for Computational Linguistics, 2021 (article)

The success of large-scale contextual language models has attracted great interest in probing what is encoded in their representations. In this work, we consider a new question: to what extent are contextual representations of concrete nouns aligned with corresponding visual representations? We design a probing model that evaluates how effective text-only representations are in distinguishing between matching and non-matching visual representations. Our findings show that language representations alone provide a strong signal for retrieving image patches from the correct object categories. Moreover, they are effective in retrieving specific instances of image patches; textual context plays an important role in this process. Visually grounded language models slightly outperform text-only language models in instance retrieval, but greatly underperform humans. We hope our analyses inspire future research in understanding and improving the visual capabilities of language models.

Keywords: contextual language models, preventing semantic drift

EDTC: A Corpus for Discourse-Level Topic Chain Parsing

Association for Computational Linguistics, 2021 (article)

Discourse analysis has long been known to be fundamental in natural language processing. In this research, we present our insight on discourse-level topic chain (DTC) parsing, which aims at discovering new topics and investigating how these topics evolve over time within an article. To address the lack of data, we contribute a new discourse corpus with DTC-style dependency graphs annotated upon news articles. In particular, we ensure the high reliability of the corpus by utilizing a two-step annotation strategy to build the data and filtering out the annotations with low confidence scores. Based on the annotated corpus, we introduce a simple yet robust system for automatic discourse-level topic chain parsing.

Keywords: discourse-level topic chain, topic chain parsing, topic chain

ur-iw-hnt at GermEval 2021: An Ensembling Strategy with Multiple BERT Models

Association for Computational Linguistics, 2021 (article)

This paper describes our approach (ur-iw-hnt) for the Shared Task of GermEval2021 to identify toxic, engaging, and fact-claiming comments. We submitted three runs using an ensembling strategy by majority (hard) voting with multiple different BERT models of three different types: German-based, Twitter-based, and multilingual models. All ensemble models outperform single models, while BERTweet is the best individual model in every subtask. Twitter-based models perform better than GermanBERT models, and multilingual models perform worse but by a small margin.
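Majority (hard) voting, as named in the abstract above, can be sketched in a few lines. This is a generic illustration rather than the authors' code: the function name and the toy predictions are assumptions, and each inner list stands for one model's predicted labels over the same comments.

```python
from collections import Counter


def majority_vote(model_predictions):
    """Combine per-model label lists by hard voting.

    model_predictions: list of lists, one list of labels per model,
    all covering the same examples in the same order.
    Returns one label per example: the most frequent label among models.
    Ties go to the label seen first, so an odd number of models
    is the safer choice for binary tasks.
    """
    ensemble = []
    for labels in zip(*model_predictions):
        ensemble.append(Counter(labels).most_common(1)[0][0])
    return ensemble


# Toy usage with three hypothetical models and three comments.
votes = majority_vote([
    [1, 0, 0],  # model A
    [1, 1, 0],  # model B
    [0, 1, 0],  # model C
])
```

Hard voting uses only the predicted labels, not the models' confidence scores, which keeps the ensemble simple when the component models are heterogeneous.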

Keywords: multiple BERT models, ensembling strategy, BERT models

Sesame Street to Mount Sinai: BERT-constrained character-level Moses models for multilingual lexical normalization

Association for Computational Linguistics, 2021 (article)

This paper describes the HEL-LJU submissions to the MultiLexNorm shared task on multilingual lexical normalization. Our system is based on a BERT token classification preprocessing step, where for each token the type of the necessary transformation is predicted (none, uppercase, lowercase, capitalize, modify), and a character-level SMT step where the text is translated from original to normalized given the BERT-predicted transformation constraints. For some languages, depending on the results on development data, the training data was extended by back-translating OpenSubtitles data. In the final ordering of the ten participating teams, the HEL-LJU team took second place, scoring better than the previous state of the art.
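To make the five transformation types concrete, here is one possible reading of what each label means for a single token pair. The abstract does not say how labels are assigned, so the function name and the matching order below are assumptions made for illustration only.

```python
def transformation_label(original, normalized):
    """Hypothetical labelling of an (original, normalized) token pair
    with one of the five types listed in the abstract."""
    if normalized == original:
        return "none"
    if normalized == original.upper():
        return "uppercase"
    if normalized == original.lower():
        return "lowercase"
    if normalized == original.capitalize():
        return "capitalize"
    # Anything that is not a pure casing change needs a real rewrite.
    return "modify"


# Toy usage on a few hypothetical token pairs.
pairs = [("hello", "hello"), ("usa", "USA"), ("HELLO", "hello"),
         ("paris", "Paris"), ("u", "you")]
labels = [transformation_label(o, n) for o, n in pairs]
```

Note that the order of the checks matters for single-letter tokens, where uppercasing and capitalizing coincide; this sketch arbitrarily prefers "uppercase" in that case.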

Keywords: BERT-constrained character-level Moses, multilingual lexical normalization, character-level Moses models
