Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Mitigating Temporal-Drift: A Simple Approach to Keep NER Models Crisp

التخفيف من الانجراف الزمني: نهج بسيط للحفاظ على نماذج نير هش

845 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

ner models crisp mitigating temporal-drift models crisp نماذج نير هش تخفيف الانجراف الزمني نماذج هش صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Performance of neural models for named entity recognition degrades over time, becoming stale. This degradation is due to temporal drift, the change in our target variables' statistical properties over time. This issue is especially problematic for social media data, where topics change rapidly. In order to mitigate the problem, data annotation and retraining of models is common. Despite its usefulness, this process is expensive and time-consuming, which motivates new research on efficient model updating. In this paper, we propose an intuitive approach to measure the potential trendiness of tweets and use this metric to select the most informative instances to use for training. We conduct experiments on three state-of-the-art models on the Temporal Twitter Dataset. Our approach shows larger increases in prediction accuracy with less training data than the alternatives, making it an attractive, practical solution.

References used

https://aclanthology.org/

rate research

A Simple and Efficient Multi-Task Learning Approach for Conditioned Dialogue Generation

809 - Association for Computation Linguistics 2021 مقالة

Conditioned dialogue generation suffers from the scarcity of labeled responses. In this work, we exploit labeled non-dialogue text data related to the condition, which are much easier to collect. We propose a multi-task learning approach to leverage both labeled dialogue and text data. The 3 tasks jointly optimize the same pre-trained Transformer -- conditioned dialogue generation task on the labeled dialogue data, conditioned language encoding task and conditioned language generation task on the labeled text data. Experimental results show that our approach outperforms the state-of-the-art models by leveraging the labeled texts, and it also obtains larger improvement in performance comparing to the previous methods to leverage text data.

simple and efficient conditioned dialogue generation efficient multi-task learning بسيطة وفعالة توليد الحوار مشروط التعلم متعدد المهام فعالة صناعة حمض الفوسفور المزيد..

Learning Numeracy: A Simple Yet Effective Number Embedding Approach Using Knowledge Graph

857 - Association for Computation Linguistics 2021 مقالة

Numeracy plays a key role in natural language understanding. However, existing NLP approaches, not only traditional word2vec approach or contextualized transformer-based language models, fail to learn numeracy. As the result, the performance of these models is limited when they are applied to number-intensive applications in clinical and financial domains. In this work, we propose a simple number embedding approach based on knowledge graph. We construct a knowledge graph consisting of number entities and magnitude relations. Knowledge graph embedding method is then applied to obtain number vectors. Our approach is easy to implement, and experiment results on various numeracy-related NLP tasks demonstrate the effectiveness and efficiency of our method.

effective number embedding effective number رقم فعال تضمينه رقم فعال صناعة حمض الفوسفور

A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code

563 - Association for Computation Linguistics 2021 مقالة

There is an emerging interest in the application of natural language processing models to source code processing tasks. One of the major problems in applying deep learning to software engineering is that source code often contains a lot of rare ident ifiers, resulting in huge vocabularies. We propose a simple, yet effective method, based on identifier anonymization, to handle out-of-vocabulary (OOV) identifiers. Our method can be treated as a preprocessing step and, therefore, allows for easy implementation. We show that the proposed OOV anonymization method significantly improves the performance of the Transformer in two code processing tasks: code completion and bug fixing.

approach for handling simple approach source code نهج التعامل نهج بسيط مصدر الرمز صناعة حمض الفوسفور المزيد..

ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning

662 - Association for Computation Linguistics 2021 مقالة

While pre-trained language models (PTLMs) have achieved noticeable success on many NLP tasks, they still struggle for tasks that require event temporal reasoning, which is essential for event-centric applications. We present a continual pre-training approach that equips PTLMs with targeted knowledge about event temporal relations. We design self-supervised learning objectives to recover masked-out event and temporal indicators and to discriminate sentences from their corrupted counterparts (where event or temporal indicators got replaced). By further pre-training a PTLM with these objectives jointly, we reinforce its attention to event and temporal information, yielding enhanced capability on event temporal reasoning. This **E**ffective **CON**tinual pre-training framework for **E**vent **T**emporal reasoning (ECONET) improves the PTLMs' fine-tuning performances across five relation extraction and question answering tasks and achieves new or on-par state-of-the-art performances in most of our downstream tasks.

effective continual pretraining event temporal reasoning احتجاج مستمر فعال المنطق الزمني للحدث صناعة حمض الفوسفور

Multitasking Inhibits Semantic Drift

704 - Association for Computation Linguistics 2021 مقالة

When intelligent agents communicate to accomplish shared goals, how do these goals shape the agents' language? We study the dynamics of learning in latent language policies (LLPs), in which instructor agents generate natural-language subgoal descript ions and executor agents map these descriptions to low-level actions. LLPs can solve challenging long-horizon reinforcement learning problems and provide a rich model for studying task-oriented language use. But previous work has found that LLP training is prone to semantic drift (use of messages in ways inconsistent with their original natural language meanings). Here, we demonstrate theoretically and empirically that multitask training is an effective counter to this problem: we prove that multitask training eliminates semantic drift in a well-studied family of signaling games, and show that multitask training of neural LLPs in a complex strategy game reduces drift and while improving sample efficiency.

multitasking inhibits semantic multitasking inhibits inhibits semantic drift تعدد المهام يمنع الدلالية تعدد المهام يمنع يمنع الانجراف الدلالي صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Mitigating Temporal-Drift: A Simple Approach to Keep NER Models Crisp

التخفيف من الانجراف الزمني: نهج بسيط للحفاظ على نماذج نير هش

Ask ChatGPT about the research

Read More

suggested questions