تستكشف هذه الورقة موضوع قابلية النقل، كمنطقة فرعية تتعلق بالتعميم.من خلال اقتراح استخدام المقاييس بناء على إحصاءات راسخة، يمكننا تقدير التغيير في أداء نماذج NLP في سياقات جديدة.قد تسمح تحديد مقياس جديد لقابلية النقل بتقدير أفضل لأداء نظام NLP في مجالات جديدة، وهو أمر بالغ الأهمية عند تقييم أداء أنظمة NLP في مهام ومجال جديدة.من خلال العديد من مثيلات التعقيد المتزايد، نوضح كيف يمكن استخدام تدابير التشابه المجال خفيف الوزن كمقرات لقابلية النقل في تطبيقات NLP.يتم تقييم تدابير النقل المقترحة في سياق التعرف على الكيان المسمى ومهام الاستدلال باللغة الطبيعية.
This paper explores the topic of transportability, as a sub-area of generalisability. By proposing the utilisation of metrics based on well-established statistics, we are able to estimate the change in performance of NLP models in new contexts. Defining a new measure for transportability may allow for better estimation of NLP system performance in new domains, and is crucial when assessing the performance of NLP systems in new tasks and domains. Through several instances of increasing complexity, we demonstrate how lightweight domain similarity measures can be used as estimators for the transportability in NLP applications. The proposed transportability measures are evaluated in the context of Named Entity Recognition and Natural Language Inference tasks.
References used
https://aclanthology.org/
A long-standing issue with paraphrase generation is the lack of reliable supervision signals. In this paper, we propose a new unsupervised paradigm for paraphrase generation based on the assumption that the probabilities of generating two sentences w
In this tutorial, we aim at bringing interested NLP researchers up to speed about the recent and ongoing techniques for document-level representation learning. Additionally, our goal is to reveal new research opportunities to the audience, which will hopefully bring us closer to address existing challenges in this domain.
Event detection (ED) aims at identifying event instances of specified types in given texts, which has been formalized as a sequence labeling task. As far as we know, existing neural-based ED models make decisions relying entirely on the contextual se
Question Answering (QA) tasks requiring information from multiple documents often rely on a retrieval model to identify relevant information for reasoning. The retrieval model is typically trained to maximize the likelihood of the labeled supporting
Automated story generation remains a difficult area of research because it lacks strong objective measures. Generated stories may be linguistically sound, but in many cases suffer poor narrative coherence required for a compelling, logically-sound st