New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Competency Problems: On Finding and Removing Artifacts in Language Data

مشاكل الكفاءة: عند العثور وإزالة القطع الأثرية في بيانات اللغة

132 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

finding and removing removing artifacts competency problems العثور والإزالة إزالة القطع الأثرية مشاكل الكفاءة صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

قام الكثير من الأعمال الحديثة في NLP بتوثيق القطع الأثرية لحجم البيانات والتحيز والارتباطات الزائفة بين ميزات الإدخال وتسميات الإخراج. ومع ذلك، وكيفية معرفة الميزات التي تحتوي على زائفة "بدلا من الارتباطات المشروعة عادة ما تكون غير محددة. في هذا العمل، نقول أنه بالنسبة لمهام فهم اللغة المعقدة، فإن جميع الارتباطات البسيطة الميزة زائفة، ونحن نقضي بإضفاء الطابع الرسمي على هذه الفكرة في فئة من المشاكل التي نسميها مشاكل الكفاءة. على سبيل المثال، يجب أن تعطي الكلمة مذهلة '' من تلقاء نفسها معلومات حول تسمية المعنويات مستقلة عن السياق الذي يظهر فيه، مما قد يشمل النفي والاستعارة والسخرية وما إلى ذلك. نحن نظريين تحليلا نظريا صعوبة في إنشاء بيانات لمشاكل الكفاءة عندما يتم أخذ التحيز البشري في الاعتبار، إظهار أن مجموعات البيانات الواقعية ستخلص بشكل متزايد من مشاكل الكفاءة حيث يزيد حجم مجموعة البيانات. يمنحنا هذا التحليل اختبارا إحصائيا بسيطا ل Artifacts DataSet، والذي نستخدمه لإظهار المزيد من التحيزات الدقيقة أكثر من الموصوفة في العمل السابق، بما في ذلك إظهار أن النماذج تتأثر بشكل غير لائق من هذه التحيزات الأقل شاقة. يسمح لنا العلاج النظري لهذه المشكلة أيضا بتحليل الحلول المقترحة، مثل إجراء تعديلات محلية لحالات البيانات، وتقديم توصيات لجمع البيانات المستقبلية وجهود تصميم النماذج التي تستهدف مشاكل الكفاءة.

Much recent work in NLP has documented dataset artifacts, bias, and spurious correlations between input features and output labels. However, how to tell which features have spurious'' instead of legitimate correlations is typically left unspecified. In this work we argue that for complex language understanding tasks, all simple feature correlations are spurious, and we formalize this notion into a class of problems which we call competency problems. For example, the word amazing'' on its own should not give information about a sentiment label independent of the context in which it appears, which could include negation, metaphor, sarcasm, etc. We theoretically analyze the difficulty of creating data for competency problems when human bias is taken into account, showing that realistic datasets will increasingly deviate from competency problems as dataset size increases. This analysis gives us a simple statistical test for dataset artifacts, which we use to show more subtle biases than were described in prior work, including demonstrating that models are inappropriately affected by these less extreme biases. Our theoretical treatment of this problem also allows us to analyze proposed solutions, such as making local edits to dataset instances, and to give recommendations for future data collection and model design efforts that target competency problems.

References used

https://aclanthology.org/

rate research

Crowdsourcing Natural Language Data at Scale: A Hands-On Tutorial

367 - Association for Computation Linguistics 2021 مقالة

In this tutorial, we present a portion of unique industry experience in efficient natural language data annotation via crowdsourcing shared by both leading researchers and engineers from Yandex. We will make an introduction to data labeling via publi c crowdsourcing marketplaces and will present the key components of efficient label collection. This will be followed by a practical session, where participants address a real-world language resource production task, experiment with selecting settings for the labeling process, and launch their label collection project on one of the largest crowdsourcing marketplaces. The projects will be run on real crowds within the tutorial session and we will present useful quality control techniques and provide the attendees with an opportunity to discuss their own annotation ideas.

natural language data crowdsourcing natural language بيانات اللغة الطبيعية growdsourcing اللغة الطبيعية صناعة حمض الفوسفور

Finding BERT's Idiomatic Key

505 - Association for Computation Linguistics 2021 مقالة

Sentence embeddings encode information relating to the usage of idioms in a sentence. This paper reports a set of experiments that combine a probing methodology with input masking to analyse where in a sentence this idiomatic information is taken fro m, and what form it takes. Our results indicate that BERT's idiomatic key is primarily found within an idiomatic expression, but also draws on information from the surrounding context. Also, BERT can distinguish between the disruption in a sentence caused by words missing and the incongruity caused by idiomatic usage.

finding bert idiomatic bert idiomatic key finding bert العثور على بيرت itiomatic. بيرت Itiomatic Key. العثور على بيرت صناعة حمض الفوسفور المزيد..

Assessing Nurse Managers' Competency in Implementing International Competency Standards in Al- Assad University Hospital and Al- Wattany Hospital at Lattakia

1906 - Tishreen University 2015 ورقة بحثية

Introduction:The nurse manager role and how it is carried out in healthcare organizations has been an important topic which had emphasized by many international organizations and institutions during the past two decades. Thus, this role had impact on nursing staff, other healthcare providers, patients and on healthcare environment as whole. To carry out her duties, nurse manager needs to build up and improve her personal skills and leadership competencies continuously. This by depending on international competency standards which developed by international organizations and institutions to be the basic to assess and improve nurse managers competency. Depending on these standards, this study was conducted and aimed to assess nurse managers' competency in implementing international competency standards in Al- Assad university hospital and Al- Wattany hospital at Lattakia. Thestudy was conducted on Al- Assad university and Al- Wattany hospitals at Lattakia. Whereas, The study sample consisted of 21 nurse managers , and 80 nurses from two hospitals. The main results ofthis study was that there was agreement between nurses 'viewpoints and nurse managers' viewpoint in both hospitals on nurse managers' implementation of international competencies standards.

الكفاءة كفاءة رئيسات الشعب التمريضية معايير الكفاءة العالمية competency nurse managers' competencies international competencies standards

Finding Pragmatic Differences Between Disciplines

230 - Association for Computation Linguistics 2021 مقالة

Scholarly documents have a great degree of variation, both in terms of content (semantics) and structure (pragmatics). Prior work in scholarly document understanding emphasizes semantics through document summarization and corpus topic modeling but te nds to omit pragmatics such as document organization and flow. Using a corpus of scholarly documents across 19 disciplines and state-of-the-art language modeling techniques, we learn a fixed set of domain-agnostic descriptors for document sections and retrofit'' the corpus to these descriptors (also referred to as normalization''). Then, we analyze the position and ordering of these descriptors across documents to understand the relationship between discipline and structure. We report within-discipline structural archetypes, variability, and between-discipline comparisons, supporting the hypothesis that scholarly communities, despite their size, diversity, and breadth, share similar avenues for expressing their work. Our findings lay the foundation for future work in assessing research quality, domain style transfer, and further pragmatic analysis.

finding pragmatic differences pragmatic differences differences between disciplines العثور على الاختلافات العميمة الاختلافات العملية الاختلافات بين التخصصات صناعة حمض الفوسفور المزيد..

SemEval-2021 Task 9: Fact Verification and Evidence Finding for Tabular Data in Scientific Documents (SEM-TAB-FACTS)

326 - Association for Computation Linguistics 2021 مقالة

Understanding tables is an important and relevant task that involves understanding table structure as well as being able to compare and contrast information within cells. In this paper, we address this challenge by presenting a new dataset and tasks that addresses this goal in a shared task in SemEval 2020 Task 9: Fact Verification and Evidence Finding for Tabular Data in Scientific Documents (SEM-TAB-FACTS). Our dataset contains 981 manually-generated tables and an auto-generated dataset of 1980 tables providing over 180K statement and over 16M evidence annotations. SEM-TAB-FACTS featured two sub-tasks. In sub-task A, the goal was to determine if a statement is supported, refuted or unknown in relation to a table. In sub-task B, the focus was on identifying the specific cells of a table that provide evidence for the statement. 69 teams signed up to participate in the task with 19 successful submissions to subtask A and 12 successful submissions to subtask B. We present our results and main findings from the competition.

fact verification scientific documents tabular data التحقق من الحقيقة الوثائق العلمية بنك صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Competency Problems: On Finding and Removing Artifacts in Language Data

مشاكل الكفاءة: عند العثور وإزالة القطع الأثرية في بيانات اللغة

Ask ChatGPT about the research

Read More

suggested questions