نقدم نظرة عامة على المهمة المشتركة السكري، التي قدمت في ورشة عمل المعالجة بالوثائق العلمية الثانية (SDP) في Naacl 2021. وفي هذه المهمة المشتركة، قدمت النظم مطالبة علمية وجزح من ملخصات البحث، وطلب تحديد المقالات التي تدعمهاأو دحض المطالبة وكذلك توفير جمل إثبات تبرير هذه الملصقات.11 قدمت فرق ما مجموعه 14 تقريرا إلى المتصدرين المهمة المشتركة، مما يؤدي إلى تحسين أكثر من +23 F1 على متري تقييم المهام الأساسية.بالإضافة إلى مسح النظم المشاركة، فإننا نقدم العديد من الأفكار في نهج النمذجة لدعم التقدم المحرز المستمر والبحث في المستقبل حول المهمة المهمة والصعبة للتحقق من الادعاء العلمي.
We present an overview of the SCIVER shared task, presented at the 2nd Scholarly Document Processing (SDP) workshop at NAACL 2021. In this shared task, systems were provided a scientific claim and a corpus of research abstracts, and asked to identify which articles Support or Refute the claim as well as provide evidentiary sentences justifying those labels. 11 teams made a total of 14 submissions to the shared task leaderboard, leading to an improvement of more than +23 F1 on the primary task evaluation metric. In addition to surveying the participating systems, we provide several insights into modeling approaches to support continued progress and future research on the important and challenging task of scientific claim verification.
References used
https://aclanthology.org/
The NLP field has recently seen a substantial increase in work related to reproducibility of results, and more generally in recognition of the importance of having shared definitions and practices relating to evaluation. Much of the work on reproduci
This paper provides an overview of the WANLP 2021 shared task on sarcasm and sentiment detection in Arabic. The shared task has two subtasks: sarcasm detection (subtask 1) and sentiment analysis (subtask 2). This shared task aims to promote and bring
In this paper, we introduce the Eval4NLP-2021 shared task on explainable quality estimation. Given a source-translation pair, this shared task requires not only to provide a sentence-level score indicating the overall quality of the translation, but
This work describes the adaptation of a pretrained sequence-to-sequence model to the task of scientific claim verification in the biomedical domain. We propose a system called VerT5erini that exploits T5 for abstract retrieval, sentence selection, an
We present the GermEval 2021 shared task on the identification of toxic, engaging, and fact-claiming comments. This shared task comprises three binary classification subtasks with the goal to identify: toxic comments, engaging comments, and comments