Research papers, master and doctoral theses about تقييم الدقة

Generation Challenges: Results of the Accuracy Evaluation Shared Task

423 - Association for Computation Linguistics 2021 مقالة

The Shared Task on Evaluating Accuracy focused on techniques (both manual and automatic) for evaluating the factual accuracy of texts produced by neural NLG systems, in a sports-reporting domain. Four teams submitted evaluation techniques for this ta sk, using very different approaches and techniques. The best-performing submissions did encouragingly well at this difficult task. However, all automatic submissions struggled to detect factual errors which are semantically or pragmatically complex (for example, based on incorrect computation or inference).

generation challenges accuracy evaluation shared evaluation shared task تحديات الجيل تقاسم تقييم الدقة تقييم المهمة المشتركة صناعة حمض الفوسفور المزيد..

Shared Task in Evaluating Accuracy: Leveraging Pre-Annotations in the Validation Process

311 - Association for Computation Linguistics 2021 مقالة

We hereby present our submission to the Shared Task in Evaluating Accuracy at the INLG 2021 Conference. Our evaluation protocol relies on three main components; rules and text classifiers that pre-annotate the dataset, a human annotator that validate s the pre-annotations, and a web interface that facilitates this validation. Our submission consists in fact of two submissions; we first analyze solely the performance of the rules and classifiers (pre-annotations), and then the human evaluation aided by the former pre-annotations using the web interface (hybrid). The code for the web interface and the classifiers is publicly available.

task in evaluating evaluating accuracy المهمة في تقييم تقييم الدقة صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد