نقترح طريقة لتقييم جودة النص الذي تم إنشاؤه عن طريق طلب المقيمين لحساب الحقائق، والحساب الدقيقة، واستدعاء، F-Score، ودقة من التهم الخام.نعتقد أن هذا النهج يؤدي إلى هدف أكثر وأسهل لإعادة إنتاج التقييم.نحن نطبق هذا على مهمة تلخيص التقرير الطبي، حيث قياس الجودة الموضوعية والدقة له أهمية قصوى.
We propose a method for evaluating the quality of generated text by asking evaluators to count facts, and computing precision, recall, f-score, and accuracy from the raw counts. We believe this approach leads to a more objective and easier to reproduce evaluation. We apply this to the task of medical report summarisation, where measuring objective quality and accuracy is of paramount importance.
References used
https://aclanthology.org/
Abstract A desirable property of a reference-based evaluation metric that measures the content quality of a summary is that it should estimate how much information that summary has in common with a reference. Traditional text overlap based metrics su
Medical conversations from patient visits are routinely summarized into clinical notes for documentation of clinical care. The automatic creation of clinical note is particularly challenging given that it requires summarization over spoken language a
Text generation systems are ubiquitous in natural language processing applications. However, evaluation of these systems remains a challenge, especially in multilingual settings. In this paper, we propose L'AMBRE -- a metric to evaluate the morphosyn
Extractive summarization has been the mainstay of automatic summarization for decades. Despite all the progress, extractive summarizers still suffer from shortcomings including coreference issues arising from extracting sentences away from their orig
This paper presents a reference study of available algorithms for plagiarism
detection and it develops semantic plagiarism detection algorithm for plagiarism detection
in medical research papers by employing the Medical Ontologies available on the