Towards Objectively Evaluating the Quality of Generated Medical Summaries


Abstract in English

We propose a method for evaluating the quality of generated text by asking evaluators to count facts and then computing precision, recall, F-score, and accuracy from the raw counts. We believe this approach leads to a more objective and more easily reproducible evaluation. We apply it to the task of medical report summarisation, where measuring quality and accuracy objectively is of paramount importance.
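
As an illustration of how such metrics could be derived from raw fact counts, here is a minimal Python sketch. The abstract does not specify which counts evaluators record, so the four categories below (`correct`, `incorrect`, `omitted`, `correctly_excluded`) and the names `FactCounts` and `score` are assumptions made for this example, mapped onto the standard confusion-matrix definitions; they are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FactCounts:
    """Hypothetical per-summary tallies reported by one evaluator."""
    correct: int                 # facts in the summary supported by the source (TP)
    incorrect: int               # facts in the summary that are wrong or unsupported (FP)
    omitted: int                 # facts that should have been included but were not (FN)
    correctly_excluded: int = 0  # facts rightly left out (TN); assumed, needed for accuracy


def _safe_div(numerator: float, denominator: float) -> float:
    """Return 0.0 instead of raising when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def score(counts: FactCounts) -> dict:
    """Compute precision, recall, F1, and accuracy from raw fact counts."""
    precision = _safe_div(counts.correct, counts.correct + counts.incorrect)
    recall = _safe_div(counts.correct, counts.correct + counts.omitted)
    f_score = _safe_div(2 * precision * recall, precision + recall)
    total = (counts.correct + counts.incorrect
             + counts.omitted + counts.correctly_excluded)
    accuracy = _safe_div(counts.correct + counts.correctly_excluded, total)
    return {
        "precision": precision,
        "recall": recall,
        "f_score": f_score,
        "accuracy": accuracy,
    }


if __name__ == "__main__":
    # Invented example counts, for illustration only.
    print(score(FactCounts(correct=8, incorrect=2, omitted=2, correctly_excluded=8)))
```

Because the scores depend only on integer tallies, two evaluators who agree on the counts will necessarily agree on every metric, which is one way to read the reproducibility claim above.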

References used

https://aclanthology.org/
