Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Evaluating the Morphosyntactic Well-formedness of Generated Texts

تقييم مورفوسنكتاكي شكل جيد للنصوص المتولدة

558 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

well-formedness of generated generated texts morphosyntactic well-formedness شكل جيد من ولدت النصوص التي تم إنشاؤها مورفوسنكتاكيتش شكل جيد صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Text generation systems are ubiquitous in natural language processing applications. However, evaluation of these systems remains a challenge, especially in multilingual settings. In this paper, we propose L'AMBRE -- a metric to evaluate the morphosyntactic well-formedness of text using its dependency parse and morphosyntactic rules of the language. We present a way to automatically extract various rules governing morphosyntax directly from dependency treebanks. To tackle the noisy outputs from text generation systems, we propose a simple methodology to train robust parsers. We show the effectiveness of our metric on the task of machine translation through a diachronic study of systems translating into morphologically-rich languages.

References used

https://aclanthology.org/

rate research

Understanding Model Robustness to User-generated Noisy Texts

867 - Association for Computation Linguistics 2021 مقالة

Sensitivity of deep-neural models to input noise is known to be a challenging problem. In NLP, model performance often deteriorates with naturally occurring noise, such as spelling errors. To mitigate this issue, models may leverage artificially nois ed data. However, the amount and type of generated noise has so far been determined arbitrarily. We therefore propose to model the errors statistically from grammatical-error-correction corpora. We present a thorough evaluation of several state-of-the-art NLP systems' robustness in multiple languages, with tasks including morpho-syntactic analysis, named entity recognition, neural machine translation, a subset of the GLUE benchmark and reading comprehension. We also compare two approaches to address the performance drop: a) training the NLP models with noised data generated by our framework; and b) reducing the input noise with external system for natural language correction. The code is released at https://github.com/ufal/kazitext.

user-generated noisy texts noisy texts user-generated noisy النصوص الناتجة عن المستخدم نصوص صاخبة صاخبة التي تم إنشاؤها صناعة حمض الفوسفور المزيد..

The Korean Morphologically Tight-Fitting Tokenizer for Noisy User-Generated Texts

799 - Association for Computation Linguistics 2021 مقالة

User-generated texts include various types of stylistic properties, or noises. Such texts are not properly processed by existing morpheme analyzers or language models based on formal texts such as encyclopedias or news articles. In this paper, we pro pose a simple morphologically tight-fitting tokenizer (K-MT) that can better process proper nouns, coinages, and internet slang among other types of noise in Korean user-generated texts. We tested our tokenizer by performing classification tasks on Korean user-generated movie reviews and hate speech datasets, and the Korean Named Entity Recognition dataset. Through our tests, we found that K-MT is better fit to process internet slangs, proper nouns, and coinages, compared to a morpheme analyzer and a character-level WordPiece tokenizer.

noisy user-generated texts noisy user-generated morphologically tight-fitting tokenizer النصوص التي أنشأها المستخدم صاخبة صاخبة المستخدم مظلمة ضيقة مورفولوجية صناعة حمض الفوسفور المزيد..

Towards Objectively Evaluating the Quality of Generated Medical Summaries

574 - Association for Computation Linguistics 2021 مقالة

We propose a method for evaluating the quality of generated text by asking evaluators to count facts, and computing precision, recall, f-score, and accuracy from the raw counts. We believe this approach leads to a more objective and easier to reproduce evaluation. We apply this to the task of medical report summarisation, where measuring objective quality and accuracy is of paramount importance.

generated medical summaries objectively evaluating medical summaries ملخصات طبية تم إنشاؤها تقييم موضوعي الملخصات الطبية صناعة حمض الفوسفور المزيد..

Paragraph-level Simplification of Medical Texts

769 - Association for Computation Linguistics 2021 مقالة

We consider the problem of learning to simplify medical texts. This is important because most reliable, up-to-date information in biomedicine is dense with jargon and thus practically inaccessible to the lay audience. Furthermore, manual simplificati on does not scale to the rapidly growing body of biomedical literature, motivating the need for automated approaches. Unfortunately, there are no large-scale resources available for this task. In this work we introduce a new corpus of parallel texts in English comprising technical and lay summaries of all published evidence pertaining to different clinical topics. We then propose a new metric based on likelihood scores from a masked language model pretrained on scientific texts. We show that this automated measure better differentiates between technical and lay summaries than existing heuristics. We introduce and evaluate baseline encoder-decoder Transformer models for simplification and propose a novel augmentation to these in which we explicitly penalize the decoder for producing jargon'' terms; we find that this yields improvements over baselines in terms of readability.

medical texts simplify medical texts paragraph-level simplification النصوص الطبية تبسيط النصوص الطبية تبسيط مستوى الفقرة صناعة حمض الفوسفور المزيد..

Evaluating retetion of the removable partial dentures supported by implants when changing the shape of the supporting alveolar

1100 - Aِl-Baath University 2017 ورقة بحثية

It is necessary when designing removable partial dentures to be fixed, stable and well supported for improving patient comfort, and decreasing damages on soft tissues and dental abutments, the alveolar ridge has a strong influence in planning for treatment and interpretations biomechanical when treated with removable partial dentures, and have four shapes in the sagittal plane which are horizontal shape, concave shape, distal ascending shape and descending shape.

ثبات السنخ الداعم الأجهزة الجزئية المتحركة المدعومة بالغرسات

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Evaluating the Morphosyntactic Well-formedness of Generated Texts

تقييم مورفوسنكتاكي شكل جيد للنصوص المتولدة

Ask ChatGPT about the research

Read More

suggested questions