Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework

Added by Association for Computational Linguistics (article)

Publication date: 2021

Field: Artificial Intelligence

Research language: English

Abstract

Style is an integral part of natural language. However, evaluation methods for style measures are rare, often task-specific and usually do not control for content. We propose the modular, fine-grained and content-controlled similarity-based STyle EvaLuation framework (STEL) to test the performance of any model that can compare two sentences on style. We illustrate STEL with two general dimensions of style (formal/informal and simple/complex) as well as two specific characteristics of style (contrac'tion and numb3r substitution). We find that BERT-based methods outperform simple versions of commonly used style measures like 3-grams, punctuation frequency and LIWC-based approaches. We invite the addition of further tasks and task instances to STEL and hope to facilitate the improvement of style-sensitive measures.
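The abstract describes a style measure as any model that can compare two sentences on style, and names 3-grams and punctuation frequency as simple examples. The sketch below illustrates what such measures might look like. It is not the authors' implementation: the use of character-level 3-grams, cosine similarity, and all function names are assumptions made for illustration.

```python
from collections import Counter
import math
import string


def char_ngrams(text, n=3):
    """Count overlapping character n-grams in a lowercased sentence.

    Assumption: character-level n-grams; the abstract does not say
    whether its 3-gram measure is character- or word-based.
    """
    text = text.lower()
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def punctuation_counts(text):
    """Count each ASCII punctuation mark that appears in the sentence."""
    return Counter(ch for ch in text if ch in string.punctuation)


def cosine(a, b):
    """Cosine similarity of two count vectors; 0.0 if either is empty."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def ngram_similarity(s1, s2, n=3):
    """Hypothetical 3-gram style measure: compare n-gram profiles."""
    return cosine(char_ngrams(s1, n), char_ngrams(s2, n))


def punctuation_similarity(s1, s2):
    """Hypothetical punctuation-frequency style measure."""
    return cosine(punctuation_counts(s1), punctuation_counts(s2))
```

Both functions take two sentences and return a score between 0 and 1, so either could be plugged into an evaluation that only requires a pairwise style comparison. Note that neither controls for content: two sentences on the same topic will share many 3-grams regardless of style.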

References used

https://aclanthology.org/

A Review of Human Evaluation for Style Transfer

Association for Computational Linguistics, 2021 (article)

This paper reviews and summarizes human evaluation practices described in 97 style transfer papers with respect to three main evaluation aspects: style transfer, meaning preservation, and fluency. In principle, evaluations by human raters should be the most reliable. However, in style transfer papers, we find that protocols for human evaluations are often underspecified and not standardized, which hampers the reproducibility of research in this field and progress toward better human and automatic evaluation methods.

Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

Association for Computational Linguistics, 2021 (article)

While the field of style transfer (ST) has been growing rapidly, it has been hampered by a lack of standardized practices for automatic evaluation. In this paper, we evaluate leading automatic metrics on the oft-researched task of formality style transfer. Unlike previous evaluations, which focus solely on English, we expand our focus to Brazilian-Portuguese, French, and Italian, making this work the first multilingual evaluation of metrics in ST. We outline best practices for automatic evaluation in (formality) style transfer and identify several models that correlate well with human judgments and are robust across languages. We hope that this work will help accelerate development in ST, where human evaluation is often challenging to collect.

R2-D2: A Modular Baseline for Open-Domain Question Answering

Association for Computational Linguistics, 2021 (article)

This work presents a novel four-stage open-domain QA pipeline R2-D2 (Rank twice, reaD twice). The pipeline is composed of a retriever, passage reranker, extractive reader, generative reader and a mechanism that aggregates the final prediction from all system's components. We demonstrate its strength across three open-domain QA datasets: NaturalQuestions, TriviaQA and EfficientQA, surpassing state-of-the-art on the first two. Our analysis demonstrates that: (i) combining extractive and generative reader yields absolute improvements up to 5 exact match and it is at least twice as effective as the posterior averaging ensemble of the same models with different parameters, (ii) the extractive reader with fewer parameters can match the performance of the generative reader on extractive QA datasets.

A CCG-Based Version of the DisCoCat Framework

Association for Computational Linguistics, 2021 (article)

While the DisCoCat model (Coecke et al., 2010) has been proved a valuable tool for studying compositional aspects of language at the level of semantics, its strong dependency on pregroup grammars poses important restrictions: first, it prevents large-scale experimentation due to the absence of a pregroup parser; and second, it limits the expressibility of the model to context-free grammars. In this paper we solve these problems by reformulating DisCoCat as a passage from Combinatory Categorial Grammar (CCG) to a category of semantics. We start by showing that standard categorial grammars can be expressed as a biclosed category, where all rules emerge as currying/uncurrying the identity; we then proceed to model permutation-inducing rules by exploiting the symmetry of the compact closed category encoding the word meaning. We provide a proof of concept for our method, converting "Alice in Wonderland" into DisCoCat form, a corpus that we make available to the community.

A Linguistic Annotation Framework to Study Interactions in Multilingual Healthcare Conversational Forums

Association for Computational Linguistics, 2021 (article)

In recent years, remote digital healthcare using online chats has gained momentum, especially in the Global South. Though prior work has studied interaction patterns in online (health) forums, such as TalkLife, Reddit and Facebook, there has been limited work in understanding interactions in small, close-knit community of instant messengers. In this paper, we propose a linguistic annotation framework to facilitate analysis of health-focused WhatsApp groups. The primary aim of the framework is to understand interpersonal relationships among peer supporters in order to help develop NLP solutions for remote patient care and reduce burden of overworked healthcare providers. Our framework consists of fine-grained peer support categorization and message-level sentiment tagging. Additionally, due to the prevalence of code-mixing in such groups, we incorporate word-level language annotations. We use the proposed framework to study two WhatsApp groups in Kenya for youth living with HIV, facilitated by a healthcare provider.
