
Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis


Publication date: 2021
Language: English





This paper offers a comparative evaluation of four commercial ASR systems, assessed according to the post-editing effort required to reach "publishable" quality and according to the number of errors they produce. For the error annotation task, an original typology of transcription errors is proposed. The study also examines whether these systems perform differently for native and non-native English speakers. The experimental results suggest that, among the four systems, Trint obtains the best scores. It is also observed that most systems perform noticeably better with native speakers and that all systems are most prone to fluency errors.
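As a rough illustration of the kind of measurement involved, the sketch below computes a word-level edit-distance rate between a raw ASR transcript and its post-edited version, a common proxy for technical post-editing effort (in the spirit of WER/HTER). This is a minimal assumption-laden sketch, not the authors' actual evaluation code, and the example strings are invented.

```python
# Minimal sketch (not the paper's evaluation code): word-level edit-distance
# rate between a raw ASR transcript and its post-edited version, used here
# as a simple proxy for technical post-editing effort.

def word_edit_rate(asr_output: str, post_edited: str) -> float:
    """Return (substitutions + insertions + deletions) / reference length."""
    hyp = asr_output.split()
    ref = post_edited.split()
    # Standard dynamic-programming Levenshtein distance over words.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution / match
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

if __name__ == "__main__":
    raw = "the speaker said benchmarking a s r systems is hard"
    edited = "the speaker said benchmarking ASR systems is hard"
    print(f"word edit rate: {word_edit_rate(raw, edited):.3f}")
```

A lower rate means the transcript needed fewer post-edits to reach the edited version; in practice, temporal and cognitive effort would be measured separately from this purely technical count.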


Related research

The development of translation technologies, such as Translation Memory (TM) and Machine Translation (MT), has completely changed the translation industry and translators' workflows over the last few decades. Nevertheless, TM and MT have been developed separately until very recently. This ongoing project will study the external integration of TM and MT, examining whether the productivity and post-editing effort of translators are higher or lower than when using only TM. To this end, we will conduct an experiment in which translation students and professional translators are asked to translate two short texts; we will then measure the post-editing effort (temporal, technical and cognitive) and the quality of the translated texts.
Post-processing is the most conventional approach for correcting errors caused by Optical Character Recognition (OCR) systems. Two steps are usually taken to correct OCR errors: detection and correction. For the first task, supervised machine learning methods have shown state-of-the-art performance. Previously proposed approaches have focused most prominently on combining lexical, contextual and statistical features for detecting errors. In this study, we report a novel error-detection system that is based merely on the n-gram counts of a candidate token. In addition to being simple and computationally less expensive, our proposed system beats previous systems reported in the ICDAR2019 competition on OCR error detection by notable margins. We achieved state-of-the-art F1-scores for eight out of the ten European languages involved. The maximum improvement is for Spanish, which improved from 0.69 to 0.90, and the minimum is for Polish, from 0.82 to 0.84.
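To make the n-gram-count idea concrete, here is a small illustrative sketch: character n-gram counts are estimated from a clean corpus, and a token is flagged as a likely OCR error if any of its n-grams is rare. The n-gram order, the threshold, and the toy corpus are assumptions chosen for demonstration, not the settings of the system described above.

```python
# Illustrative sketch only: flag likely OCR errors from the character n-gram
# counts of a candidate token, estimated over a clean reference corpus.
# The n-gram order and threshold below are assumptions, not the paper's values.

from collections import Counter

def char_ngram_counts(corpus_tokens, n=3):
    """Count character n-grams (with word-boundary padding) over a corpus."""
    counts = Counter()
    for tok in corpus_tokens:
        padded = f"#{tok}#"
        for i in range(len(padded) - n + 1):
            counts[padded[i:i + n]] += 1
    return counts

def looks_like_error(token, counts, n=3, min_count=2):
    """A token is suspicious if any of its character n-grams is rare."""
    padded = f"#{token}#"
    grams = [padded[i:i + n] for i in range(len(padded) - n + 1)]
    if not grams:
        return False
    return min(counts.get(g, 0) for g in grams) < min_count

if __name__ == "__main__":
    clean = "the quick brown fox jumps over the lazy dog".split() * 50
    counts = char_ngram_counts(clean)
    for candidate in ["quick", "qvick", "brown", "brovvn"]:
        print(candidate, "-> likely error?", looks_like_error(candidate, counts))
```

In this toy run, corrupted tokens such as "qvick" and "brovvn" contain unseen character n-grams and are flagged, while the clean tokens pass.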
Language technologies such as machine translation (MT), the application of artificial intelligence in general, and an abundance of CAT tools and platforms have an increasing influence on the translation market. Human interaction with these technologies becomes ever more important as they impact translators' workflows, work environments, and job profiles, and this also has implications for translator training. One of the tasks that emerged with language technologies is post-editing (PE), where a human translator corrects raw machine-translated output according to given guidelines and quality criteria (O'Brien, 2011: 197-198). Already widely used in several traditional translation settings, its use has also come into focus in more creative processes such as literary translation and audiovisual translation (AVT). With the integration of MT systems, the translation process should become more efficient. Both economic and cognitive processes are affected, and with them the necessary competences of all stakeholders involved change. In this paper, we describe the different potential job profiles and the respective competences needed when post-editing subtitles.
Retrieval-based dialogue systems display outstanding performance when pre-trained language models are used, including bidirectional encoder representations from transformers (BERT). During multi-turn response selection, BERT focuses on training the relationship between the context, with its multiple utterances, and the response. However, this method of training is insufficient when considering the relations between the individual utterances in the context, which leads to the problem of not fully understanding the context flow required to select a response. To address this issue, we propose a new fine-grained post-training method that reflects the characteristics of multi-turn dialogue. Specifically, the model learns utterance-level interactions by training on every short context-response pair in a dialogue session. Furthermore, by using a new training objective, utterance relevance classification, the model learns the semantic relevance and coherence between dialogue utterances. Experimental results show that our model achieves a new state of the art with significant margins on three benchmark datasets, suggesting that the fine-grained post-training method is highly effective for the response selection task.
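To illustrate what "every short context-response pair in a dialogue session" might look like as training data, the sketch below splits one session into such pairs with simple random negatives. The window size, the negative-sampling scheme, and the example session are assumptions for demonstration only, not the paper's exact construction.

```python
# Illustrative sketch: build short context-response training pairs from one
# multi-turn dialogue session, in the spirit of fine-grained post-training.
# Window size and negative sampling are assumptions, not the paper's settings.

import random

def make_short_context_pairs(session, window=3):
    """Return (short_context, response, label) triples for one session.

    Positives pair each utterance with the preceding `window` utterances;
    negatives replace the response with a random utterance from the session.
    """
    pairs = []
    for i in range(1, len(session)):
        context = session[max(0, i - window):i]
        response = session[i]
        pairs.append((context, response, 1))        # coherent pair
        negative = random.choice(session)
        if negative != response:
            pairs.append((context, negative, 0))    # utterance-relevance negative
    return pairs

if __name__ == "__main__":
    session = [
        "hi, is the library open today?",
        "yes, until 9 pm.",
        "great, can I reserve a study room?",
        "sure, rooms can be booked online.",
    ]
    for context, response, label in make_short_context_pairs(session):
        print(label, "|", " / ".join(context), "=>", response)
```

Each triple could then be fed to a BERT-style encoder with a binary relevance objective, so the model sees utterance-level interactions rather than only the full context paired with the final response.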
Automatic summarisation has the potential to aid physicians in streamlining clerical tasks such as note-taking, but it is notoriously difficult to evaluate these systems and demonstrate that they are safe to use in a clinical setting. To circumvent this issue, we propose a semi-automatic approach whereby physicians post-edit generated notes before submitting them. We conduct a preliminary study on the time saved by post-editing automatically generated consultation notes. Our evaluators are asked to listen to mock consultations and to post-edit three generated notes. We time this and find that it is faster than writing the note from scratch. We present insights and lessons learnt from this experiment.


