Subscribe to the gold package and get unlimited access to Shamra Academy

Maastricht University's Multilingual Speech Translation System for IWSLT 2021

نظام ترجمة خطاب الكلام متعدد اللغات بجامعة ماستريخت ل IWSLT 2021

892 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

maastricht university multilingual multilingual speech translation university multilingual speech جامعة ماستريخت متعددة اللغات ترجمة خطوة متعددة اللغات خطاب جامعي متعدد اللغات صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This paper describes Maastricht University's participation in the IWSLT 2021 multilingual speech translation track. The task in this track is to build multilingual speech translation systems in supervised and zero-shot directions. Our primary system is an end-to-end model that performs both speech transcription and translation. We observe that the joint training for the two tasks is complementary especially when the speech translation data is scarce. On the source and target side, we use data augmentation and pseudo-labels respectively to improve the performance of our systems. We also introduce an ensembling technique that consistently improves the quality of transcriptions and translations. The experiments show that the end-to-end system is competitive with its cascaded counterpart especially in zero-shot conditions.

References used

https://aclanthology.org/

rate research

Maastricht University's Large-Scale Multilingual Machine Translation System for WMT 2021

1265 - Association for Computation Linguistics 2021 مقالة

We present our development of the multilingual machine translation system for the large-scale multilingual machine translation task at WMT 2021. Starting form the provided baseline system, we investigated several techniques to improve the translation quality on the target subset of languages. We were able to significantly improve the translation quality by adapting the system towards the target subset of languages and by generating synthetic data using the initial model. Techniques successfully applied in zero-shot multilingual machine translation (e.g. similarity regularizer) only had a minor effect on the final translation performance.

متعدد اللغات منخفضة الموارد maastricht university large-scale university large-scale multilingual جامعة ماستريخت واسعة النطاق جامعة واسعة النطاق متعدد اللغات صناعة حمض الفوسفور

ZJU's IWSLT 2021 Speech Translation System

991 - Association for Computation Linguistics 2021 مقالة

In this paper, we describe Zhejiang University's submission to the IWSLT2021 Multilingual Speech Translation Task. This task focuses on speech translation (ST) research across many non-English source languages. Participants can decide whether to work on constrained systems or unconstrained systems which can using external data. We create both cascaded and end-to-end speech translation constrained systems, using the provided data only. In the cascaded approach, we combine Conformer-based automatic speech recognition (ASR) with the Transformer-based neural machine translation (NMT). Our end-to-end direct speech translation systems use ASR pretrained encoder and multi-task decoders. The submitted systems are ensembled by different cascaded models.

zju iwslt zju iwslt. صناعة حمض الفوسفور

KIT's IWSLT 2021 Offline Speech Translation System

897 - Association for Computation Linguistics 2021 مقالة

This paper describes KIT'submission to the IWSLT 2021 Offline Speech Translation Task. We describe a system in both cascaded condition and end-to-end condition. In the cascaded condition, we investigated different end-to-end architectures for the spe ech recognition module. For the text segmentation module, we trained a small transformer-based model on high-quality monolingual data. For the translation module, our last year's neural machine translation model was reused. In the end-to-end condition, we improved our Speech Relative Transformer architecture to reach or even surpass the result of the cascade system.

خط أنابيب Finetuned. kit iwslt كيت iwslt. صناعة حمض الفوسفور

ESPnet-ST IWSLT 2021 Offline Speech Translation System

748 - Association for Computation Linguistics 2021 مقالة

This paper describes the ESPnet-ST group's IWSLT 2021 submission in the offline speech translation track. This year we made various efforts on training data, architecture, and audio segmentation. On the data side, we investigated sequence-level knowl edge distillation (SeqKD) for end-to-end (E2E) speech translation. Specifically, we used multi-referenced SeqKD from multiple teachers trained on different amounts of bitext. On the architecture side, we adopted the Conformer encoder and the Multi-Decoder architecture, which equips dedicated decoders for speech recognition and translation tasks in a unified encoder-decoder model and enables search in both source and target language spaces during inference. We also significantly improved audio segmentation by using the pyannote.audio toolkit and merging multiple short segments for long context modeling. Experimental evaluations showed that each of them contributed to large improvements in translation performance. Our best E2E system combined all the above techniques with model ensembling and achieved 31.4 BLEU on the 2-ref of tst2021 and 21.2 BLEU and 19.3 BLEU on the two single references of tst2021.

مهمة غير متصل espnet-st group iwslt offline speech مجموعة ESPNET-ST IWSLT خطاب غير متصل صناعة حمض الفوسفور

Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021

560 - Association for Computation Linguistics 2021 مقالة

This paper describes the system submitted to the IWSLT 2021 Multilingual Speech Translation (MultiST) task from Huawei Noah's Ark Lab. We use a unified transformer architecture for our MultiST model, so that the data from different modalities (i.e., speech and text) and different tasks (i.e., Speech Recognition, Machine Translation, and Speech Translation) can be exploited to enhance the model's ability. Specifically, speech and text inputs are firstly fed to different feature extractors to extract acoustic and textual features, respectively. Then, these features are processed by a shared encoder--decoder architecture. We apply several training techniques to improve the performance, including multi-task learning, task-level curriculum learning, data augmentation, etc. Our final system achieves significantly better results than bilingual baselines on supervised language pairs and yields reasonable results on zero-shot language pairs.

huawei noah ark noah ark lab هواوي نوح ark. Noah Ark Lab. صناعة حمض الفوسفور

Maastricht University's Multilingual Speech Translation System for IWSLT 2021

نظام ترجمة خطاب الكلام متعدد اللغات بجامعة ماستريخت ل IWSLT 2021

Ask ChatGPT about the research

Read More

suggested questions