This paper describes TenTrans' submissions to the WMT 2021 Efficiency Shared Task. We explore training a variety of smaller, compact Transformer models using a teacher-student setup. Our models are trained with TenTrans-Py, our self-developed open-source multilingual training platform. We also release an open-source, high-performance inference toolkit for Transformer models, written entirely in C++. All additional optimizations are built on top of the inference engine, including attention caching, kernel fusion, early stopping, and several others. Among our submissions, the fastest system translates more than 22,000 tokens per second on a single Tesla P4 while maintaining 38.36 BLEU on En-De newstest2019. Our trained models and further details are available in the TenTrans-Decoding competition examples.
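The attention caching mentioned above is the standard key/value caching trick for autoregressive decoding: keys and values of already-decoded positions are stored, so each new step only computes projections for the newest token instead of re-running attention over the whole prefix. The actual engine is implemented in C++; the following is a minimal Python sketch of the idea only, with all names (`KVCache`, `attend`) hypothetical and a single attention head over plain lists for clarity.

```python
import math

def attend(q, keys, values):
    """Scaled dot-product attention for a single query vector
    over a list of cached key/value vectors."""
    d = len(q)
    scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
    m = max(scores)                          # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    # weighted sum of cached value vectors
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(d)]

class KVCache:
    """Stores keys/values of previously decoded positions so each
    decoding step appends one new key/value pair instead of
    recomputing projections for the entire prefix."""
    def __init__(self):
        self.keys, self.values = [], []

    def step(self, k_new, v_new, q_new):
        # Append the current position's projections once...
        self.keys.append(k_new)
        self.values.append(v_new)
        # ...then attend over the full cached history.
        return attend(q_new, self.keys, self.values)
```

With the cache, per-step attention cost grows linearly with the prefix length rather than recomputing all previous projections, which is one of the main sources of the decoding speedups reported above.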