Do you want to publish a course? Click here

Comprehensive Punctuation Restoration for English and Polish

استعادة علامات الترقيم الشاملة للغة الإنجليزية والبولندية

302   0   0   0.0 ( 0 )
 Publication date 2021
and research's language is English
 Created by Shamra Editor




Ask ChatGPT about the research

Punctuation restoration is a fundamental requirement for the readability of text derived from Automatic Speech Recognition (ASR) systems. Most contemporary solutions are limited to predicting only a few of the most frequently occurring marks, such as periods, commas, and question marks - and only one per word. However, in written language, we deal with a much larger number of punctuation characters (such as parentheses, hyphens, etc.), and their combinations (like parenthesis followed by a dot). Such comprehensive punctuation cannot always be unambiguously reduced to a basic set of the most frequently occurring marks. In this work, we evaluate several methods in the comprehensive punctuation reconstruction task. We conduct experiments on parallel corpora of two different languages, English and Polish - languages with a relatively simple and complex morphology, respectively. We also investigate the influence of building a model on comprehensive punctuation on the quality of the basic punctuation restoration task

References used
https://aclanthology.org/
rate research

Read More

We propose a novel method of homonymy-polysemy discrimination for three Indo-European Languages (English, Spanish and Polish). Support vector machines and LASSO logistic regression were successfully used in this task, outperforming baselines. The fea ture set utilised lemma properties, gloss similarities, graph distances and polysemy patterns. The proposed ML models performed equally well for English and the other two languages (constituting testing data sets). The algorithms not only ruled out most cases of homonymy but also were efficacious in distinguishing between closer and indirect semantic relatedness.
In this paper we describe our submissions to WAT-2021 (Nakazawa et al., 2021) for English-to-Myanmar language (Burmese) task. Our team, ID: YCC-MT1'', focused on bringing transliteration knowledge to the decoder without changing the model. We manuall y extracted the transliteration word/phrase pairs from the ALT corpus and applying XML markup feature of Moses decoder (i.e. -xml-input exclusive, -xml-input inclusive). We demonstrate that hybrid translation technique can significantly improve (around 6 BLEU scores) the baseline of three well-known Phrase-based SMT'', Operation Sequence Model'' and Hierarchical Phrase-based SMT''. Moreover, this simple hybrid method achieved the second highest results among the submitted MT systems for English-to-Myanmar WAT2021 translation share task according to BLEU (Papineni et al., 2002) and AMFM scores (Banchs et al., 2015).
Being an integral urban, social, economical and cultural part of its development plans; states and its administrations competes in the design, planning and implementation of sustainable tourism development. Given the importance and the need to de velop a meaningful and targeted strategies toachieve sustainable development in the North region ; this research handled a collection of different tourism planning strategeis such as: site planning strategy, merging and integration strategy, tourism triangle strategy, Hyperlinks axes strategy and Imaginable planning strategy.
Nowadays social-psychological variables , like attitudes and motivation, gender, aptitude, etc. have been established as influential factors in the process of learning a foreign language . Therefore, this research aims at measuring the attitudes of f ourth-year students at the Department of English towards learning English
With the increasing use of technologies and automation in different sides of modern life, the outage of electricity became a big issue that widely affects the daily life of most sectors like industrial, economical or even entertaining sector. So it became so necessary to achieve a high-reliability electrical system to insure the continuation of electricity supply to the end consumer. Consequently, in this research, we are studying a new method of service restoration using genetic algorithms to increase the reliability of distribution systems and improving its performance. The research includes a brief aver view of electrical systems reliability and the basics of Genetic Algorithms and the use of these techniques in dispatching centers. In addition we have designed a program in "MATLAB" environment to apply the service restoration technique using genetic algorithms, and the program has been tested on a case study with the relative results shown .

suggested questions

comments
Fetching comments Fetching comments
Sign in to be able to follow your search criteria
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا