
You should evaluate your language model on marginal likelihood over tokenisations



Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor

Keywords: marginal likelihood


Abstract

Neural language models typically tokenise input text into sub-word units to achieve an open vocabulary. The standard approach is to use a single canonical tokenisation at both train and test time. We suggest that this approach is unsatisfactory and may bottleneck our evaluation of language model performance. Using only the one-best tokenisation ignores tokeniser uncertainty over alternative tokenisations, which may hurt the model's out-of-domain performance. In this paper, we argue that instead, language models should be evaluated on their marginal likelihood over tokenisations. We compare different estimators for the marginal likelihood based on sampling, and show that it is feasible to estimate the marginal likelihood with a manageable number of samples. We then evaluate a pretrained language model on both the one-best-tokenisation and marginal perplexities, and show that the marginal perplexity can be significantly better than the one-best perplexity, especially on out-of-domain data. We link this difference in perplexity to the tokeniser uncertainty as measured by tokeniser entropy. We discuss some implications of our results for language model training and evaluation, particularly with regard to tokenisation robustness.
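To make the distinction in the abstract concrete, the following is a minimal toy sketch, not the authors' implementation. It assumes a hypothetical five-token vocabulary with made-up unigram probabilities (`TOKEN_LOGP`) standing in for a language model, and a uniform proposal over tokenisations standing in for the tokeniser's sampling distribution. On a short string it compares the one-best tokenisation likelihood, the exact marginal likelihood obtained by enumeration, and an importance-sampling estimate of the marginal.

```python
import math
import random

# Hypothetical toy "language model": unigram log-probabilities over a tiny
# sub-word vocabulary. All values are made up for illustration.
TOKEN_LOGP = {
    "a": math.log(0.2),
    "b": math.log(0.2),
    "ab": math.log(0.3),
    "ba": math.log(0.1),
    "aba": math.log(0.2),
}


def segmentations(text):
    """Yield every split of `text` into in-vocabulary tokens."""
    if not text:
        yield ()
        return
    for i in range(1, len(text) + 1):
        head = text[:i]
        if head in TOKEN_LOGP:
            for rest in segmentations(text[i:]):
                yield (head,) + rest


def model_logp(tokens):
    """Log-probability the toy model assigns to one token sequence."""
    return sum(TOKEN_LOGP[t] for t in tokens)


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def one_best_logp(text):
    """Likelihood of the single most probable tokenisation."""
    return max(model_logp(s) for s in segmentations(text))


def exact_marginal_logp(text):
    """Marginal likelihood: sum over all tokenisations (feasible only for toys)."""
    return logsumexp([model_logp(s) for s in segmentations(text)])


def importance_sampling_logp(text, n_samples, rng):
    """Estimate log p(text) as the log of the mean of p(T)/q(T), with T ~ q.

    Assumption: q is uniform over tokenisations; a real setup would sample
    from the tokeniser's own distribution instead.
    """
    segs = list(segmentations(text))
    log_q = -math.log(len(segs))
    log_weights = [model_logp(rng.choice(segs)) - log_q for _ in range(n_samples)]
    return logsumexp(log_weights) - math.log(n_samples)


if __name__ == "__main__":
    text = "aba"
    rng = random.Random(0)
    print("one-best :", math.exp(one_best_logp(text)))
    print("marginal :", math.exp(exact_marginal_logp(text)))
    print("estimate :", math.exp(importance_sampling_logp(text, 20000, rng)))
```

Because the marginal sums the probabilities of every tokenisation, including the best one, it can never be lower than the one-best likelihood. The gap between the two is largest when probability mass is spread across several tokenisations.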

References used

https://aclanthology.org/


Should we find another model?: Improving Neural Machine Translation Performance with ONE-Piece Tokenization Method without Model Modification

727 - Association for Computational Linguistics 2021 Article

Most of the recent Natural Language Processing (NLP) studies are based on the Pretrain-Finetuning Approach (PFA), but in small and medium-sized enterprises or companies with insufficient hardware there are many limitations to servicing NLP application software using such technology due to slow speed and insufficient memory. The latest PFA technologies require large amounts of data, especially for low-resource languages, making them much more difficult to work with. To address this limitation, we propose a new tokenization method, ONE-Piece, which combines the morphology-considered subword tokenization method and the vocabulary method used after probing for an existing method that has not been carefully considered before. Our proposed method can also be used without modifying the model structure. We experiment by applying ONE-Piece to Korean, a morphologically rich and low-resource language. We derive an optimal subword tokenization result for Korean-English machine translation by conducting a case study that combines the subword tokenization method, morphological segmentation, and vocabulary method. Through comparative experiments with all the tokenization methods currently used in NLP research, ONE-Piece achieves performance comparable to the current Korean-English machine translation state-of-the-art model.

Keywords: improving neural machine, improving neural

Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge

239 - Association for Computational Linguistics 2021 Article

The limits of applicability of vision-and-language models are defined by the coverage of their training data. Tasks like vision question answering (VQA) often require commonsense and factual information beyond what can be learned from task-specific datasets. This paper investigates the injection of knowledge from general-purpose knowledge bases (KBs) into vision-and-language transformers. We use an auxiliary training objective that encourages the learned representations to align with graph embeddings of matching entities in a KB. We empirically study the relevance of various KBs to multiple tasks and benchmarks. The technique brings clear benefits to knowledge-demanding question answering tasks (OK-VQA, FVQA) by capturing semantic and relational knowledge absent from existing models. More surprisingly, the technique also benefits visual reasoning tasks (NLVR2, SNLI-VE). We perform probing experiments and show that the injection of additional knowledge regularizes the space of embeddings, which improves the representation of lexical and semantic similarities. The technique is model-agnostic and can expand the applicability of any vision-and-language transformer with minimal computational overhead.

Keywords: supplemental knowledge, vision-and-language models, exploring

Should Semantic Vector Composition be Explicit? Can it be Linear?

484 - Association for Computational Linguistics 2021 Article

Vector representations have become a central element in semantic language modelling, leading to mathematical overlaps with many fields including quantum theory. Compositionality is a core goal for such representations: given representations for 'wet' and 'fish', how should the concept 'wet fish' be represented? This position paper surveys this question from two points of view. The first considers the question of whether an explicit mathematical representation can be successful using only tools from within linear algebra, or whether other mathematical tools are needed. The second considers whether semantic vector composition should be explicitly described mathematically, or whether it can be a model-internal side-effect of training a neural network. A third and newer question is whether a compositional model can be implemented on a quantum computer. Given the fundamentally linear nature of quantum mechanics, we propose that these questions are related, and that this survey may help to highlight candidate operations for future quantum implementation.

Keywords: semantic vector composition, semantic vector, vector composition

How Should Agents Ask Questions For Situated Learning? An Annotated Dialogue Corpus

426 - Association for Computational Linguistics 2021 Article

Intelligent agents that are confronted with novel concepts in situated environments will need to ask their human teammates questions to learn about the physical world. To better understand this problem, we need data about asking questions in situated task-based interactions. To this end, we present the Human-Robot Dialogue Learning (HuRDL) Corpus - a novel dialogue corpus collected in an online interactive virtual environment in which human participants play the role of a robot performing a collaborative tool-organization task. We describe the corpus data and a corresponding annotation scheme to offer insight into the form and content of questions that humans ask to facilitate learning in a situated environment. We provide the corpus as an empirically-grounded resource for improving question generation in situated intelligent agents.

Keywords: annotated dialogue corpus, situated

Evaluate the brittleness factors for some rocks by ultrasonic measurements

1308 - Al-Baath University 2017 Research paper

Basalt is classified as an isotropic rock according to its mechanical properties, whereas gypsum is considered a transversely isotropic rock, so the values of its mechanical parameters depend on the direction in which they are measured. By using a nondestructive method such as the ultrasonic test, which relies on understanding the effect of mechanical properties on the speed of ultrasonic waves inside rocks, one can indirectly predict these parameters. Tests were carried out on 35 gypsum and 11 basalt rock specimens, which were collected under the direction of the General Organization for Land Development (Directorate of Geological Investigation).

Keywords: rock mechanics, drilling, brittleness
