New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Constrained Language Models Yield Few-Shot Semantic Parsers

نماذج اللغة المقيدة تسفر عن المحللين الدلاليين

245 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

yield few-shot semantic models yield few-shot language models yield تسفر عن عدد قليل من الدلالات الدلالية ترسل النماذج قليلة نماذج اللغة المحصول صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We explore the use of large pretrained language models as few-shot semantic parsers. The goal in semantic parsing is to generate a structured meaning representation given a natural language input. However, language models are trained to generate natural language. To bridge the gap, we use language models to paraphrase inputs into a controlled sublanguage resembling English that can be automatically mapped to a target meaning representation. Our results demonstrate that with only a small amount of data and very little code to convert into English-like representations, our blueprint for rapidly bootstrapping semantic parsers leads to surprisingly effective performance on multiple community tasks, greatly exceeding baseline methods also trained on the same limited data.

References used

https://aclanthology.org/

rate research

Testing Cross-Database Semantic Parsers With Canonical Utterances

357 - Association for Computation Linguistics 2021 مقالة

The benchmark performance of cross-database semantic parsing has climbed steadily in recent years, catalyzed by the wide adoption of pre-trained language models. Yet existing work have shown that state-of-the-art cross-database semantic parsers strug gle to generalize to novel user utterances, databases and query structures. To obtain transparent details on the strengths and limitation of these models, we propose a diagnostic testing approach based on controlled synthesis of canonical natural language and SQL pairs. Inspired by the CheckList, we characterize a set of essential capabilities for cross-database semantic parsing models, and detailed the method for synthesizing the corresponding test data. We evaluated a variety of high performing models using the proposed approach, and identified several non-obvious weaknesses across models (e.g. unable to correctly select many columns). Our dataset and code are released as a test suite at http://github.com/hclent/BehaviorCheckingSemPar.

cross-database semantic parsers cross-database semantic المحللين الدلاليين عبر قاعدة البيانات المعتاد قاعدة البيانات الدلالية صناعة حمض الفوسفور

Language Models are Few-shot Multilingual Learners

246 - Association for Computation Linguistics 2021 مقالة

General-purpose language models have demonstrated impressive capabilities, performing on par with state-of-the-art approaches on a range of downstream natural language processing (NLP) tasks and benchmarks when inferring instructions from very few ex amples. Here, we evaluate the multilingual skills of the GPT and T5 models in conducting multi-class classification on non-English languages without any parameter updates. We show that, given a few English examples as context, pre-trained language models can predict not only English test samples but also non-English ones. Finally, we find the in-context few-shot cross-lingual prediction results of language models are significantly better than random prediction, and they are competitive compared to the existing state-of-the-art cross-lingual models and translation models.

few-shot multilingual learners multilingual learners عدد قليل من المتعلمين متعدد اللغات المتعلمين متعدد اللغات صناعة حمض الفوسفور

Getting to Production with Few-shot Natural Language Generation Models

306 - Association for Computation Linguistics 2021 مقالة

In this paper, we study the utilization of pre-trained language models to enable few-shotNatural Language Generation (NLG) in task-oriented dialog systems. We introduce a system consisting of iterative self-training and an extensible mini-template fr amework that textualizes the structured input data into semi-natural text to fully take advantage of pre-trained language models. We compare var-ious representations of NLG models' input and output and show that transforming the input and output to be similar to what the language model has seen before during pre-training improves the model's few-shot performance substantially. We show that neural mod-els can be trained with as few as 300 annotated examples while providing high fidelity, considerably lowering the resource requirements for standing up a new domain or language.This level of data efficiency removes the need for crowd-sourced data collection resulting in higher quality data annotated by expert linguists. In addition, model maintenance and debugging processes will improve in this few-shot setting. Finally, we explore distillation and using a caching system to satisfy latency requirements of real-world systems.

استخراج الفتحة few-shot natural language language generation models قليل من اللغات الطبيعية نماذج توليد اللغة صناعة حمض الفوسفور

Ask2Transformers: Zero-Shot Domain labelling with Pretrained Language Models

397 - Association for Computation Linguistics 2021 مقالة

In this paper we present a system that exploits different pre-trained Language Models for assigning domain labels to WordNet synsets without any kind of supervision. Furthermore, the system is not restricted to use a particular set of domain labels. We exploit the knowledge encoded within different off-the-shelf pre-trained Language Models and task formulations to infer the domain label of a particular WordNet definition. The proposed zero-shot system achieves a new state-of-the-art on the English dataset used in the evaluation.

كلمة متعددة اللغات صناعة حمض الفوسفور

Probing Pre-trained Language Models for Semantic Attributes and their Values

320 - Association for Computation Linguistics 2021 مقالة

Pretrained language models (PTLMs) yield state-of-the-art performance on many natural language processing tasks, including syntax, semantics and commonsense. In this paper, we focus on identifying to what extent do PTLMs capture semantic attributes a nd their values, e.g., the correlation between rich and high net worth. We use PTLMs to predict masked tokens using patterns and lists of items from Wikidata in order to verify how likely PTLMs encode semantic attributes along with their values. Such inferences based on semantics are intuitive for humans as part of our language understanding. Since PTLMs are trained on large amount of Wikipedia data we would assume that they can generate similar predictions, yet our findings reveal that PTLMs are still much worse than humans on this task. We show evidence and analysis explaining how to exploit our methodology to integrate better context and semantics into PTLMs using knowledge bases.

probing pre-trained language probing pre-trained التحقيق اللغة المدربة مسبقا التحقيق مسبقا المدربين صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Constrained Language Models Yield Few-Shot Semantic Parsers

نماذج اللغة المقيدة تسفر عن المحللين الدلاليين

Ask ChatGPT about the research

Read More

suggested questions