Did the Cat Drink the Coffee? Challenging Transformers with Generalized Event Knowledge

Added by: Association for Computational Linguistics (article)

Publication date: 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor

Abstract

Prior research has explored the ability of computational models to predict a word's semantic fit with a given predicate. While much work has been devoted to modeling the typicality relation between verbs and arguments in isolation, in this paper we take a broader perspective by assessing whether and to what extent computational approaches have access to information about the typicality of entire events and situations described in language (Generalized Event Knowledge). Given the recent success of Transformer Language Models (TLMs), we decided to test them on a benchmark for the dynamic estimation of thematic fit. These models were evaluated in comparison with SDM, a framework specifically designed to integrate events in sentence meaning representations, and we conducted a detailed error analysis to investigate which factors affect their behavior. Our results show that TLMs can reach performance comparable to that achieved by SDM. However, additional analysis consistently suggests that TLMs do not capture important aspects of event knowledge, and their predictions often depend on surface linguistic features, such as frequent words, collocations and syntactic patterns, thereby showing sub-optimal generalization abilities.

References used

https://aclanthology.org/

Challenging distributional models with a conceptual network of philosophical terms

Association for Computational Linguistics, 2021 (article)

Computational linguistic research on language change through distributional semantic (DS) models has inspired researchers from fields such as philosophy and literary studies, who use these methods for the exploration and comparison of comparatively small datasets traditionally analyzed by close reading. Research on methods for small data is still in early stages and it is not clear which methods achieve the best results. We investigate the possibilities and limitations of using distributional semantic models for analyzing philosophical data by means of a realistic use-case. We provide a ground truth for evaluation created by philosophy experts and a blueprint for using DS models in a sound methodological setup. We compare three methods for creating specialized models from small datasets. Though the models do not perform well enough to directly support philosophers yet, we find that models designed for small data yield promising directions for future work.

Modeling Event Plausibility with Consistent Conceptual Abstraction

Association for Computational Linguistics, 2021 (article)

Understanding natural language requires common sense, one aspect of which is the ability to discern the plausibility of events. While distributional models (most recently pre-trained Transformer language models) have demonstrated improvements in modeling event plausibility, their performance still falls short of humans'. In this work, we show that Transformer-based plausibility models are markedly inconsistent across the conceptual classes of a lexical hierarchy, inferring that "a person breathing" is plausible while "a dentist breathing" is not, for example. We find this inconsistency persists even when models are softly injected with lexical knowledge, and we present a simple post-hoc method of forcing model consistency that improves correlation with human plausibility judgements.

Learning General Event Schemas with Episodic Logic

Association for Computational Linguistics, 2021 (article)

We present a system for learning generalized, stereotypical patterns of events (or "schemas") from natural language stories, and applying them to make predictions about other stories. Our schemas are represented with Episodic Logic, a logical form that closely mirrors natural language. By beginning with a "head start" set of protoschemas (schemas that a 1- or 2-year-old child would likely know), we can obtain useful, general world knowledge with very few story examples, often only one or two. Learned schemas can be combined into more complex, composite schemas, and used to make predictions in other stories where only partial information is available.

Improving Abstractive Summarization with Commonsense Knowledge

Association for Computational Linguistics, 2021 (article)

Large-scale pretrained models have demonstrated strong performance on several natural language generation and understanding benchmarks. However, introducing commonsense into them to generate more realistic text remains a challenge. Inspired by previous work on commonsense knowledge generation and generative commonsense reasoning, we introduce two methods to add commonsense reasoning skills and knowledge into abstractive summarization models. Both methods beat the baseline on ROUGE scores, demonstrating the superiority of our models over the baseline. Human evaluation results suggest that summaries generated by our methods are more realistic and have fewer commonsensical errors.

Does It Happen? Multi-hop Path Structures for Event Factuality Prediction with Graph Transformer Networks

Association for Computational Linguistics, 2021 (article)

The goal of Event Factuality Prediction (EFP) is to determine the factual degree of an event mention, representing how likely it is that the mentioned event has happened. Current deep learning models have demonstrated the importance of the syntactic and semantic structures of sentences for identifying important context words for EFP. However, the major problem with these EFP models is that they only encode the one-hop paths between the words (i.e., the direct connections) to form the sentence structures. In this work, we show that the multi-hop paths between the words are also necessary to compute the sentence structures for EFP. To this end, we introduce a novel deep learning model for EFP that explicitly considers multi-hop paths with both syntax-based and semantic-based edges between the words to obtain sentence structures for representation learning in EFP. We demonstrate the effectiveness of the proposed model through extensive experiments.
