Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

TWUG: مورد كبير من الرسوم البيانية استخدام كلمة DIACHRONIC بأربع لغات

625 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

إجابة احتمالية أعلى usage graphs dwug استخدام الرسوم البيانية تليك صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Word meaning is notoriously difficult to capture, both synchronically and diachronically. In this paper, we describe the creation of the largest resource of graded contextualized, diachronic word meaning annotation in four different languages, based on 100,000 human semantic proximity judgments. We describe in detail the multi-round incremental annotation process, the choice for a clustering algorithm to group usages into senses, and possible -- diachronic and synchronic -- uses for this dataset.

References used

https://aclanthology.org/

rate research

Modeling Sense Structure in Word Usage Graphs with the Weighted Stochastic Block Model

718 - Association for Computation Linguistics 2021 مقالة

We suggest to model human-annotated Word Usage Graphs capturing fine-grained semantic proximity distinctions between word uses with a Bayesian formulation of the Weighted Stochastic Block Model, a generative model for random graphs popular in biology , physics and social sciences. By providing a probabilistic model of graded word meaning we aim to approach the slippery and yet widely used notion of word sense in a novel way. The proposed framework enables us to rigorously compare models of word senses with respect to their fit to the data. We perform extensive experiments and select the empirically most adequate model.

weighted stochastic block stochastic block model word usage graphs بلوك ستوكاستيك المرجح كتلة ستوكاستيك نموذج الرسوم البيانية استخدام كلمة صناعة حمض الفوسفور المزيد..

Learning grounded word meaning representations on similarity graphs

1084 - Association for Computation Linguistics 2021 مقالة

This paper introduces a novel approach to learn visually grounded meaning representations of words as low-dimensional node embeddings on an underlying graph hierarchy. The lower level of the hierarchy models modality-specific word representations, co nditioned to another modality, through dedicated but communicating graphs, while the higher level puts these representations together on a single graph to learn a representation jointly from both modalities. The topology of each graph models similarity relations among words, and is estimated jointly with the graph embedding. The assumption underlying this model is that words sharing similar meaning correspond to communities in an underlying graph in a low-dimensional space. We named this model Hierarchical Multi-Modal Similarity Graph Embedding (HM-SGE). Experimental results validate the ability of HM-SGE to simulate human similarity judgments and concept categorization, outperforming the state of the art.

learning grounded word grounded meaning representations learning grounded تعلم الكلمة المحددة تعني المعنى المحدد تعلم الأساس صناعة حمض الفوسفور المزيد..

Relation Extraction with Word Graphs from N-grams

751 - Association for Computation Linguistics 2021 مقالة

Most recent studies for relation extraction (RE) leverage the dependency tree of the input sentence to incorporate syntax-driven contextual information to improve model performance, with little attention paid to the limitation where high-quality depe ndency parsers in most cases unavailable, especially for in-domain scenarios. To address this limitation, in this paper, we propose attentive graph convolutional networks (A-GCN) to improve neural RE methods with an unsupervised manner to build the context graph, without relying on the existence of a dependency parser. Specifically, we construct the graph from n-grams extracted from a lexicon built from pointwise mutual information (PMI) and apply attention over the graph. Therefore, different word pairs from the contexts within and across n-grams are weighted in the model and facilitate RE accordingly. Experimental results with further analyses on two English benchmark datasets for RE demonstrate the effectiveness of our approach, where state-of-the-art performance is observed on both datasets.

محاذاة الكيان high-quality dependency parsers محلل التبعية عالية الجودة صناعة حمض الفوسفور

Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT

671 - Association for Computation Linguistics 2021 مقالة

Infusing factual knowledge into pre-trained models is fundamental for many knowledge-intensive tasks. In this paper, we proposed Mixture-of-Partitions (MoP), an infusion approach that can handle a very large knowledge graph (KG) by partitioning it in to smaller sub-graphs and infusing their specific knowledge into various BERT models using lightweight adapters. To leverage the overall factual knowledge for a target task, these sub-graph adapters are further fine-tuned along with the underlying BERT through a mixture layer. We evaluate our MoP with three biomedical BERTs (SciBERT, BioBERT, PubmedBERT) on six downstream tasks (inc. NLI, QA, Classification), and the results show that our MoP consistently enhances the underlying BERTs in task performance, and achieves new SOTA performances on five evaluated datasets.

large knowledge graph biomedical knowledge graphs infusing large biomedical الرسم البياني المعرفة الكبيرة رسوم المعرفة الطبية الحيوية الرسوم البيانية غرس Biomedical كبيرة صناعة حمض الفوسفور المزيد..

Generating Diverse Descriptions from Semantic Graphs

769 - Association for Computation Linguistics 2021 مقالة

Text generation from semantic graphs is traditionally performed with deterministic methods, which generate a unique description given an input graph. However, the generation problem admits a range of acceptable textual outputs, exhibiting lexical, sy ntactic and semantic variation. To address this disconnect, we present two main contributions. First, we propose a stochastic graph-to-text model, incorporating a latent variable in an encoder-decoder model, and its use in an ensemble. Second, to assess the diversity of the generated sentences, we propose a new automatic evaluation metric which jointly evaluates output diversity and quality in a multi-reference setting. We evaluate the models on WebNLG datasets in English and Russian, and show an ensemble of stochastic models produces diverse sets of generated sentences while, retaining similar quality to state-of-the-art models.

generating diverse descriptions semantic graphs generating diverse توليد أوصاف متنوعة الرسوم البيانية الدلالية توليد متنوع صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

TWUG: مورد كبير من الرسوم البيانية استخدام كلمة DIACHRONIC بأربع لغات

Ask ChatGPT about the research

Read More

suggested questions