Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Learning Universal Authorship Representations

تعلم تمثيلات التأليف الشامل

618 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

learning universal authorship learning universal universal authorship representations تعلم التأليف العالمي تعلم عالمي تمثيلات التأليف الشامل صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Determining whether two documents were composed by the same author, also known as authorship verification, has traditionally been tackled using statistical methods. Recently, authorship representations learned using neural networks have been found to outperform alternatives, particularly in large-scale settings involving hundreds of thousands of authors. But do such representations learned in a particular domain transfer to other domains? Or are these representations inherently entangled with domain-specific features? To study these questions, we conduct the first large-scale study of cross-domain transfer for authorship verification considering zero-shot transfers involving three disparate domains: Amazon reviews, fanfiction short stories, and Reddit comments. We find that although a surprising degree of transfer is possible between certain domains, it is not so successful between others. We examine properties of these domains that influence generalization and propose simple but effective methods to improve transfer.

References used

https://aclanthology.org/

rate research

Learning Cross-lingual Representations for Event Coreference Resolution with Multi-view Alignment and Optimal Transport

965 - Association for Computation Linguistics 2021 مقالة

We study a new problem of cross-lingual transfer learning for event coreference resolution (ECR) where models trained on data from a source language are adapted for evaluations in different target languages. We introduce the first baseline model for this task based on XLM-RoBERTa, a state-of-the-art multilingual pre-trained language model. We also explore language adversarial neural networks (LANN) that present language discriminators to distinguish texts from the source and target languages to improve the language generalization for ECR. In addition, we introduce two novel mechanisms to further enhance the general representation learning of LANN, featuring: (i) multi-view alignment to penalize cross coreference-label alignment of examples in the source and target languages, and (ii) optimal transport to select close examples in the source and target languages to provide better training signals for the language discriminators. Finally, we perform extensive experiments for cross-lingual ECR from English to Spanish and Chinese to demonstrate the effectiveness of the proposed methods.

متخصصة لغة متعددة اللغات صناعة حمض الفوسفور

A Call for Clarity in Contemporary Authorship Attribution Evaluation

876 - Association for Computation Linguistics 2021 مقالة

Recent research has documented that results reported in frequently-cited authorship attribution papers are difficult to reproduce. Inaccessible code and data are often proposed as factors which block successful reproductions. Even when original mater ials are available, problems remain which prevent researchers from comparing the effectiveness of different methods. To solve the remaining problems---the lack of fixed test sets and the use of inappropriately homogeneous corpora---our paper contributes materials for five closed-set authorship identification experiments. The five experiments feature texts from 106 distinct authors. Experiments involve a range of contemporary non-fiction American English prose. These experiments provide the foundation for comparable and reproducible authorship attribution research involving contemporary writing.

call for clarity authorship attribution evaluation attribution evaluation دعوة للوضوح تقييم إسناد التأليف تقييم الإسناد صناعة حمض الفوسفور المزيد..

Knowledge Router: Learning Disentangled Representations for Knowledge Graphs

708 - Association for Computation Linguistics 2021 مقالة

The design of expressive representations of entities and relations in a knowledge graph is an important endeavor. While many of the existing approaches have primarily focused on learning from relational patterns and structural information, the intrin sic complexity of KG entities has been more or less overlooked. More concretely, we hypothesize KG entities may be more complex than we think, i.e., an entity may wear many hats and relational triplets may form due to more than a single reason. To this end, this paper proposes to learn disentangled representations of KG entities - a new method that disentangles the inner latent properties of KG entities. Our disentangled process operates at the graph level and a neighborhood mechanism is leveraged to disentangle the hidden properties of each entity. This disentangled representation learning approach is model agnostic and compatible with canonical KG embedding approaches. We conduct extensive experiments on several benchmark datasets, equipping a variety of models (DistMult, SimplE, and QuatE) with our proposed disentangling mechanism. Experimental results demonstrate that our proposed approach substantially improves performance on key metrics.

knowledge router router راوتر المعرفة راوتر صناعة حمض الفوسفور

Authorship Methodology in the Biography of the Prophet The Pioneers Stage

2555 - Tishreen University 2017 ورقة بحثية

The prophet's bibliography is considered as one of the oldest Islamic literature works. These works are historically significant for two reasons. On the first hand, they are seen as manuscripts documenting the life of the Holy Prophet Muhammad (pea ce is upon him). On the other hand, they predict for the Arab thought methodology in the outsets of entering the field of writing and methodological authorship. This research focuses on the concepts of the prophet's bibliography, the reasons behind writing it, its resources and its harbingers. Moreover, it studies the content of five books exploring the writing methodology in them. These books, which are considered as the first stage of this type of writings, have been chosen in an effort to gain access to the results that show the authorship methodology in the case of the pioneers. In addition, this research highlights their role in establishing for writing in other types.

methodology منهج السيرة النبوية تأليف The prophet's bibliography authorship

Developing a Benchmark for Reducing Data Bias in Authorship Attribution

613 - Association for Computation Linguistics 2021 مقالة

Authorship attribution is the task of assigning an unknown document to an author from a set of candidates. In the past, studies in this field use various evaluation datasets to demonstrate the effectiveness of preprocessing steps, features, and model s. However, only a small fraction of works use more than one dataset to prove claims. In this paper, we present a collection of highly diverse authorship attribution datasets, which better generalizes evaluation results from authorship attribution research. Furthermore, we implement a wide variety of previously used machine learning models and show that many approaches show vastly different performances when applied to different datasets. We include pre-trained language models, for the first time testing them in this field in a systematic way. Finally, we propose a set of aggregated scores to evaluate different aspects of the dataset collection.

reducing data bias benchmark for reducing reducing data تقليل تحيز البيانات معيار للحد من تقليل البيانات صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Learning Universal Authorship Representations

تعلم تمثيلات التأليف الشامل

Ask ChatGPT about the research

Read More

suggested questions