The FairyNet Corpus - Character Networks for German Fairy Tales

Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor

This paper presents a data set of German fairy tales, manually annotated with character networks that were obtained with high inter-rater agreement. The release of this corpus provides an opportunity to train and compare different algorithms for the extraction of character networks, which so far was barely possible due to the heterogeneous interests of previous researchers. We demonstrate the usefulness of our data set by providing baseline experiments for the automatic extraction of character networks, applying a rule-based pipeline as well as a neural approach, and find that the neural approach outperforms the rule-based approach in most evaluation settings.
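
To make the notion of a character network concrete, here is a minimal sketch that links two characters whenever their names occur in the same sentence. This is an illustrative assumption, not the rule-based pipeline or the neural approach from the paper: the function name `cooccurrence_network`, the naive sentence splitting, the fixed list of character names, and the toy text are all hypothetical.

```python
import re
from collections import Counter
from itertools import combinations


def cooccurrence_network(text, characters):
    """Count how often two character names appear in the same sentence.

    Hypothetical baseline for illustration only: the character list is
    given up front, and sentences are split naively on ., ! and ?.
    """
    edges = Counter()
    for sentence in re.split(r"[.!?]+", text):
        tokens = set(re.findall(r"\w+", sentence.lower()))
        # Characters mentioned in this sentence, in a stable order
        present = sorted(c for c in characters if c.lower() in tokens)
        for a, b in combinations(present, 2):
            edges[(a, b)] += 1
    return edges


# Toy text, not taken from the corpus
tale = (
    "Der Vater brachte Gretel in den Wald. "
    "Die Hexe fing Gretel. "
    "Gretel besiegte die Hexe."
)
network = cooccurrence_network(tale, ["Gretel", "Hexe", "Vater"])
for (a, b), weight in sorted(network.items()):
    print(a, b, weight)
```

The edge weights form a weighted, undirected graph over the characters. A manually annotated corpus such as the one described above is what would allow such automatically extracted graphs to be scored against a gold standard.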

References used

https://aclanthology.org/

Peer-to-peer networks for file sharing

Damascus University, 2011 (research paper)

The advantage of the peer-to-peer (P2P) paradigm relies on two main concepts: cooperation among users and resource sharing. There are many applications based on the peer-to-peer paradigm, but the most popular one is file sharing. File-sharing applications can be classified into centralized systems, which have a central server, and decentralized systems. Another classification distinguishes structured from unstructured systems, based on how the indexing information is managed. In this paper, we implement a centralized peer-to-peer application for file sharing and then evaluate the performance of the system by means of simulation.

peer-to-peer, file sharing, centralized systems, directory server, hybrid networks, structured systems
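
The centralized design mentioned in the abstract can be sketched as a directory server that only stores which peer shares which file, while the file transfer itself happens between peers. This is a minimal in-memory sketch under stated assumptions, not the application implemented in the paper: the class `DirectoryServer`, its methods, and the peer addresses are hypothetical.

```python
from collections import defaultdict


class DirectoryServer:
    """Hypothetical central index mapping file names to the peers sharing them."""

    def __init__(self):
        self._index = defaultdict(set)

    def register(self, peer, files):
        # A peer announces the files it is willing to share
        for name in files:
            self._index[name].add(peer)

    def unregister(self, peer):
        # A peer leaving the network is removed from every entry
        for peers in self._index.values():
            peers.discard(peer)

    def lookup(self, name):
        # .get avoids creating empty entries for unknown file names
        return sorted(self._index.get(name, ()))


server = DirectoryServer()
server.register("peer-a:9000", ["song.mp3", "notes.txt"])
server.register("peer-b:9000", ["song.mp3"])
holders = server.lookup("song.mp3")
print(holders)
```

A requesting peer would contact one of the returned addresses directly to download the file. The server is a single point of failure, which is the usual trade-off of centralized systems compared with decentralized ones.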

The GLAUx corpus: methodological issues in designing a long-term, diverse, multi-layered corpus of Ancient Greek

Association for Computational Linguistics, 2021 (article)

This paper describes the GLAUx project ("the Greek Language Automated"), an ongoing effort to develop a large, long-term diachronic corpus of Greek covering sixteen centuries of literary and non-literary material annotated with NLP methods. After providing an overview of related corpus projects and discussing the general architecture of the corpus, it examines several larger methodological issues in the design of historical corpora: the encoding of textual variants, the handling of extralinguistic variation, and the annotation of linguistic ambiguity. Finally, the long- and short-term perspectives of the project are discussed.

Greek Language Automated, Ancient Greek, Greek language

Light Verb Constructions and Their Families - A Corpus Study on German 'stehen unter'-LVCs

Association for Computational Linguistics, 2021 (article)

The paper reports on a corpus study of German light verb constructions (LVCs). LVCs come in families that exemplify systematic interpretation patterns. The paper aims to account for the properties that determine these patterns on the basis of a corpus study of German LVCs of the type 'stehen unter NP' ('stand under NP').

German light verb, corpus study

Summarization of German Court Rulings

Association for Computational Linguistics, 2021 (article)

German legal language has been widely neglected in NLP research, especially in summarization, since most summarization systems are based on English newspaper articles. In this paper, we propose the task of automatic summarization of German court rulings. Because rulings are long and complex, it is critically important that legal practitioners can quickly identify the content of a verdict and thus decide on its relevance for a given legal case. To tackle this problem, we introduce a new dataset consisting of 100k German judgments with short summaries. Our dataset has the highest compression ratio among the most common summarization datasets. German court rulings contain much structural information, so we create a pre-processing pipeline tailored explicitly to the German legal domain. Additionally, we implement multiple extractive as well as abstractive summarization systems and build a wide variety of baseline models. Our best model achieves a ROUGE-1 score of 30.50. With this work, we lay the groundwork for further research on German summarization systems.

German court rulings, German court, court rulings
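
For readers unfamiliar with the metric quoted in the abstract, ROUGE-1 measures unigram overlap between a system summary and a reference summary. The sketch below is a simplified illustration, assuming lowercased whitespace tokenization and no stemming; the abstract does not say which ROUGE variant or toolkit produced the reported 30.50, and the function name `rouge1` and the example sentences are hypothetical.

```python
from collections import Counter


def rouge1(reference, candidate):
    """Return (precision, recall, F1) of unigram overlap.

    Simplified sketch: lowercased whitespace tokens, no stemming.
    """
    ref = Counter(reference.lower().split())
    cand = Counter(candidate.lower().split())
    # Clipped overlap: each token counts at most as often as it
    # appears in both the reference and the candidate
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy sentences, not taken from the dataset
reference = "das gericht weist die klage ab"
candidate = "das gericht weist die berufung ab"
precision, recall, f1 = rouge1(reference, candidate)
print(round(100 * f1, 2))
```

Scores are commonly reported multiplied by 100, which is how a value such as 30.50 would typically be read.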

Glyph Enhanced Chinese Character Pre-Training for Lexical Sememe Prediction

Association for Computational Linguistics, 2021 (article)

Sememes are defined as the atomic units used to describe the semantic meaning of concepts. Because sememes are difficult to annotate manually and annotations are inconsistent between experts, the lexical sememe prediction task has been proposed. However, previous methods rely heavily on word or character embeddings and ignore fine-grained information. In this paper, we propose a novel pre-training method designed to better incorporate the internal information of Chinese characters. The Glyph enhanced Chinese Character representation (GCC) is used to assist sememe prediction. We evaluate our model on HowNet, a well-known sememe knowledge base. The experimental results show that our method outperforms existing models that do not use external information.

lexical sememe prediction, enhanced Chinese character, glyph enhanced Chinese
