ثورة الشبكات العصبية العميقة في العديد من المجالات، بما في ذلك معالجة اللغة الطبيعية.تحدد هذه الورقة مواد تعليمية لمحاضرة تمهيدية بشأن التعلم العميق في معالجة اللغة الطبيعية (NLP).تغطي المواد المقدمة الرئيسية محاضرة مدرسية صيفية حول نماذج ترميز التشفير.التكميلية لهذا هي مجموعة من النزلات دفتر Jupyter من التدريس المبكر، والتي استندت فيها أجزاء من المحاضرة.الهدف الرئيسي من المواد التعليمية هذه هو تقديم نظرة عامة على نهج الشبكة العصبية لمعالجة اللغة الطبيعية، مع ترابط المفاهيم الحديثة إلى الجذور تظهر نظيراتها الأساسية التقليدية.تغادر المحاضرة من الأساليب الإحصائية القائمة على العد، ويمتد إلى الشبكات المتكررة المتكررة والاهتمام، وهو في كل مكان في NLP اليوم.
Deep neural networks have revolutionized many fields, including Natural Language Processing. This paper outlines teaching materials for an introductory lecture on deep learning in Natural Language Processing (NLP). The main submitted material covers a summer school lecture on encoder-decoder models. Complementary to this is a set of jupyter notebook slides from earlier teaching, on which parts of the lecture were based on. The main goal of this teaching material is to provide an overview of neural network approaches to natural language processing, while linking modern concepts back to the roots showing traditional essential counterparts. The lecture departs from count-based statistical methods and spans up to gated recurrent networks and attention, which is ubiquitous in today's NLP.
References used
https://aclanthology.org/
Deep neural language models such as BERT have enabled substantial recent advances in many natural language processing tasks. However, due to the effort and computational cost involved in their pre-training, such models are typically introduced only f
We consider the task of linking social media accounts that belong to the same author in an automated fashion on the basis of the content and meta-data of the corresponding document streams. We focus on learning an embedding that maps variable-sized s
Concept normalization of clinical texts to standard medical classifications and ontologies is a task with high importance for healthcare and medical research. We attempt to solve this problem through automatic SNOMED CT encoding, where SNOMED CT is o
Active learning has been shown to reduce annotation requirements for numerous natural language processing tasks, including semantic role labeling (SRL). SRL involves labeling argument spans for potentially multiple predicates in a sentence, which mak
The exponential growth of the internet and social media in the past decade gave way to the increase in dissemination of false or misleading information. Since the 2016 US presidential election, the term fake news'' became increasingly popular and thi