Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Foundations of Statistical Natural Language Processing

أساسيات معالجة اللغات الطبيعية الإحصائية

4456 1 93 2.0 ( 1 )

Download Cite

Added by MIT press كتاب

Publication date 1999

fields Informatics Engineering

and research's language is العربية

Authors Christopher Manning( باحث )

Created by Shadi Saleh

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Statistical approaches to processing natural language text have become dominant in recent years. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تُعدّ كتاب 'أسس معالجة اللغة الطبيعية الإحصائية' من تأليف كريستوفر دي مانينغ وهينريش شوتزه أحد الكتب المرجعية في مجال معالجة اللغة الطبيعية باستخدام الأساليب الإحصائية. يتناول الكتاب مواضيع متعددة تشمل الأسس الرياضية، والمفاهيم اللغوية الأساسية، والعمل القائم على النصوص، والنماذج الإحصائية مثل نماذج n-gram، والتصنيف، والتجميع، والترجمة الآلية، واسترجاع المعلومات. يتميز الكتاب بتقديم توازن بين النظرية والتطبيق، حيث يشرح المفاهيم النظرية الأساسية في الاحتمالات والإحصاء ونظرية المعلومات واللغويات، ثم يطبق هذه المفاهيم على مشاكل معالجة اللغة الطبيعية مثل التصنيف، والتجزئة، والتجميع، والترجمة الآلية. يتضمن الكتاب أيضًا العديد من الأمثلة العملية والتمارين التي تهدف إلى تعزيز الفهم وتطبيق المفاهيم النظرية على مشاكل حقيقية.

Critical review

دراسة نقدية: يُعتبر الكتاب مرجعًا شاملاً ومهمًا في مجال معالجة اللغة الطبيعية الإحصائية، ولكنه يفتقر إلى تغطية بعض المواضيع الحديثة مثل التعلم العميق والشبكات العصبية التي أصبحت ذات أهمية كبيرة في هذا المجال. كما أن الكتاب يركز بشكل كبير على الأساليب الإحصائية التقليدية، مما قد يجعله أقل جاذبية للباحثين الذين يفضلون الأساليب الحديثة. بالإضافة إلى ذلك، قد يجد القارئ صعوبة في متابعة بعض الأجزاء النظرية المعقدة دون خلفية قوية في الرياضيات والإحصاء.

Questions related to the research

ما هي المواضيع الأساسية التي يغطيها الكتاب؟

يغطي الكتاب مواضيع متعددة تشمل الأسس الرياضية، والمفاهيم اللغوية الأساسية، والعمل القائم على النصوص، والنماذج الإحصائية مثل نماذج n-gram، والتصنيف، والتجميع، والترجمة الآلية، واسترجاع المعلومات.
كيف يوازن الكتاب بين النظرية والتطبيق؟

يشرح الكتاب المفاهيم النظرية الأساسية في الاحتمالات والإحصاء ونظرية المعلومات واللغويات، ثم يطبق هذه المفاهيم على مشاكل معالجة اللغة الطبيعية مثل التصنيف، والتجزئة، والتجميع، والترجمة الآلية.
ما هي بعض الانتقادات الموجهة للكتاب؟

يُنتقد الكتاب لافتقاره إلى تغطية المواضيع الحديثة مثل التعلم العميق والشبكات العصبية، ولتركيزه الكبير على الأساليب الإحصائية التقليدية. كما قد يجد القارئ صعوبة في متابعة بعض الأجزاء النظرية المعقدة دون خلفية قوية في الرياضيات والإحصاء.
ما هي الفائدة العملية من دراسة هذا الكتاب؟

الفائدة العملية تشمل فهم وتطبيق المفاهيم النظرية على مشاكل حقيقية في معالجة اللغة الطبيعية مثل التصنيف، والتجزئة، والتجميع، والترجمة الآلية، مما يعزز من قدرة الباحثين والمطورين على تطوير حلول فعالة في هذا المجال.

Keywords

معالجة اللغة الطبيعية النماذج الإحصائية التصنيف التجميع الترجمة الآلية استرجاع المعلومات نظرية الاحتمالات نظرية المعلومات

References used

https://nlp.stanford.edu/fsnlp/promo/

rate research

Gender Bias in Natural Language Processing Across Human Languages

917 - Association for Computation Linguistics 2021 مقالة

Natural Language Processing (NLP) systems are at the heart of many critical automated decision-making systems making crucial recommendations about our future world. Gender bias in NLP has been well studied in English, but has been less studied in oth er languages. In this paper, a team including speakers of 9 languages - Chinese, Spanish, English, Arabic, German, French, Farsi, Urdu, and Wolof - reports and analyzes measurements of gender bias in the Wikipedia corpora for these 9 languages. We develop extensions to profession-level and corpus-level gender bias metric calculations originally designed for English and apply them to 8 other languages, including languages that have grammatically gendered nouns including different feminine, masculine, and neuter profession words. We discuss future work that would benefit immensely from a computational linguistics perspective.

مشكلة تقسيم زمرة language processing human languages معالجة اللغة لغات بشرية صناعة حمض الفوسفور

Improving Policing with Natural Language Processing

993 - Association for Computation Linguistics 2021 مقالة

This article explores the potential for Natural Language Processing (NLP) to enable a more effective, prevention focused and less confrontational policing model that has hitherto been too resource consuming to implement at scale. Problem-Oriented Pol icing (POP) is a potential replacement, at least in part, for traditional policing which adopts a reactive approach, relying heavily on the criminal justice system. By contrast, POP seeks to prevent crime by manipulating the underlying conditions that allow crimes to be committed. Identifying these underlying conditions requires a detailed understanding of crime events - tacit knowledge that is often held by police officers but which can be challenging to derive from structured police data. One potential source of insight exists in unstructured free text data commonly collected by police for the purposes of investigation or administration. Yet police agencies do not typically have the skills or resources to analyse these data at scale. In this article we argue that NLP offers the potential to unlock these unstructured data and by doing so allow police to implement more POP initiatives. However we caution that using NLP models without adequate knowledge may either allow or perpetuate bias within the data potentially leading to unfavourable outcomes.

قانون جنائي صناعة حمض الفوسفور

Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research

1307 - Association for Computation Linguistics 2021 مقالة

Natural language processing (NLP) research combines the study of universal principles, through basic science, with applied science targeting specific use cases and settings. However, the process of exchange between basic NLP and applications is often assumed to emerge naturally, resulting in many innovations going unapplied and many important questions left unstudied. We describe a new paradigm of Translational NLP, which aims to structure and facilitate the processes by which basic and applied NLP research inform one another. Translational NLP thus presents a third research paradigm, focused on understanding the challenges posed by application needs and how these challenges can drive innovation in basic science and technology design. We show that many significant advances in NLP research have emerged from the intersection of basic principles with application needs, and present a conceptual framework outlining the stakeholders and key questions in translational research. Our framework provides a roadmap for developing Translational NLP as a dedicated research area, and identifies general translational principles to facilitate exchange between basic and applied research.

Dynabench. صناعة حمض الفوسفور

Robustness and Adversarial Examples in Natural Language Processing

1122 - Association for Computation Linguistics 2021 مقالة

Recent studies show that many NLP systems are sensitive and vulnerable to a small perturbation of inputs and do not generalize well across different datasets. This lack of robustness derails the use of NLP systems in real-world applications. This tut orial aims at bringing awareness of practical concerns about NLP robustness. It targets NLP researchers and practitioners who are interested in building reliable NLP systems. In particular, we will review recent studies on analyzing the weakness of NLP systems when facing adversarial inputs and data with a distribution shift. We will provide the audience with a holistic view of 1) how to use adversarial examples to examine the weakness of NLP models and facilitate debugging; 2) how to enhance the robustness of existing NLP models and defense against adversarial inputs; and 3) how the consideration of robustness affects the real-world NLP applications used in our daily lives. We will conclude the tutorial by outlining future research directions in this area.

answering صناعة حمض الفوسفور

The Climate Change Debate and Natural Language Processing

947 - Association for Computation Linguistics 2021 مقالة

The debate around climate change (CC)---its extent, its causes, and the necessary responses---is intense and of global importance. Yet, in the natural language processing (NLP) community, this domain has so far received little attention. In contrast, it is of enormous prominence in various social science disciplines, and some of that work follows the ''text-as-data'' paradigm, seeking to employ quantitative methods for analyzing large amounts of CC-related text. Other research is qualitative in nature and studies details, nuances, actors, and motivations within CC discourses. Coming from both NLP and Political Science, and reviewing key works in both disciplines, we discuss how social science approaches to CC debates can inform advances in text-mining/NLP, and how, in return, NLP can support policy-makers and activists in making sense of large-scale and complex CC discourses across multiple genres, channels, topics, and communities. This is paramount for their ability to make rapid and meaningful impact on the discourse, and for shaping the necessary policy change.

إعادة التقديم والسؤال climate change debate مناقشة تغير المناخ صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Foundations of Statistical Natural Language Processing

أساسيات معالجة اللغات الطبيعية الإحصائية

Ask ChatGPT about the research

Read More

suggested questions