New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Naive Bayes versus BERT: Jupyter notebook assignments for an introductory NLP course

بايس ساذجة مقابل بيرت: مهام دفتر Jupyter لدورة غير محددة

421 1 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We describe two Jupyter notebooks that form the basis of two assignments in an introductory Natural Language Processing (NLP) module taught to final year undergraduate students at Dublin City University. The notebooks show the students how to train a bag-of-words polarity classifier using multinomial Naive Bayes, and how to fine-tune a polarity classifier using BERT. The students take the code as a starting point for their own experiments.

References used

https://aclanthology.org/

rate research

Naive Bayes-based Experiments in Romanian Dialect Identification

337 - Association for Computation Linguistics 2021 مقالة

This article describes the experiments and systems developed by the SUKI team for the second edition of the Romanian Dialect Identification (RDI) shared task which was organized as part of the 2021 VarDial Evaluation Campaign. We submitted two runs t o the shared task and our second submission was the overall best submission by a noticeable margin. Our best submission used a character n-gram based naive Bayes classifier with adaptive language models. We describe our experiments on the development set leading to both submissions.

romanian dialect identification dialect identification romanian dialect الهوية الرومانية الهوية تحديد الهياكل لهجة رومانية صناعة حمض الفوسفور المزيد..

Contemporary NLP Modeling in Six Comprehensive Programming Assignments

306 - Association for Computation Linguistics 2021 مقالة

We present a series of programming assignments, adaptable to a range of experience levels from advanced undergraduate to PhD, to teach students design and implementation of modern NLP systems. These assignments build from the ground up and emphasize full-stack understanding of machine learning models: initially, students implement inference and gradient computation by hand, then use PyTorch to build nearly state-of-the-art neural networks using current best practices. Topics are chosen to cover a wide range of modeling and inference techniques that one might encounter, ranging from linear models suitable for industry applications to state-of-the-art deep learning models used in NLP research. The assignments are customizable, with constrained options to guide less experienced students or open-ended options giving advanced students freedom to explore. All of them can be deployed in a fully autogradable fashion, and have collectively been tested on over 300 students across several semesters.

comprehensive programming assignments contemporary nlp modeling comprehensive programming تعيينات البرمجة الشاملة النمذجة NLP المعاصرة البرمجة الشاملة صناعة حمض الفوسفور المزيد..

EstBERT: A Pretrained Language-Specific BERT for Estonian

164 - Association for Computation Linguistics 2021 مقالة

This paper presents EstBERT, a large pretrained transformer-based language-specific BERT model for Estonian. Recent work has evaluated multilingual BERT models on Estonian tasks and found them to outperform the baselines. Still, based on existing stu dies on other languages, a language-specific BERT model is expected to improve over the multilingual ones. We first describe the EstBERT pretraining process and then present the models' results based on the finetuned EstBERT for multiple NLP tasks, including POS and morphological tagging, dependency parsing, named entity recognition and text classification. The evaluation results show that the models based on EstBERT outperform multilingual BERT models on five tasks out of seven, providing further evidence towards a view that training language-specific BERT models are still useful, even when multilingual models are available.

language-specific bert pretrained language-specific bert language-specific bert model بيرت محددة باللغة برت لغة محددة مسبقا لغة بيرت الخاصة باللغة صناعة حمض الفوسفور المزيد..

Unsupervised Data Augmentation with Naive Augmentation and without Unlabeled Data

331 - Association for Computation Linguistics 2021 مقالة

Unsupervised Data Augmentation (UDA) is a semisupervised technique that applies a consistency loss to penalize differences between a model's predictions on (a) observed (unlabeled) examples; and (b) corresponding noised' examples produced via data au gmentation. While UDA has gained popularity for text classification, open questions linger over which design decisions are necessary and how to extend the method to sequence labeling tasks. In this paper, we re-examine UDA and demonstrate its efficacy on several sequential tasks. Our main contribution is an empirical study of UDA to establish which components of the algorithm confer benefits in NLP. Notably, although prior work has emphasized the use of clever augmentation techniques including back-translation, we find that enforcing consistency between predictions assigned to observed and randomly substituted words often yields comparable (or greater) benefits compared to these more complex perturbation models. Furthermore, we find that applying UDA's consistency loss affords meaningful gains without any unlabeled data at all, i.e., in a standard supervised setting. In short, UDA need not be unsupervised to realize much of its noted benefits, and does not require complex data augmentation to be effective.

unsupervised data augmentation naive augmentation تكبير البيانات غير المزعجة زيادة ساذجة صناعة حمض الفوسفور

Adapting BERT for Continual Learning of a Sequence of Aspect Sentiment Classification Tasks

274 - Association for Computation Linguistics 2021 مقالة

This paper studies continual learning (CL) of a sequence of aspect sentiment classification (ASC) tasks. Although some CL techniques have been proposed for document sentiment classification, we are not aware of any CL work on ASC. A CL system that in crementally learns a sequence of ASC tasks should address the following two issues: (1) transfer knowledge learned from previous tasks to the new task to help it learn a better model, and (2) maintain the performance of the models for previous tasks so that they are not forgotten. This paper proposes a novel capsule network based model called B-CL to address these issues. B-CL markedly improves the ASC performance on both the new task and the old tasks via forward and backward knowledge transfer. The effectiveness of B-CL is demonstrated through extensive experiments.

aspect sentiment classification adapting bert sequence of aspect تصنيف معنويات الجانب تكييف بيرت تسلسل الجانب صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Naive Bayes versus BERT: Jupyter notebook assignments for an introductory NLP course

بايس ساذجة مقابل بيرت: مهام دفتر Jupyter لدورة غير محددة

Ask ChatGPT about the research

Read More

suggested questions