Research papers, master and doctoral theses about صفر بالرصاص البثق العصبي

Fine-tuning Encoders for Improved Monolingual and Zero-shot Polylingual Neural Topic Modeling

219 - Association for Computation Linguistics 2021 مقالة

Neural topic models can augment or replace bag-of-words inputs with the learned representations of deep pre-trained transformer-based word prediction models. One added benefit when using representations from multilingual models is that they facilitat e zero-shot polylingual topic modeling. However, while it has been widely observed that pre-trained embeddings should be fine-tuned to a given task, it is not immediately clear what supervision should look like for an unsupervised task such as topic modeling. Thus, we propose several methods for fine-tuning encoders to improve both monolingual and zero-shot polylingual neural topic modeling. We consider fine-tuning on auxiliary tasks, constructing a new topic classification task, integrating the topic classification objective directly into topic model training, and continued pre-training. We find that fine-tuning encoder representations on topic classification and integrating the topic classification task directly into topic modeling improves topic quality, and that fine-tuning encoder representations on any task is the most important factor for facilitating cross-lingual transfer.

zero-shot polylingual neural polylingual neural topic neural topic modeling صفر بالرصاص البثق العصبي البوللينلينلينجلين العصبي الموضوع نمذجة الموضوع العصبي صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد