Developing a Clinical Language Model for Swedish: Continued Pretraining of Generic BERT with In-Domain Data

Added by Association for Computational Linguistics (article)

Publication date 2021

Field: Artificial Intelligence

Research language: English

Created by Shamra Editor

Abstract

The use of pretrained language models, fine-tuned to perform a specific downstream task, has become widespread in NLP. Using a generic language model in specialized domains may, however, be sub-optimal due to differences in language use and vocabulary. This paper investigates whether an existing, generic language model for Swedish can be improved for the clinical domain through continued pretraining on clinical text. The generic and domain-specific language models are fine-tuned and evaluated on three representative clinical NLP tasks: (i) identifying protected health information, (ii) assigning ICD-10 diagnosis codes to discharge summaries, and (iii) sentence-level uncertainty prediction. The results show that continued pretraining on in-domain data improves performance on all three downstream tasks, indicating a potential added value of domain-specific language models for clinical NLP.

References used

https://aclanthology.org/

Building a Swedish Open-Domain Conversational Language Model

Association for Computational Linguistics, 2021 (article)

We present ongoing work on evaluating what is, to our knowledge, the first large generative language model trained to converse in Swedish, using data from the online discussion forum Flashback. We conduct a human evaluation pilot study that indicates the model is often able to respond to conversations in both a human-like and informative manner, on a diverse set of topics. While data from online forums can be useful to build conversational systems, we reflect on the negative consequences that incautious application might have, and the need for taking active measures to safeguard against them.

Multilingual Domain Adaptation for NMT: Decoupling Language and Domain Information with Adapters

Association for Computational Linguistics, 2021 (article)

Adapter layers are lightweight, learnable units inserted between transformer layers. Recent work explores using such layers for neural machine translation (NMT), to adapt pre-trained models to new domains or language pairs, training only a small set of parameters for each new setting (language pair or domain). In this work we study the compositionality of language and domain adapters in the context of Machine Translation. We aim to study 1) parameter-efficient adaptation to multiple domains and languages simultaneously (full-resource scenario) and 2) cross-lingual transfer in domains where parallel data is unavailable for certain language pairs (partial-resource scenario). We find that in the partial-resource scenario a naive combination of domain-specific and language-specific adapters often results in 'catastrophic forgetting' of the missing languages. We study other ways to combine the adapters to alleviate this issue and maximize cross-lingual transfer. With our best adapter combinations, we obtain improvements of 3-4 BLEU on average for source languages that do not have in-domain data. For target languages without in-domain data, we achieve a similar improvement by combining adapters with back-translation. Supplementary material is available at https://tinyurl.com/r66stbxj.
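
To make the idea of an adapter layer concrete, here is a minimal sketch in plain Python. It assumes the commonly used bottleneck design (down-projection, nonlinearity, up-projection, residual connection) and combines a language adapter and a domain adapter by simple stacking. The abstract does not specify either choice, so the class name `BottleneckAdapter`, the helper `stack`, and all sizes are hypothetical and chosen only for illustration.

```python
import random


def relu(v):
    """Element-wise ReLU on a list of floats."""
    return [max(0.0, x) for x in v]


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


class BottleneckAdapter:
    """Hypothetical bottleneck adapter: h + W_up * relu(W_down * h)."""

    def __init__(self, d_model, d_bottleneck, seed=0):
        rng = random.Random(seed)
        # Small random down-projection (d_bottleneck x d_model).
        self.down = [[rng.gauss(0.0, 0.02) for _ in range(d_model)]
                     for _ in range(d_bottleneck)]
        # Zero-initialised up-projection (d_model x d_bottleneck), so an
        # untrained adapter leaves the hidden state unchanged.
        self.up = [[0.0] * d_bottleneck for _ in range(d_model)]

    def num_parameters(self):
        return sum(len(r) for r in self.down) + sum(len(r) for r in self.up)

    def __call__(self, h):
        z = relu(matvec(self.down, h))
        delta = matvec(self.up, z)
        # Residual connection: only a small correction is learned.
        return [x + d for x, d in zip(h, delta)]


def stack(h, adapters):
    """Assumed combination strategy: apply the adapters one after another."""
    for adapter in adapters:
        h = adapter(h)
    return h


language_adapter = BottleneckAdapter(d_model=8, d_bottleneck=2, seed=1)
domain_adapter = BottleneckAdapter(d_model=8, d_bottleneck=2, seed=2)
hidden = [0.5] * 8
out = stack(hidden, [language_adapter, domain_adapter])
```

In this sketch each adapter adds only 32 parameters to an 8-dimensional hidden state, which illustrates why training one per language pair or domain is cheap compared with fine-tuning the whole model.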

Small Model and In-Domain Data Are All You Need

Association for Computational Linguistics, 2021 (article)

I participated in the WMT shared news translation task and focus on one high-resource language pair: English and Chinese (two directions, Chinese to English and English to Chinese). The submitted systems (ZengHuiMT) focus on data cleaning, data selection, back translation and model ensembling. The techniques I used for data filtering and selection include filtering by rules, language model and word alignment. I used a base translation model trained on the initial corpus to obtain the target versions of the WMT21 test sets, then used language models to find the monolingual data most similar to the target version of the test set; this monolingual data was then used for back translation. On the test set, my best submitted systems achieve 35.9 and 32.2 BLEU for the English to Chinese and Chinese to English directions respectively, which are quite high for a small model.
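
The selection step described above (ranking monolingual sentences by their similarity to the translated test set under a language model) can be sketched roughly as follows. This is only an illustration: it swaps in an add-one-smoothed unigram model for whatever language models the author actually used, and the function names and toy sentences are hypothetical.

```python
import math
from collections import Counter


def train_unigram(sentences):
    """Count whitespace-separated tokens in the in-domain sentences."""
    counts = Counter(tok for s in sentences for tok in s.split())
    return counts, sum(counts.values())


def cross_entropy(sentence, counts, total, vocab_size):
    """Per-token cross-entropy under an add-one-smoothed unigram model."""
    toks = sentence.split()
    if not toks:
        return float("inf")
    logp = sum(math.log((counts[t] + 1) / (total + vocab_size)) for t in toks)
    return -logp / len(toks)


def select_similar(pool, in_domain, k):
    """Return the k pool sentences with the lowest cross-entropy."""
    counts, total = train_unigram(in_domain)
    vocab = set(counts) | {t for s in pool for t in s.split()}
    ranked = sorted(
        pool, key=lambda s: cross_entropy(s, counts, total, len(vocab))
    )
    return ranked[:k]


# Toy stand-ins for the machine-translated test set and the monolingual pool.
test_translations = [
    "the central bank raised interest rates",
    "the bank cut rates again",
]
monolingual_pool = [
    "the bank raised rates",
    "my cat sleeps all day",
    "interest rates were cut by the central bank",
]
selected = select_similar(monolingual_pool, test_translations, k=2)
```

Here the two finance-related sentences are kept and the unrelated one is dropped; the kept sentences would then be the input to back translation.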

Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model

Association for Computational Linguistics, 2021 (article)

With advances in neural language models, the focus of linguistic steganography has shifted from edit-based approaches to generation-based ones. While the latter's payload capacity is impressive, generating genuine-looking texts remains challenging. In this paper, we revisit edit-based linguistic steganography, with the idea that a masked language model offers an off-the-shelf solution. The proposed method eliminates painstaking rule construction and has a high payload capacity for an edit-based model. It is also shown to be more secure against automatic detection than a generation-based method while offering better control of the security/payload capacity trade-off.

Using Confidential Data for Domain Adaptation of Neural Machine Translation

Association for Computational Linguistics, 2021 (article)

We study the problem of domain adaptation in Neural Machine Translation (NMT) when domain-specific data cannot be shared due to confidentiality or copyright issues. As a first step, we propose to fragment data into phrase pairs and use a random sample to fine-tune a generic NMT model instead of the full sentences. Despite the loss of long segments for the sake of confidentiality protection, we find that NMT quality can considerably benefit from this adaptation, and that further gains can be obtained with a simple tagging technique.
