لقد ثبت أن التدريبات متعددة المهام مع المهام الإضافية يمكن أن تحسن جودة المهمة المستهدفة من خلال نقل المهام العابر.ومع ذلك، من المحتمل أن تكون أهمية كل مهمة مساعدة للمهمة الأساسية غير معروفة مسبقا.في حين أن أهمية الأثقال ذات المهام الإضافية يمكن ضبطها يدويا، إلا أنها تصبح عمليا غير قابلة للتنفيذ مع عدد المهام.لمعالجة هذا، نقترح طريقة بحث تقوم تلقائيا بتعيين الأوزان الأهمية.نقوم بصياغة ذلك كمشكلة تعليمية للتعزيز وتعلم جدول أخذ عينات من المهام بناء على دقة تقييم النموذج متعدد المهام.يوضح تقييمنا التجريبي على XNLI والغراء أن أسلوبنا تتفوق على أخذ العينات الموحدة والساعي الأساسي المهمة الموحدة المقابلة.
It has been shown that training multi-task models with auxiliary tasks can improve the target task quality through cross-task transfer. However, the importance of each auxiliary task to the primary task is likely not known a priori. While the importance weights of auxiliary tasks can be manually tuned, it becomes practically infeasible with the number of tasks scaling up. To address this, we propose a search method that automatically assigns importance weights. We formulate it as a reinforcement learning problem and learn a task sampling schedule based on the evaluation accuracy of the multi-task model. Our empirical evaluation on XNLI and GLUE shows that our method outperforms uniform sampling and the corresponding single-task baseline.
References used
https://aclanthology.org/
We present our entry into the 2021 3C Shared Task Citation Context Classification based on Purpose competition. The goal of the competition is to classify a citation in a scientific article based on its purpose. This task is important because it coul
This paper explores the effect of using multitask learning for abstractive summarization in the context of small training corpora. In particular, we incorporate four different tasks (extractive summarization, language modeling, concept detection, and
The Semantic Verbal Fluency Task (SVF) is an efficient and minimally invasive speech-based screening tool for Mild Cognitive Impairment (MCI). In the SVF, testees have to produce as many words for a given semantic category as possible within 60 secon
Darknet market forums are frequently used to exchange illegal goods and services between parties who use encryption to conceal their identities. The Tor network is used to host these markets, which guarantees additional anonymization from IP and loca
Deep reinforcement learning has shown great potential in training dialogue policies. However, its favorable performance comes at the cost of many rounds of interaction. Most of the existing dialogue policy methods rely on a single learning system, wh