في هذه الورقة، نتعلم مشكلة توليد النصوص الدقيقة بميزانية حسابية محدودة.لذلك، نستخدم هندسة شبكة مصممة ذات أداء أداء جيدا (GAN) - Gan التي ترويج التنوع (DPGAN)، وحاول استبدال قطرة LSTM بطبقة محول ذاتية انتباهي من أجل الرافعة الماليةكفاءتهم.تم تقييم DPGan الناتج عن النفس (SADPGAN) للأداء والجودة والتنوع للنص والاستقرار الناتج.تشير التجارب الحاسوبية إلى أن بنية محول غير قادرة على الاسترجاع في استبدال طبقة LSTM، ضمان الأداء أثناء مرحلة التدريب المسبق وتخضع لانهيار الوضع الكامل أثناء مرحلة ضبط GAN.تشير نتائجنا إلى أن الهندسة المعمارية المحول تحتاج إلى تكييفها قبل أن يتم استخدامها كإعداد لاستبدال RNNS في قانع إنشاء النصوص.
In this paper we address the problem of fine-tuned text generation with a limited computational budget. For that, we use a well-performing text generative adversarial network (GAN) architecture - Diversity-Promoting GAN (DPGAN), and attempted a drop-in replacement of the LSTM layer with a self-attention-based Transformer layer in order to leverage their efficiency. The resulting Self-Attention DPGAN (SADPGAN) was evaluated for performance, quality and diversity of generated text and stability. Computational experiments suggested that a transformer architecture is unable to drop-in replace the LSTM layer, under-performing during the pre-training phase and undergoing a complete mode collapse during the GAN tuning phase. Our results suggest that the transformer architecture need to be adapted before it can be used as a replacement for RNNs in text-generating GANs.
References used
https://aclanthology.org/
Vector representations have become a central element in semantic language modelling, leading to mathematical overlaps with many fields including quantum theory. Compositionality is a core goal for such representations: given representations for wet'
This muscle has
been used to design a robot arm similar to that of the humerus and
forearm, which can attract large objects with a weight of up to 500
N, which is equivalent to a professional bodybuilder raising his
weight.
The research aims to optimize the investment in solar cooling process using two
models of vessels (clay- mineral).The study was conducted at the site of Tartous in the
month (the fourth - fifth - sixth) years (2013) and that the fruits of the tomat
In this paper, we focus on the detection of sexist hate speech against women in tweets studying for the first time the impact of gender stereotype detection on sexism classification. We propose: (1) the first dataset annotated for gender stereotype d
Despite the recent advances in applying pre-trained language models to generate high-quality texts, generating long passages that maintain long-range coherence is yet challenging for these models. In this paper, we propose DiscoDVT, a discourse-aware