غالبا ما يكون اختلاف الفرد في أسلوب الكتابة وظيفة من السمات الاجتماعية والشخصية. في حين أن التباين الاجتماعي المنظم قد درس على نطاق واسع، مثل التباين القائم على النوع الاجتماعي، فإن أقل بكثير معروف حول كيفية وصف الأساليب الفردية بسبب طبيعتها الخصوصية. نقدم نهجا جديدا لدراسة idiolects من خلال مقارنة هائلة للمؤلف عبر المؤلف لتحديد وترميز الميزات الأسلوبية. يحقق النموذج العصبي الأداء القوي في تحديد التأليف على النصوص القصيرة ومن خلال مهمة التحقيق القائم على التشبيه، يظهر أن التمثيلات المستفادة تظهر منتديات مفاجئة ترميز التحولات النوعية والكمية من الأساليب القطرية. من خلال اضطراب النص، نحدد المساهمات النسبية للعناصر اللغوية المختلفة على التباين الاضطراب. علاوة على ذلك، فإننا نقدم وصفا ل idiolects من خلال قياس الاختلاف بين المؤلفين و interra، مما يدل على أن الاختلاف في idiolects غالبا ما يكون مميزا بعد متسقة.
An individual's variation in writing style is often a function of both social and personal attributes. While structured social variation has been extensively studied, e.g., gender based variation, far less is known about how to characterize individual styles due to their idiosyncratic nature. We introduce a new approach to studying idiolects through a massive cross-author comparison to identify and encode stylistic features. The neural model achieves strong performance at authorship identification on short texts and through an analogy-based probing task, showing that the learned representations exhibit surprising regularities that encode qualitative and quantitative shifts of idiolectal styles. Through text perturbation, we quantify the relative contributions of different linguistic elements to idiolectal variation. Furthermore, we provide a description of idiolects through measuring inter- and intra-author variation, showing that variation in idiolects is often distinctive yet consistent.
References used
https://aclanthology.org/
In recent years, world business in online discussions and opinion sharing on social media is booming. Re-entry prediction task is thus proposed to help people keep track of the discussions which they wish to continue. Nevertheless, existing works onl
Adaptive Machine Translation purports to dynamically include user feedback to improve translation quality. In a post-editing scenario, user corrections of machine translation output are thus continuously incorporated into translation models, reducing
We use dialogue act recognition (DAR) to investigate how well BERT represents utterances in dialogue, and how fine-tuning and large-scale pre-training contribute to its performance. We find that while both the standard BERT pre-training and pretraini
Online platforms and communities establish their own norms that govern what behavior is acceptable within the community. Substantial effort in NLP has focused on identifying unacceptable behaviors and, recently, on forecasting them before they occur.
People utilize online forums to either look for information or to contribute it. Because of their growing popularity, certain online forums have been created specifically to provide support, assistance, and opinions for people suffering from mental i