النماذج العصبية تتفوق في استخراج الأنماط الإحصائية من كميات كبيرة من البيانات، ولكن الكفاح لتعلم أنماط أو سبب حول اللغة من بعض الأمثلة فقط.في هذه الورقة، نسأل: هل يمكننا أن نتعلم قواعد واضحة تعميم بئر من بعض الأمثلة فقط؟نستكشف هذا السؤال باستخدام تخليق البرنامج.نقوم بتطوير نموذج توليف لتعلم قواعد علم الأصوات كبرامج في لغة خاصة بالمجال.نحن نختبر قدرة نماذجنا على التعميم من بعض الأمثلة التدريبية باستخدام مجموعة بياناتنا الجديدة من مشاكل اللغويات أولمبياد، وهي مجموعة صعبة من المهام التي تتطلب قدرة التفكير اللغوي القوي.بالإضافة إلى كونها كفاءة عالية، فإن نهجنا يولد برامج قابلة للقراءة البشرية، وتسمح بالتحكم في تعميم البرامج المستفادة.
Neural models excel at extracting statistical patterns from large amounts of data, but struggle to learn patterns or reason about language from only a few examples. In this paper, we ask: Can we learn explicit rules that generalize well from only a few examples? We explore this question using program synthesis. We develop a synthesis model to learn phonology rules as programs in a domain-specific language. We test the ability of our models to generalize from few training examples using our new dataset of problems from the Linguistics Olympiad, a challenging set of tasks that require strong linguistic reasoning ability. In addition to being highly sample-efficient, our approach generates human-readable programs, and allows control over the generalizability of the learnt programs.
References used
https://aclanthology.org/
Neural dialog models are known to suffer from problems such as generating unsafe and inconsistent responses. Even though these problems are crucial and prevalent, they are mostly manually identified by model designers through interactions. Recently,
We participated in all tracks of the WMT 2021 efficient machine translation task: single-core CPU, multi-core CPU, and GPU hardware with throughput and latency conditions. Our submissions combine several efficiency strategies: knowledge distillation,
Today, news media organizations regularly engage with readers by enabling them to comment on news articles. This creates the need for comment moderation and removal of disallowed comments -- a time-consuming task often performed by human moderators.
This work investigates the value of augmenting recurrent neural networks with feature engineering for the Second Nuanced Arabic Dialect Identification (NADI) Subtask 1.2: Country-level DA identification. We compare the performance of a simple word-le
People convey their intention and attitude through linguistic styles of the text that they write. In this study, we investigate lexicon usages across styles throughout two lenses: human perception and machine word importance, since words differ in th