Decontextualization: Making Sentences Stand-Alone

99 0 0.0 ( 0 )

تحميل البحث استخدام كمرجع

نشر من قبل Eunsol Choi

تاريخ النشر 2021

مجال البحث الهندسة المعلوماتية

والبحث باللغة English

تأليف Eunsol Choi - Jennimaria Palomaki - Matthew Lamm

الحساب واللغة الذكاء الاصطناعي

قم بزيارة صفحتنا على فيسبوك

‎Shamra Academia - شمرا أكاديميا‎

اسأل ChatGPT حول البحث

الملخص بالعربية الملخص بالإنكليزية

Models for question answering, dialogue agents, and summarization often interpret the meaning of a sentence in a rich context and use that meaning in a new context. Taking excerpts of text can be problematic, as key pieces may not be explicit in a local window. We isolate and define the problem of sentence decontextualization: taking a sentence together with its context and rewriting it to be interpretable out of context, while preserving its meaning. We describe an annotation procedure, collect data on the Wikipedia corpus, and use the data to train models to automatically decontextualize sentences. We present preliminary studies that show the value of sentence decontextualization in a user facing task, and as preprocessing for systems that perform document understanding. We argue that decontextualization is an important subtask in many downstream applications, and that the definitions and resources provided can benefit tasks that operate on sentences that occur in a richer context.

قيم البحث

101 - Xiangci Li , Hairong Liu , Liang Huang 2020

Existing natural language processing systems are vulnerable to noisy inputs resulting from misspellings. On the contrary, humans can easily infer the corresponding correct words from their misspellings and surrounding context. Inspired by this, we ad dress the stand-alone spelling correction problem, which only corrects the spelling of each token without additional token insertion or deletion, by utilizing both spelling information and global context representations. We present a simple yet powerful solution that jointly detects and corrects misspellings as a sequence labeling task by fine-turning a pre-trained language model. Our solution outperforms the previous state-of-the-art result by 12.8% absolute F0.5 score.

الحساب واللغة

Stand-Alone Self-Attention in Vision Models

220 - Prajit Ramachandran , Niki Parmar , Ashish Vaswani 2019

Convolutions are a fundamental building block of modern computer vision systems. Recent approaches have argued for going beyond convolutions in order to capture long-range dependencies. These efforts focus on augmenting convolutional models with cont ent-based interactions, such as self-attention and non-local means, to achieve gains on a number of vision tasks. The natural question that arises is whether attention can be a stand-alone primitive for vision models instead of serving as just an augmentation on top of convolutions. In developing and testing a pure self-attention vision model, we verify that self-attention can indeed be an effective stand-alone layer. A simple procedure of replacing all instances of spatial convolutions with a form of self-attention applied to ResNet model produces a fully self-attentional model that outperforms the baseline on ImageNet classification with 12% fewer FLOPS and 29% fewer parameters. On COCO object detection, a pure self-attention model matches the mAP of a baseline RetinaNet while having 39% fewer FLOPS and 34% fewer parameters. Detailed ablation studies demonstrate that self-attention is especially impactful when used in later layers. These results establish that stand-alone self-attention is an important addition to the vision practitioners toolbox.

الرؤية الحاسوبية وتمييز الأنماط

A stand-alone fiber-coupled single-photon source

89 - Alexander Schlehahn , Sarah Fischbach , Ronny Schmidt 2017

In this work, we present a stand-alone and fiber-coupled quantum-light source. The plug-and-play device is based on an optically driven quantum dot delivering single photons via an optical fiber. The quantum dot is deterministically integrated in a m onolithic microlens which is precisely coupled to the core of an optical fiber via active optical alignment and epoxide adhesive bonding. The rigidly coupled fiber-emitter assembly is integrated in a compact Stirling cryocooler with a base temperature of 35 K. We benchmark our practical quantum device via photon auto-correlation measurements revealing $g^{(2)}(0)=0.07 pm 0.05$ under continuous-wave excitation and we demonstrate triggered non-classical light at a repetition rate of 80 MHz. The long-term stability of our quantum light source is evaluated by endurance tests showing that the fiber-coupled quantum dot emission is stable within 4% over several successive cool-down/warm-up cycles. Additionally, we demonstrate non-classical photon emission for a user-intervention-free 100-hour test run and stable single-photon count rates up to 11.7 kHz with a standard deviation of 4%.

الفيزياء ميسكالي وننكالي أجهزة الكشف الفيزيائية فيزياء الكم

Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation

149 - Huiyu Wang , Yukun Zhu , Bradley Green 2020

Convolution exploits locality for efficiency at a cost of missing long range context. Self-attention has been adopted to augment CNNs with non-local interactions. Recent works prove it possible to stack self-attention layers to obtain a fully attenti onal network by restricting the attention to a local region. In this paper, we attempt to remove this constraint by factorizing 2D self-attention into two 1D self-attentions. This reduces computation complexity and allows performing attention within a larger or even global region. In companion, we also propose a position-sensitive self-attention design. Combining both yields our position-sensitive axial-attention layer, a novel building block that one could stack to form axial-attention models for image classification and dense prediction. We demonstrate the effectiveness of our model on four large-scale datasets. In particular, our model outperforms all existing stand-alone self-attention models on ImageNet. Our Axial-DeepLab improves 2.8% PQ over bottom-up state-of-the-art on COCO test-dev. This previous state-of-the-art is attained by our small variant that is 3.8x parameter-efficient and 27x computation-efficient. Axial-DeepLab also achieves state-of-the-art results on Mapillary Vistas and Cityscapes.

الرؤية الحاسوبية وتمييز الأنماط التعلم الآلي

ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences

99 - Yanjun Gao , Ting-hao Huang , Rebecca J. Passonneau 2021

Atomic clauses are fundamental text units for understanding complex sentences. Identifying the atomic sentences within complex sentences is important for applications such as summarization, argument mining, discourse analysis, discourse parsing, and question answering. Previous work mainly relies on rule-based methods dependent on parsing. We propose a new task to decompose each complex sentence into simple sentences derived from the tensed clauses in the source, and a novel problem formulation as a graph edit task. Our neural model learns to Accept, Break, Copy or Drop elements of a graph that combines word adjacency and grammatical dependencies. The full processing pipeline includes modules for graph construction, graph editing, and sentence generation from the output graph. We introduce DeSSE, a new dataset designed to train and evaluate complex sentence decomposition, and MinWiki, a subset of MinWikiSplit. ABCD achieves comparable performance as two parsing baselines on MinWiki. On DeSSE, which has a more even balance of complex sentence types, our model achieves higher accuracy on the number of atomic sentences than an encoder-decoder baseline. Results include a detailed error analysis.

الحساب واللغة الذكاء الاصطناعي التعلم الآلي