ترغب بنشر مسار تعليمي؟ اضغط هنا

104 - Zhe Lin , Yitao Cai , Xiaojun Wan 2021
Paraphrase generation is an important task in natural language processing. Previous works focus on sentence-level paraphrase generation, while ignoring document-level paraphrase generation, which is a more challenging and valuable task. In this paper , we explore the task of document-level paraphrase generation for the first time and focus on the inter-sentence diversity by considering sentence rewriting and reordering. We propose CoRPG (Coherence Relationship guided Paraphrase Generation), which leverages graph GRU to encode the coherence relationship graph and get the coherence-aware representation for each sentence, which can be used for re-arranging the multiple (possibly modified) input sentences. We create a pseudo document-level paraphrase dataset for training CoRPG. Automatic evaluation results show CoRPG outperforms several strong baseline models on the BERTScore and diversity scores. Human evaluation also shows our model can generate document paraphrase with more diversity and semantic preservation.
Annotated images are required for both supervised model training and evaluation in image classification. Manually annotating images is arduous and expensive, especially for multi-labeled images. A recent trend for conducting such laboursome annotatio n tasks is through crowdsourcing, where images are annotated by volunteers or paid workers online (e.g., workers of Amazon Mechanical Turk) from scratch. However, the quality of crowdsourcing image annotations cannot be guaranteed, and incompleteness and incorrectness are two major concerns for crowdsourcing annotations. To address such concerns, we have a rethinking of crowdsourcing annotations: Our simple hypothesis is that if the annotators only partially annotate multi-label images with salient labels they are confident in, there will be fewer annotation errors and annotators will spend less time on uncertain labels. As a pleasant surprise, with the same annotation budget, we show a multi-label image classifier supervised by images with salient annotations can outperform models supervised by fully annotated images. Our method contributions are 2-fold: An active learning way is proposed to acquire salient labels for multi-label images; and a novel Adaptive Temperature Associated Model (ATAM) specifically using partial annotations is proposed for multi-label image classification. We conduct experiments on practical crowdsourcing data, the Open Street Map (OSM) dataset and benchmark dataset COCO 2014. When compared with state-of-the-art classification methods trained on fully annotated images, the proposed ATAM can achieve higher accuracy. The proposed idea is promising for crowdsourcing data annotation. Our code will be publicly available.
Most publicly available datasets for image classification are with single labels, while images are inherently multi-labeled in our daily life. Such an annotation gap makes many pre-trained single-label classification models fail in practical scenario s. This annotation issue is more concerned for aerial images: Aerial data collected from sensors naturally cover a relatively large land area with multiple labels, while annotated aerial datasets, which are publicly available (e.g., UCM, AID), are single-labeled. As manually annotating multi-label aerial images would be time/labor-consuming, we propose a novel self-correction integrated domain adaptation (SCIDA) method for automatic multi-label learning. SCIDA is weakly supervised, i.e., automatically learning the multi-label image classification model from using massive, publicly available single-label images. To achieve this goal, we propose a novel Label-Wise self-Correction (LWC) module to better explore underlying label correlations. This module also makes the unsupervised domain adaptation (UDA) from single- to multi-label data possible. For model training, the proposed model only uses single-label information yet requires no prior knowledge of multi-labeled data; and it predicts labels for multi-label aerial images. In our experiments, trained with single-labeled MAI-AID-s and MAI-UCM-s datasets, the proposed model is tested directly on our collected Multi-scene Aerial Image (MAI) dataset.
Inspired by the discovery of a doubly charmed tetraquark state $T_{cc}^+$ by the LHCb Collaboration and based on the prediction of a doubly charmed $DD^{ast}$ molecule with $I(J^{P})=0(1^{+})$ in our recent works, [Phys.Rev.D 102 (2020), 091502] and [Phys.Rev.D 99 (2019), 094018], we employ the effective Lagrangian approach to investigate the decay width of $T_{cc}to D Dpi$ and $T_{cc}to DDgamma$. We show that both the $T_{cc}to D Dpi$ and $T_{cc}to DDgamma$ modes contribute to the decay width of $T_{cc}$, with the former playing a more important role.However, within the $DD^*$ molecule picture, the obtained decay width is rather small compared with the experimental value of $Gamma=410pm175$ keV. We argue that the existence of a compact tetraquark component cannot lead to an appreciable increase of the decay width. As a result, the discrepancy may indicate that either $T_{cc}$ is more likely a near threshold $DD^*$ resonance or the decay width is close to the lower experimental boundary.
The three pentaquark states, $P_{c}(4312)$, $P_{c}(4440)$, and $P_{c}(4457)$, discovered by the LHCb Collaboration in 2019, can be nicely arranged into a multiplet of $bar{D}^{(ast)}Sigma_{c}^{(ast)}$ of seven molecules dictated by heavy quark spin s ymmetry. In this work we employ the effective Lagrangian approach to investigate the two decay modes of $P_{c}(4457)$, $P_{c}(4457) to P_{c}(4312) pi$ and $P_{c}(4457) to P_{c}(4312) gamma$, via the triangle mechanism, assuming that $P_{c}(4457)$ and $P_{c}(4312)$ are $bar{D}^{ast}Sigma_{c}$ and $bar{D}Sigma_{c}$ bound states but the spin of $P_{c}(4457)$ can be either 1/2 or 3/2. Our results show that the spin of $P_{c}(4457)$ can not be discriminated through these two decay modes. The decay widths of $P_{c}(4457) to P_{c}(4312) pi$ and $P_{c}(4457) to P_{c}(4312) gamma$ are estimated to be of order of 100 keV and 1 keV, respectively. The ratio of the partial decay widths of $P_{c}(4457) to P_{c}(4312) pi$ to $P_{c}(4457) to P_{c}(4312) gamma$ is similar to the ratio of $D^{ast}to Dpi$ to $D^{ast}to Dgamma$, which could be used to check the molecular nature of $P_{c}(4457)$ and $P_{c}(4312)$ if they can be observed in the future.
114 - Khoi Pham , Kushal Kafle , Zhe Lin 2021
Visual attributes constitute a large portion of information contained in a scene. Objects can be described using a wide variety of attributes which portray their visual appearance (color, texture), geometry (shape, size, posture), and other intrinsic properties (state, action). Existing work is mostly limited to study of attribute prediction in specific domains. In this paper, we introduce a large-scale in-the-wild visual attribute prediction dataset consisting of over 927K attribute annotations for over 260K object instances. Formally, object attribute prediction is a multi-label classification problem where all attributes that apply to an object must be predicted. Our dataset poses significant challenges to existing methods due to large number of attributes, label sparsity, data imbalance, and object occlusion. To this end, we propose several techniques that systematically tackle these challenges, including a base model that utilizes both low- and high-level CNN features with multi-hop attention, reweighting and resampling techniques, a novel negative label expansion scheme, and a novel supervised attribute-aware contrastive learning algorithm. Using these techniques, we achieve near 3.7 mAP and 5.7 overall F1 points improvement over the current state of the art. Further details about the VAW dataset can be found at http://vawdataset.com/.
80 - Yitao Cai , Zhe Lin , Xiaojun Wan 2021
Abstract Meaning Representation (AMR) is a rooted, labeled, acyclic graph representing the semantics of natural language. As previous works show, although AMR is designed for English at first, it can also represent semantics in other languages. Howev er, they find that concepts in their predicted AMR graphs are less specific. We argue that the misprediction of concepts is due to the high relevance between English tokens and AMR concepts. In this work, we introduce bilingual input, namely the translated texts as well as non-English texts, in order to enable the model to predict more accurate concepts. Besides, we also introduce an auxiliary task, requiring the decoder to predict the English sequences at the same time. The auxiliary task can help the decoder understand what exactly the corresponding English tokens are. Our proposed cross-lingual AMR parser surpasses previous state-of-the-art parser by 10.6 points on Smatch F1 score. The ablation study also demonstrates the efficacy of our proposed modules.
Natural convection in porous media is a fundamental process for the long-term storage of CO2 in deep saline aquifers. Typically, details of mass transfer in porous media are inferred from the numerical solution of the volume-averaged Darcy-Oberbeck-B oussinesq (DOB) equations, even though these equations do not account for the microscopic properties of a porous medium. According to the DOB equations, natural convection in a porous medium is uniquely determined by the Rayleigh number. However, in contrast with experiments, DOB simulations yield a linear scaling of the Sherwood number with the Rayleigh number (Ra) for high values of Ra (Ra>>1,300). Here, we perform Direct Numerical Simulations (DNS), fully resolving the flow field within the pores. We show that the boundary layer thickness is determined by the pore size instead of the Rayleigh number, as previously assumed. The mega- and proto- plume sizes increase with the pore size. Our DNS results exhibit a nonlinear scaling of the Sherwood number at high porosity, and for the same Rayleigh number, higher Sherwood numbers are predicted by DNS at lower porosities. It can be concluded that the scaling of the Sherwood number depends on the porosity and the pore-scale parameters, which is consistent with experimental studies.
208 - Xin Yuan , Zhe Lin , Jason Kuen 2021
We develop an approach to learning visual representations that embraces multimodal data, driven by a combination of intra- and inter-modal similarity preservation objectives. Unlike existing visual pre-training methods, which solve a proxy prediction task in a single domain, our method exploits intrinsic data properties within each modality and semantic information from cross-modal correlation simultaneously, hence improving the quality of learned visual representations. By including multimodal training in a unified framework with different types of contrastive losses, our method can learn more powerful and generic visual features. We first train our model on COCO and evaluate the learned visual representations on various downstream tasks including image classification, object detection, and instance segmentation. For example, the visual representations pre-trained on COCO by our method achieve state-of-the-art top-1 validation accuracy of $55.3%$ on ImageNet classification, under the common transfer protocol. We also evaluate our method on the large-scale Stock images dataset and show its effectiveness on multi-label image tagging, and cross-modal retrieval tasks.
From the amplitude analysis of the $D^+_s to pi^+ pi^0 eta$ decay, the BESIII Collaboration firstly observed the $D^+_s to a_0(980)^+pi^0$ and $D^+_s to a_0(980)^0pi^+$ decay modes, which are expected to occur through the pure $W$-annihilation proces ses. The measured branching fraction $mathcal{B}[D_{s}^{+}to a_{0}(980)^{+(0)}pi^{0(+)},a_{0}(980)^{+(0)}to pi^{+(0)}eta]$ is, however, found to be larger than those of known $W$-annihilation decays by one order of magnitude. This apparent contradiction can be reconciled if the two decays are induced by internal $W$-conversion or external $W$-emission mechanisms instead of $W$-annihilation mechanism. In this work, we propose that the $D^+_s$ decay proceeds via both the external and internal $W$-emission instead of $W$-annihilation mechanisms. In such a scenario, we perform a study of the $D^+_s to pi^+pi^0eta$ decay by taking into account the contributions from the tree diagram $D^+_s to rho^+ eta to pi^+ pi^0 eta$ and the intermediate $rho^+ eta$ and $K^*bar{K}/Kbar{K}^*$ triangle diagrams. The intermediate $a_0(980)$ state can be dynamically generated from the final state interactions of coupled $K bar{K}$ and $pi eta$ channels, and it is shown that the experimental data can be described fairly well, which supports the interpretation of $a_0(980)$ as a molecular state.
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا