ترغب بنشر مسار تعليمي؟ اضغط هنا

171 - Yifan Zhao , Jiawei Zhao , Jia Li 2021
Conventional RGB-D salient object detection methods aim to leverage depth as complementary information to find the salient regions in both modalities. However, the salient object detection results heavily rely on the quality of captured depth data wh ich sometimes are unavailable. In this work, we make the first attempt to solve the RGB-D salient object detection problem with a novel depth-awareness framework. This framework only relies on RGB data in the testing phase, utilizing captured depth data as supervision for representation learning. To construct our framework as well as achieving accurate salient detection results, we propose a Ubiquitous Target Awareness (UTA) network to solve three important challenges in RGB-D SOD task: 1) a depth awareness module to excavate depth information and to mine ambiguous regions via adaptive depth-error weights, 2) a spatial-aware cross-modal interaction and a channel-aware cross-level interaction, exploiting the low-level boundary cues and amplifying high-level salient channels, and 3) a gated multi-scale predictor module to perceive the object saliency in different contextual scales. Besides its high performance, our proposed UTA network is depth-free for inference and runs in real-time with 43 FPS. Experimental evidence demonstrates that our proposed network not only surpasses the state-of-the-art methods on five public RGB-D SOD benchmarks by a large margin, but also verifies its extensibility on five public RGB SOD benchmarks.
In this paper, we propose to align sentence representations from different languages into a unified embedding space, where semantic similarities (both cross-lingual and monolingual) can be computed with a simple dot product. Pre-trained language mode ls are fine-tuned with the translation ranking task. Existing work (Feng et al., 2020) uses sentences within the same batch as negatives, which can suffer from the issue of easy negatives. We adapt MoCo (He et al., 2020) to further improve the quality of alignment. As the experimental results show, the sentence representations produced by our model achieve the new state-of-the-art on several tasks, including Tatoeba en-zh similarity search (Artetxe and Schwenk, 2019b), BUCC en-zh bitext mining, and semantic textual similarity on 7 datasets.
Virtual 3D try-on can provide an intuitive and realistic view for online shopping and has a huge potential commercial value. However, existing 3D virtual try-on methods mainly rely on annotated 3D human shapes and garment templates, which hinders the ir applications in practical scenarios. 2D virtual try-on approaches provide a faster alternative to manipulate clothed humans, but lack the rich and realistic 3D representation. In this paper, we propose a novel Monocular-to-3D Virtual Try-On Network (M3D-VTON) that builds on the merits of both 2D and 3D approaches. By integrating 2D information efficiently and learning a mapping that lifts the 2D representation to 3D, we make the first attempt to reconstruct a 3D try-on mesh only taking the target clothing and a person image as inputs. The proposed M3D-VTON includes three modules: 1) The Monocular Prediction Module (MPM) that estimates an initial full-body depth map and accomplishes 2D clothes-person alignment through a novel two-stage warping procedure; 2) The Depth Refinement Module (DRM) that refines the initial body depth to produce more detailed pleat and face characteristics; 3) The Texture Fusion Module (TFM) that fuses the warped clothing with the non-target body part to refine the results. We also construct a high-quality synthesized Monocular-to-3D virtual try-on dataset, in which each person image is associated with a front and a back depth map. Extensive experiments demonstrate that the proposed M3D-VTON can manipulate and reconstruct the 3D human body wearing the given clothing with compelling details and is more efficient than other 3D approaches.
We study the problem of estimating a functional or a parameter in the context where outcome is subject to nonignorable missingness. We completely avoid modeling the regression relation, while allowing the propensity to be modeled by a semiparametric logistic relation where the dependence on covariates is unspecified. We discover a surprising phenomenon in that the estimation of the parameter in the propensity model as well as the functional estimation can be carried out without assessing the missingness dependence on covariates. This allows us to propose a general class of estimators for both model parameter estimation and functional estimation, including estimating the outcome mean. The robustness of the estimators are nonstandard and are established rigorously through theoretical derivations, and are supported by simulations and a data application.
This work is motivated by learning the individualized minimal clinically important difference, a vital concept to assess clinical importance in various biomedical studies. We formulate the scientific question into a high-dimensional statistical probl em where the parameter of interest lies in an individualized linear threshold. The goal of this paper is to develop a hypothesis testing procedure for the significance of a single element in this high-dimensional parameter as well as for the significance of a linear combination of this parameter. The difficulty dues to the high-dimensionality of the nuisance component in developing such a testing procedure, and also stems from the fact that this high-dimensional threshold model is nonregular and the limiting distribution of the corresponding estimator is nonstandard. To deal with these challenges, we construct a test statistic via a new bias corrected smoothed decorrelated score approach, and establish its asymptotic distributions under both the null and local alternative hypotheses. In addition, we propose a double-smoothing approach to select the optimal bandwidth parameter in our test statistic and provide theoretical guarantees for the selected bandwidth. We conduct comprehensive simulation studies to demonstrate how our proposed procedure can be applied in empirical studies. Finally, we apply the proposed method to a clinical trial where the scientific goal is to assess the clinical importance of a surgery procedure.
Recently, about five hundred fast radio bursts (FRBs) detected by CHIME/FRB Project have been reported. The vast amounts of data would make FRBs a promising low-redshift cosmological probe in the forthcoming years, and thus the issue of how many FRBs are needed for precise cosmological parameter estimation in different dark energy models should be detailedly investigated. Different from the usually considered $w(z)$-parameterized models in the literature, in this work we investigate the holographic dark energy (HDE) model and the Ricci dark energy (RDE) model, which originate from the holographic principle of quantum gravity, using the simulated localized FRB data as a cosmological probe for the first time. We show that the Hubble constant $H_0$ can be constrained to about 2% precision in the HDE model with the Macquart relation of FRB by using 10000 accurately-localized FRBs combined with the current CMB data, which is similar to the precision of the SH0ES value. Using 10000 localized FRBs combined with the CMB data can achieve about 6% constraint on the dark-energy parameter $c$ in the HDE model, which is tighter than the current BAO data combined with CMB. We also study the combination of the FRB data and another low-redshift cosmological probe, i.e. gravitational wave (GW) standard siren data, with the purpose of measuring cosmological parameters independent of CMB. Although the parameter degeneracies inherent in FRB and in GW are rather different, we find that more than 10000 FRBs are demanded to effectively improve the constraints in the holographic dark energy models.
Despite recent progress on image-based virtual try-on, current methods are constraint by shared warping networks and thus fail to synthesize natural try-on results when faced with clothing categories that require different warping operations. In this paper, we address this problem by finding clothing category-specific warping networks for the virtual try-on task via Neural Architecture Search (NAS). We introduce a NAS-Warping Module and elaborately design a bilevel hierarchical search space to identify the optimal network-level and operation-level flow estimation architecture. Given the network-level search space, containing different numbers of warping blocks, and the operation-level search space with different convolution operations, we jointly learn a combination of repeatable warping cells and convolution operations specifically for the clothing-person alignment. Moreover, a NAS-Fusion Module is proposed to synthesize more natural final try-on results, which is realized by leveraging particular skip connections to produce better-fused features that are required for seamlessly fusing the warped clothing and the unchanged person part. We adopt an efficient and stable one-shot searching strategy to search the above two modules. Extensive experiments demonstrate that our WAS-VTON significantly outperforms the previous fixed-architecture try-on methods with more natural warping results and virtual try-on results.
121 - Xiaolei Zhang , Wei Zhao 2021
In this paper, we introduce and study the class of $phi$-$w$-flat modules which are generalizations of both $phi$-flat modules and $w$-flat modules. The $phi$-$w$-weak global dimension $phi$-$w$-w.gl.dim$(R)$ of a commutative ring $R$ is also introdu ced and studied. We show that, for a $phi$-ring $R$, $phi$-$w$-w.gl.dim$(R)=0$ if and only if $w$-$dim(R)=0$ if and only if $R$ is a $phi$-von Neumann ring. It is also proved that, for a strongly $phi$-ring $R$, $phi$-$w$-w.gl.dim$(R)leq 1$ if and only if each nonnil ideal of $R$ is $phi$-$w$-flat, if and only if $R$ is a $phi$-PvMR, if and only if $R$ is a PvMR.
102 - Jue Jiang , Weiwei Zhao , Fei Wang 2021
When a ferromagnet is placed in contact with a superconductor, owing to incompatible spin order, the Cooper pairs from the superconductor cannot survive more than one or two nanometers inside the ferromagnet. This is confirmed in the measurements of ferromagnetic nickel (Ni) nanowires contacted by superconducting niobium (Nb) leads. However, when a thin copper (Cu) buffer layer (3 nm, oxidized due to exposure to air) is inserted between the Nb electrodes and the Ni wire, the spatial extent of the superconducting proximity range is dramatically increased from 2 to a few tens of nanometers. Scanning transmission electron microscope images verify the existence of Cu oxides and the magnetization measurements of such a 3 nm oxidized Cu film on a SiO2/Si substrate and on Nb/SiO2/Si show evidence of ferromagnetism. One way to understand the long-range proximity effect in the Ni nanowire is that the oxidized Cu buffer layer with ferromagnetism facilitates the conversion of singlet superconductivity in Nb into triplet supercurrent along the Ni nanowires.
The case difference heuristic (CDH) approach is a knowledge-light method for learning case adaptation knowledge from the case base of a case-based reasoning system. Given a pair of cases, the CDH approach attributes the difference in their solutions to the difference in the problems they solve, and generates adaptation rules to adjust solutions accordingly when a retrieved case and new query have similar problem differences. As an alternative to learning adaptation rules, several researchers have applied neural networks to learn to predict solution differences from problem differences. Previous work on such approaches has assumed that the feature set describing problems is predefined. This paper investigates a two-phase process combining deep learning for feature extraction and neural network based adaptation learning from extracted features. Its performance is demonstrated in a regression task on an image data: predicting age given the image of a face. Results show that the combined process can successfully learn adaptation knowledge applicable to nonsymbolic differences in cases. The CBR system achieves slightly lower performance overall than a baseline deep network regressor, but better performance than the baseline on novel queries.
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا