
408 - Yi Zhu , Vidur Raj , Ziyuan Li 2021
Highly sensitive photodetectors capable of single-photon-level detection are key components of a range of emerging technologies, in particular the ever-growing fields of optical communication, remote sensing, and quantum computing. Currently, most single-photon detection technologies require external biasing at high voltages and/or cooling to low temperatures, which greatly limits wider application. Here, we demonstrate InP nanowire array photodetectors that achieve single-photon-level light detection at room temperature without an external bias. We use top-down etched, heavily doped p-type InP nanowires and an n-type AZO/ZnO carrier-selective contact to form a radial p-n junction with a built-in electric field exceeding 3×10^5 V/cm at 0 V. The device exhibits broadband light sensitivity and can distinguish a single photon per pulse from the dark noise at 0 V, enabled by a design that realizes near-ideal broadband absorption, extremely low dark current, and highly efficient charge carrier separation. Meanwhile, the bandwidth of the device exceeds 600 MHz, with a timing jitter of 538 ps. The proposed device design provides a new pathway towards low-cost, high-sensitivity, self-powered photodetectors for numerous future applications.
Penalized likelihood models are widely used to simultaneously select variables and estimate model parameters. However, the existence of weak signals can lead to inaccurate variable selection, biased parameter estimation, and invalid inference. Thus, identifying weak signals accurately and making valid inferences are crucial in penalized likelihood models. In this paper, we develop a unified approach to identify weak signals and make inferences in penalized likelihood models, including the special case when the responses are categorical. To identify weak signals, we utilize the estimated selection probability of each covariate as a measure of signal strength and formulate a signal identification criterion. To construct confidence intervals, we adopt a two-step inference procedure. Extensive simulation studies show that the proposed two-step inference procedure outperforms several existing methods. We illustrate the proposed method with an application to the Practice Fusion diabetes dataset.
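The idea of using an estimated selection probability as a measure of signal strength can be illustrated with a small sketch. The abstract does not say how the probability is estimated, so everything below is an assumption made for illustration: a lasso-penalized linear model fitted by coordinate descent, bootstrap resampling to estimate how often each covariate is selected, and the hypothetical names `lasso_cd` and `selection_probabilities`. It is not the authors' procedure.

```python
import random


def soft_threshold(z, lam):
    """Soft-thresholding operator used by the lasso coordinate update."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_cd(X, y, lam, n_iter=50):
    """Minimize (1/2n)||y - Xb||^2 + lam*||b||_1 by coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # current residual y - X @ beta
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of column j with the partial residual (beta_j removed)
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new = soft_threshold(rho, lam) / col_sq[j]
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new
    return beta


def selection_probabilities(X, y, lam, n_boot=50, seed=0):
    """Fraction of bootstrap resamples in which each covariate is selected."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    counts = [0] * p
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        beta = lasso_cd([X[i] for i in idx], [y[i] for i in idx], lam)
        for j in range(p):
            if beta[j] != 0.0:
                counts[j] += 1
    return [c / n_boot for c in counts]


# Synthetic data (assumed): one strong signal, one weak signal, one noise covariate
data_rng = random.Random(1)
X = [[data_rng.gauss(0, 1) for _ in range(3)] for _ in range(100)]
y = [2.0 * row[0] + 0.15 * row[1] + data_rng.gauss(0, 1) for row in X]

probs = selection_probabilities(X, y, lam=0.1)
print(probs)
```

In a sketch like this, a covariate selected in nearly every resample would be read as a strong signal and one selected only part of the time as a candidate weak signal; the cut-offs for that classification are what a signal identification criterion would have to specify.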
117 - Li Wang , Li Zhang , Yi Zhu 2021
Recognizing and localizing objects in 3D space is a crucial ability for an AI agent to perceive its surrounding environment. While significant progress has been achieved with expensive LiDAR point clouds, 3D object detection from only a monocular image remains a great challenge. Different alternatives exist for tackling this problem, but they either rely on heavy networks to fuse RGB and depth information or are empirically ineffective at processing millions of pseudo-LiDAR points. After in-depth examination, we find that these limitations are rooted in inaccurate object localization. In this paper, we propose a novel and lightweight approach, dubbed Progressive Coordinate Transforms (PCT), to facilitate learning coordinate representations. Specifically, a localization boosting mechanism with a confidence-aware loss is introduced to progressively refine the localization prediction. In addition, semantic image representation is exploited to compensate for the use of patch proposals. Despite being lightweight and simple, our strategy leads to substantial improvements on the KITTI and Waymo Open Dataset monocular 3D detection benchmarks. At the same time, PCT generalizes well to most coordinate-based 3D detection frameworks. The code is available at: https://github.com/amazon-research/progressive-coordinate-transforms .
398 - Haofei Kuang , Yi Zhu , Zhi Zhang 2021
Contrastive learning has revolutionized the field of self-supervised image representation learning and has recently been adapted to the video domain. One of its greatest advantages is that it allows us to flexibly define powerful loss objectives, as long as we can find a reasonable way to formulate positive and negative samples to contrast. However, existing approaches rely heavily on short-range spatiotemporal salience to form clip-level contrastive signals, which prevents them from using global context. In this paper, we propose a new video-level contrastive learning method that uses segments to formulate positive pairs. Our formulation is able to capture global context in a video and is therefore robust to temporal content change. We also incorporate a temporal order regularization term to enforce the inherent sequential structure of videos. Extensive experiments show that our video-level contrastive learning framework (VCLR) outperforms the previous state of the art on five video datasets for downstream action classification, action localization, and video retrieval. Code is available at https://github.com/amazon-research/video-contrastive-learning.
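To make "formulating positive and negative samples to contrast" concrete, here is a minimal sketch of an InfoNCE-style contrastive loss, a common choice in this line of work. The abstract does not state which loss VCLR uses, so the loss form, the temperature value, and the function name `info_nce` are assumptions made for illustration, not the paper's implementation.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def normalize(v):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Cross-entropy of picking the positive among positive + negatives."""
    a = normalize(anchor)
    # Cosine similarities scaled by temperature; the positive is logit 0
    logits = [dot(a, normalize(positive)) / temperature]
    logits += [dot(a, normalize(n)) / temperature for n in negatives]
    # Numerically stable log-sum-exp
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum - logits[0]


# Toy 2-D embeddings (assumed): a well-aligned positive versus a misaligned one
aligned = info_nce([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0], [-1.0, 0.0]])
misaligned = info_nce([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0], [-1.0, 0.0]])
print(aligned, misaligned)
```

The loss is small when the anchor is closer to its positive than to any negative and large otherwise. In a video-level setting, the anchor and positive embeddings would come from the same video and the negatives from other videos.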
86 - Fangrui Zhu , Yi Zhu , Li Zhang 2021
Semantic segmentation is a challenging problem due to difficulties in modeling context in complex scenes and class confusions along boundaries. Most of the literature focuses on either context modeling or boundary refinement, an approach that generalizes less well to open-world scenarios. In this work, we advocate a unified framework (UN-EPT) that segments objects by considering both context information and boundary artifacts. We first adapt a sparse sampling strategy to incorporate the transformer-based attention mechanism for efficient context modeling. In addition, a separate spatial branch is introduced to capture image details for boundary refinement. The whole model can be trained end to end. We demonstrate promising performance on three popular semantic segmentation benchmarks with a low memory footprint. Code will be released soon.
Language instruction plays an essential role in natural language grounded navigation tasks. However, navigators trained with limited human-annotated instructions may have difficulty accurately capturing key information from a complicated instruction at different timesteps, leading to poor navigation performance. In this paper, we aim to train a more robust navigator that can dynamically extract crucial factors from a long instruction by using an adversarial attacking paradigm. Specifically, we propose a Dynamic Reinforced Instruction Attacker (DR-Attacker), which learns to mislead the navigator into moving to the wrong target by destroying the most instructive information in instructions at different timesteps. By formulating perturbation generation as a Markov Decision Process, DR-Attacker is optimized with a reinforcement learning algorithm to generate perturbed instructions sequentially during navigation, according to a learnable attack score. The perturbed instructions, which serve as hard samples, are then used to improve the robustness of the navigator through an effective adversarial training strategy and an auxiliary self-supervised reasoning task. Experimental results on both Vision-and-Language Navigation (VLN) and Navigation from Dialog History (NDH) tasks show the superiority of our proposed method over state-of-the-art methods. Moreover, visualization analysis shows the effectiveness of the proposed DR-Attacker, which can successfully attack crucial information in the instructions at different timesteps. Code is available at https://github.com/expectorlin/DR-Attacker.
Navigation is one of the fundamental capabilities of an autonomous robot, and long-term navigation guided by semantic instructions is a "holy grail" goal of intelligent robotics. Advances in 3D simulation technology provide large-scale data for simulating real-world environments, and deep learning has proven able to robustly learn various embodied navigation tasks. However, deep learning for embodied navigation is still in its infancy because of the unique challenges of navigation exploration and of learning from partially observed visual input. Recently, the area has been thriving, with numerous methods proposed to tackle its different challenges. To point to promising directions for future research, this paper presents a comprehensive review of embodied navigation tasks and recent progress in deep-learning-based methods. It covers two major tasks: target-oriented navigation and instruction-oriented navigation.
Membranes are present in all cells and tissues. Mathematical models of cells and tissues need a compact mathematical description of membranes with a resolution of about 1 nm. Membranes isolate cells because ions have difficulty penetrating the dielectric barrier they create. Here we introduce a dielectric mathematical membrane condition to replace a condition that did not include dielectric properties. Our mathematical membrane condition includes a dielectric lipid bilayer punctured by channels that conduct ions selectively.
Long linear carbon chains have been attracting intense interest because of their predicted remarkable properties and their potential applications in future nanotechnology. Here we comprehensively interrogate the excitonic transitions and the associated relaxation dynamics of nanotube-confined long linear carbon chains using steady-state and time-resolved Raman spectroscopies. The exciton relaxation dynamics of the confined carbon chains occurs on a timescale of hundreds of picoseconds, in strong contrast to the host dynamics, which occurs on a timescale of a few picoseconds. A prominent time-resolved Raman response is observed over a broad energy range extending from 1.2 to 2.8 eV, which includes the strong Raman resonance region around 2.2 eV. The dynamics at high excitation energies shows strong coupling between the chain and the nanotube host, providing clear evidence of efficient energy transfer from the host carbon nanotube to the chain. Our experimental study presents the first characterization of the exciton dynamics of long linear carbon chains, providing indispensable knowledge for understanding the interactions between different carbon allotropes.
44 - Xueru Wang , Junyi Zhu 2021
The coupled nonlocal NLS equation is studied by virtue of the $2\times2$ Dbar-problem. Two spectral transform matrices are introduced to define two associated Dbar-problems. The relations between the coupled nonlocal NLS potential and the solution of the Dbar-problem are constructed. The spatial transform method is extended to obtain the coupled nonlocal NLS equation and its conservation laws. The general nonlocal reduction of the coupled nonlocal NLS equation to the nonlocal NLS equation is discussed in detail. The explicit solutions are derived.