أوراق بحثية, رسائل ماجستير ودكتوراه منشورة من قبل Yuan Ren

DeepExpress: Heterogeneous and Coupled Sequence Modeling for Express Delivery Prediction

75 - Siyuan Ren , Bin Guo , Longbing Cao 2021

The prediction of express delivery sequence, i.e., modeling and estimating the volumes of daily incoming and outgoing parcels for delivery, is critical for online business, logistics, and positive customer experience, and specifically for resource al location optimization and promotional activity arrangement. A precise estimate of consumer delivery requests has to involve sequential factors such as shopping behaviors, weather conditions, events, business campaigns, and their couplings. Besides, conventional sequence prediction assumes a stable sequence evolution, failing to address complex nonlinear sequences and various feature effects in the above multi-source data. Although deep networks and attention mechanisms demonstrate the potential of complex sequence modeling, extant networks ignore the heterogeneous and coupling situation between features and sequences, resulting in weak prediction accuracy. To address these issues, we propose DeepExpress - a deep-learning based express delivery sequence prediction model, which extends the classic seq2seq framework to learning complex coupling between sequence and features. DeepExpress leverages an express delivery seq2seq learning, a carefully-designed heterogeneous feature representation, and a novel joint training attention mechanism to adaptively map heterogeneous data, and capture sequence-feature coupling for precise estimation. Experimental results on real-world data demonstrate that the proposed method outperforms both shallow and deep baseline models.

التعلم الآلي الذكاء الاصطناعي

Efficient Human Pose Estimation by Maximizing Fusion and High-Level Spatial Attention

101 - Zhiyuan Ren , Yaohai Zhou , Yizhe Chen 2021

In this paper, we propose an efficient human pose estimation network -- SFM (slender fusion model) by fusing multi-level features and adding lightweight attention blocks -- HSA (High-Level Spatial Attention). Many existing methods on efficient networ k have already taken feature fusion into consideration, which largely boosts the performance. However, its performance is far inferior to large network such as ResNet and HRNet due to its limited fusion operation in the network. Specifically, we expand the number of fusion operation by building bridges between two pyramid frameworks without adding layers. Meanwhile, to capture long-range dependency, we propose a lightweight attention block -- HSA, which computes second-order attention map. In summary, SFM maximizes the number of feature fusion in a limited number of layers. HSA learns high precise spatial information by computing the attention of spatial attention map. With the help of SFM and HSA, our network is able to generate multi-level feature and extract precise global spatial information with little computing resource. Thus, our method achieve comparable or even better accuracy with less parameters and computational cost. Our SFM achieve 89.0 in [email protected], 42.0 in [email protected] on MPII validation set and 71.7 in AP, 90.7 in [email protected] on COCO validation with only 1.7G FLOPs and 1.5M parameters. The source code will be public soon.

الرؤية الحاسوبية وتمييز الأنماط

Measuring optical vortices by means of dual shearing-type Sagnac interferometers

75 - Hantao Wang , Huajun Zhang , Mingyuan Ren 2021

Measuring the positions of optical vortices is an essential part in the researches of speckles and adaptive optics. The measurement accuracy is restricted by the performance of optical devices and the properties of optical vortices, such as density a nd size. In order to achieve high accuracy and wide range of application, the dual shearing-type Sagnac interferometers is proposed using two shearing plates to adjust the precision of optical vortices measurement. The shearing displacements are able to balance the measuring precision and the value of the intensity ratio point to provide optimum measurement performance. This method is useful for the observation of optical vortices with different sizes and densities, especially for the high density condition.

بصريات

Convergent Filaments Contracting Towards an Intermediate-mass Prestellar Core

69 - Zhiyuan Ren , Lei Zhu , Hui Shi 2021

Filamentary structures are closely associated with star-forming cores, but their detailed physical connections are still not clear. We studied the dense gas in the region of OMC-3 MMS-7 in Orion A molecular cloud using the molecular lines observed wi th the Atacama Large Millimeter/submillimeter Array (ALMA) and the Submillimeter Array (SMA). The ALMA N$_2$H$^+$ (1-0) emission has revealed three dense filaments intersected at the center, coincident with the central core MMS-7, which has a mass of $3.6,M_odot$. The filaments and cores are embedded in a parental clump with total mass of $29,M_odot$. The N$_2$H$^+$ velocity field exhibits a noticeable increasing trend along the filaments towards the central core MMS-7 with a scale of $v-v_{rm lsr} simeq 1.5$ ${rm km, s^{-1}}$ over a spatial range of $sim$20 arcsec ($8times 10^3$ AU), corresponding to a gradient of $40,{rm km, s^{-1}},{rm pc}^{-1}$. This feature is most likely to indicate an infall motion towards the center. The derived infall rate ($8times 10^{-5},M_odot$ year$^{-1}$) and timescale ($3.6times 10^5$ years) are much lower than that in a spherical free-fall collapse and more consistent with the contraction of filament structures. The filaments also exhibit a possible fragmentation, but it does not seem to largely interrupt the gas structure or the infall motion towards the center. MMS-7 thus provides an example of filamentary infall into an individual prestellar core. The filament contraction could be less intense but more steady than the global spherical collapse, and may help generate an intermediate- or even high-mass star.

الفيزياء الفلكية من المجرات

S3Net: 3D LiDAR Sparse Semantic Segmentation Network

360 - Ran Cheng , Ryan Razani , Yuan Ren 2021

Semantic Segmentation is a crucial component in the perception systems of many applications, such as robotics and autonomous driving that rely on accurate environmental perception and understanding. In literature, several approaches are introduced to attempt LiDAR semantic segmentation task, such as projection-based (range-view or birds-eye-view), and voxel-based approaches. However, they either abandon the valuable 3D topology and geometric relations and suffer from information loss introduced in the projection process or are inefficient. Therefore, there is a need for accurate models capable of processing the 3D driving-scene point cloud in 3D space. In this paper, we propose S3Net, a novel convolutional neural network for LiDAR point cloud semantic segmentation. It adopts an encoder-decoder backbone that consists of Sparse Intra-channel Attention Module (SIntraAM), and Sparse Inter-channel Attention Module (SInterAM) to emphasize the fine details of both within each feature map and among nearby feature maps. To extract the global contexts in deeper layers, we introduce Sparse Residual Tower based upon sparse convolution that suits varying sparsity of LiDAR point cloud. In addition, geo-aware anisotrophic loss is leveraged to emphasize the semantic boundaries and penalize the noise within each predicted regions, leading to a robust prediction. Our experimental results show that the proposed method leads to a large improvement (12%) compared to its baseline counterpart (MinkNet42 cite{choy20194d}) on SemanticKITTI cite{DBLP:conf/iccv/BehleyGMQBSG19} test set and achieves state-of-the-art mIoU accuracy of semantic segmentation approaches.

الرؤية الحاسوبية وتمييز الأنماط

Quantum Confinement of Bloch Waves and Related Problems

109 - Shang Yuan Ren 2021

The quantum confinement of Bloch waves is fundamentally different from the well-known quantum confinement of plane waves. Unlike that obtained in the latter are all stationary states only; in the former, there is always a new type of states -- the bo undary dependent states. This distinction leads to interesting physics in low-dimensional systems.

علم المواد

Deep Template Matching for Pedestrian Attribute Recognition with the Auxiliary Supervision of Attribute-wise Keypoints

108 - Jiajun Zhang , Pengyuan Ren , Jianmin Li 2020

Pedestrian Attribute Recognition (PAR) has aroused extensive attention due to its important role in video surveillance scenarios. In most cases, the existence of a particular attribute is strongly related to a partial region. Recent works design comp licated modules, e.g., attention mechanism and proposal of body parts to localize the attribute corresponding region. These works further prove that localization of attribute specific regions precisely will help in improving performance. However, these part-information-based methods are still not accurate as well as increasing model complexity which makes it hard to deploy on realistic applications. In this paper, we propose a Deep Template Matching based method to capture body parts features with less computation. Further, we also proposed an auxiliary supervision method that use human pose keypoints to guide the learning toward discriminative local cues. Extensive experiments show that the proposed method outperforms and has lower computational complexity, compared with the state-of-the-art approaches on large-scale pedestrian attribute datasets, including PETA, PA-100K, RAP, and RAPv2 zs.

الرؤية الحاسوبية وتمييز الأنماط الذكاء الاصطناعي التعلم الآلي

Analysis and Optimization of Service Delay for Multi-quality Videos in Multi-tier Heterogeneous Network with Random Caching

81 - Xuewei Zhang , Tiejun Lv , Yuan Ren 2020

Aiming to minimize service delay, we propose a new random caching scheme in device-to-device (D2D)-assisted heterogeneous network. To support diversified viewing qualities of multimedia video services, each video file is encoded into a base layer (BL ) and multiple enhancement layers (ELs) by scalable video coding (SVC). A super layer, including the BL and several ELs, is transmitted to every user. We define and quantify the service delay of multi-quality videos by deriving successful transmission probabilities when a user is served by a D2D helper, a small-cell base station (SBS) and a macro-cell base station (MBS). We formulate a delay minimization problem subject to the limited cache sizes of D2D helpers and SBSs. The structure of the optimal solutions to the problem is revealed, and then an improved standard gradient projection method is designed to effectively obtain the solutions. Both theoretical analysis and Monte-Carlo simulations validate the successful transmission probabilities. Compared with three benchmark caching policies, the proposed SVC-based random caching scheme is superior in terms of reducing the service delay.

بنية الشبكات والإنترنت الحساب واللغة

Filament Intersections and Cold Dense Cores in Orion A North

264 - Chao Zhang , Zhiyuan Ren , Jingwen Wu 2020

We studied the filament structures and dense cores in OMC-2,3 region in Orion A North molecular cloud using the high-resolution N2H+ (1-0) spectral cube observed with the Atacama Large Millimeter/Submillimeter Array (ALMA). The filament network over a total length of 2 pc is found to contain 170 intersections and 128 candidate dense cores. The dense cores are all displaced from the infrared point sources (possible young stars), and the major fraction of cores (103) are located around the intersections. Towards the intersections, there is also an increasing trend for the total column density Ntot as well as the the power-law index of the column-density Probability Distribution Function (N-PDF), suggesting that the intersections would in general have more significant gas assembly than the other part of the filament paths. The virial analysis shows that the dense cores mostly have virial mass ratio of alpha_vir=M_vir/M_gas<1.0, suggesting them to be bounded by the self gravity. In the mean time, only about 23 percent of the cores have critical mass ratio of alpha_crit=M_crit/M_gas<1.0, suggesting them to be unstable against core collapse. Combining these results, it shows that the major fraction of the cold starless and possible prestellar cores in OMC-2,3 are being assembled around the intersections, and currently in a gravitationally bound state. But more extensive core collapse and star formation may still require continuous core-mass growth or other perturbatio

الفيزياء الفلكية من المجرات الفيزياء الفلكية الشمسية والنجوم

Economical Caching for Scalable Videos in Cache-enabled Heterogeneous Networks

91 - Xuewei Zhang , Tiejun Lv , Yuan Ren 2019

We develop the optimal economical caching schemes in cache-enabled heterogeneous networks, while delivering multimedia video services with personalized viewing qualities to mobile users. By applying scalable video coding (SVC), each video file to be requested is divided into one base layer (BL) and several enhancement layers (ELs). In order to assign different transmission tasks, the serving small-cell base stations (SBSs) are grouped into K clusters. The SBSs are able to cache and cooperatively transmit BL and EL contents to the user. We analytically derive the expressions for successful transmission probability and ergodic service rate, and then the closed form of EConomical Efficiency (ECE) is obtained. In order to enhance the ECE performance, we formulate the ECE optimization problems for two cases. In the first case, with equal cache size equipped at each SBS, the layer caching indicator is determined. Since this problem is NP-hard, after the l0-norm approximation, the discrete optimization variables are relaxed to be continuous, and this relaxed problem is convex. Next, based on the optimal solution derived from the relaxed problem, we devise a greedystrategy based heuristic algorithm to achieve the near-optimal layer caching indicators. In the second case, the cache size for each SBS, the layer size and the layer caching indicator are jointly optimized. This problem is a mixed integer programming problem, which is more challenging. To effectively solve this problem, the original ECE maximization problem is divided into two subproblems. These two subproblems are iteratively solved until the original optimization problem is convergent. Numerical results verify the correctness of theoretical derivations. Additionally, compared to the most popular layer placement strategy, the performance superiority of the proposed SVC-based caching schemes is testified.

الوسائط المتعددة بنية الشبكات والإنترنت

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد