أوراق بحثية, رسائل ماجستير ودكتوراه منشورة من قبل Yong Luo

Modeling Accurate Human Activity Recognition for Embedded Devices Using Multi-level Distillation

156 - Runze Chen , Haiyong Luo , Fang Zhao 2021

Human Activity Recognition (HAR) based on IMU sensors is a crucial area in ubiquitous computing. Because of the trend of deploying AI on IoT devices or smartphones, more researchers are designing different HAR models for embedded devices. Deployment of models in embedded devices can help enhance the efficiency of HAR. We propose a multi-level HAR modeling pipeline called Stage-Logits-Memory Distillation (SMLDist) for constructing deep convolutional HAR models with embedded hardware support. SMLDist includes stage distillation, memory distillation, and logits distillation. Stage distillation constrains the learning direction of the intermediate features. The teacher model teaches the student models how to explain and store the inner relationship among high-dimensional features based on Hopfield networks in memory distillation. Logits distillation builds logits distilled by a smoothed conditional rule to preserve the probability distribution and enhance the softer target accuracy. We compare the accuracy, F1 macro score, and energy cost on embedded platforms of a MobileNet V3 model built by SMLDist with various state-of-the-art HAR frameworks. The product model has a good balance with robustness and efficiency. SMLDist can also compress models with a minor performance loss at an equal compression ratio to other advanced knowledge distillation methods on seven public datasets.

التعلم الآلي الذكاء الاصطناعي تفاعل الإنسان والحاسوب

Lvio-Fusion: A Self-adaptive Multi-sensor Fusion SLAM Framework Using Actor-critic Method

120 - Yupeng Jia , Haiyong Luo , Fang Zhao 2021

State estimation with sensors is essential for mobile robots. Due to different performance of sensors in different environments, how to fuse measurements of various sensors is a problem. In this paper, we propose a tightly coupled multi-sensor fusion framework, Lvio-Fusion, which fuses stereo camera, Lidar, IMU, and GPS based on the graph optimization. Especially for urban traffic scenes, we introduce a segmented global pose graph optimization with GPS and loop-closure, which can eliminate accumulated drifts. Additionally, we creatively use a actor-critic method in reinforcement learning to adaptively adjust sensors weight. After training, actor-critic agent can provide the system better and dynamic sensors weight. We evaluate the performance of our system on public datasets and compare it with other state-of-the-art methods, which shows that the proposed method achieves high estimation accuracy and robustness to various environments. And our implementations are open source and highly scalable.

علم الروبوتات

On $n$-dimensional complete self-similar solutions to the mean curvature flow in $mathbb{R}^{n+1}$ with nonnegative constant scalar curvature

182 - Yong Luo , Linlin Sun , Jiabin Yin 2021

As is well known, self-similar solutions to the mean curvature flow, including self-shrinkers, translating solitons and self-expanders, arise naturally in the singularity analysis of the mean curvature flow. Recently, Guo cite{Guo} proved that $n$-di mensional compact self-shrinkers in $mathbb{R}^{n+1}$ with scalar curvature bounded from above or below by some constant are isometric to the round sphere $mathbb{S}^n(sqrt{n})$, which implies that $n$-dimensional compact self-shrinkers in $mathbb{R}^{n+1}$ with constant scalar curvature are isometric to the round sphere $mathbb{S}^n(sqrt{n})$(see also cite{Hui1}). Complete classifications of $n$-dimensional translating solitons in $mathbb{R}^{n+1}$ with nonnegative constant scalar curvature and of $n$-dimensional self-expanders in $mathbb{R}^{n+1}$ with nonnegative constant scalar curvature were given by Mart{i}n, Savas-Halilaj and Smoczykcite{MSS} and Ancari and Chengcite{AC}, respectively. In this paper we give complete classifications of $n$-dimensional complete self-shrinkers in $mathbb{R}^{n+1}$ with nonnegative constant scalar curvature. We will also give alternative proofs of the classification theorems due to Mart{i}n, Savas-Halilaj and Smoczyk cite{MSS} and Ancari and Chengcite{AC}.

الهندسة التفاضلية

Towards understanding the power of quantum kernels in the NISQ era

127 - Xinbiao Wang , Yuxuan Du , Yong Luo 2021

A key problem in the field of quantum computing is understanding whether quantum machine learning (QML) models implemented on noisy intermediate-scale quantum (NISQ) machines can achieve quantum advantages. Recently, Huang et al. [Nat Commun 12, 2631 ] partially answered this question by the lens of quantum kernel learning. Namely, they exhibited that quantum kernels can learn specific datasets with lower generalization error over the optimal classical kernel methods. However, most of their results are established on the ideal setting and ignore the caveats of near-term quantum machines. To this end, a crucial open question is: does the power of quantum kernels still hold under the NISQ setting? In this study, we fill this knowledge gap by exploiting the power of quantum kernels when the quantum system noise and sample error are considered. Concretely, we first prove that the advantage of quantum kernels is vanished for large size of datasets, few number of measurements, and large system noise. With the aim of preserving the superiority of quantum kernels in the NISQ era, we further devise an effective method via indefinite kernel learning. Numerical simulations accord with our theoretical results. Our work provides theoretical guidance of exploring advanced quantum kernels to attain quantum advantages on NISQ devices.

فيزياء الكم التعلم الآلي

Rigidity theorems for minimal Lagrangian surfaces with Legendrian capillary boundary

88 - Yong Luo , Linlin Sun 2020

In this note, we study minimal Lagrangian surfaces in $mathbb{B}^4$ with Legendrian capillary boundary on $mathbb{S}^3$. On the one hand, we prove that any minimal Lagrangian surface in $mathbb{B}^4$ with Legendrian free boundary on $mathbb{S}^3$ mus t be an equatorial plane disk. One the other hand, we show that any annulus type minimal Lagrangian surface in $mathbb{B}^4$ with Legendrian capillary boundary on $mathbb{S}^3$ must be congruent to one of the Lagrangian catenoids. These results confirm the conjecture proposed by Li, Wang and Weng (Sci. China Math., 2020).

الهندسة التفاضلية

Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning

279 - Huaizheng Zhang , Yong Luo , Qiming Ai 2019

Given the massive market of advertising and the sharply increasing online multimedia content (such as videos), it is now fashionable to promote advertisements (ads) together with the multimedia content. It is exhausted to find relevant ads to match t he provided content manually, and hence, some automatic advertising techniques are developed. Since ads are usually hard to understand only according to its visual appearance due to the contained visual metaphor, some other modalities, such as the contained texts, should be exploited for understanding. To further improve user experience, it is necessary to understand both the topic and sentiment of the ads. This motivates us to develop a novel deep multimodal multitask framework to integrate multiple modalities to achieve effective topic and sentiment prediction simultaneously for ads understanding. In particular, our model first extracts multimodal information from ads and learn high-level and comparable representations. The visual metaphor of the ad is decoded in an unsupervised manner. The obtained representations are then fed into the proposed hierarchical multimodal attention modules to learn task-specific representations for final prediction. A multitask loss function is also designed to train both the topic and sentiment prediction models jointly in an end-to-end manner. We conduct extensive experiments on the latest and large advertisement dataset and achieve state-of-the-art performance for both prediction tasks. The obtained results could be utilized as a benchmark for ads understanding.

الوسائط المتعددة التعلم الآلي

Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm

58 - Yihang Lou , Ling-Yu Duan , Yong Luo 2019

The digital retina in smart cities is to select what the City Eye tells the City Brain, and convert the acquired visual data from front-end visual sensors to features in an intelligent sensing manner. By deploying deep learning and/or handcrafted mod els in front-end devices, the compact features can be extracted and subsequently delivered to back-end cloud for search and advanced analytics. In this context, we propose a model generation, utilization, and communication paradigm, aiming to address a set of unique challenges for better artificial intelligence services in smart cities. In particular, we present an integrated multiple deep learning models reuse and prediction strategy, which greatly increases the feasibility of the digital retina in processing and analyzing the large-scale visual data in smart cities. The promise of the proposed paradigm is demonstrated through a set of experiments.

الرؤية الحاسوبية وتمييز الأنماط

Large Margin Multi-modal Multi-task Feature Extraction for Image Classification

70 - Yong Luo , Yonggang Wen , Dacheng Tao 2019

The features used in many image analysis-based applications are frequently of very high dimension. Feature extraction offers several advantages in high-dimensional cases, and many recent studies have used multi-task feature extraction approaches, whi ch often outperform single-task feature extraction approaches. However, most of these methods are limited in that they only consider data represented by a single type of feature, even though features usually represent images from multiple modalities. We therefore propose a novel large margin multi-modal multi-task feature extraction (LM3FE) framework for handling multi-modal features for image classification. In particular, LM3FE simultaneously learns the feature extraction matrix for each modality and the modality combination coefficients. In this way, LM3FE not only handles correlated and noisy features, but also utilizes the complementarity of different modalities to further help reduce feature redundancy in each modality. The large margin principle employed also helps to extract strongly predictive features so that they are more suitable for prediction (e.g., classification). An alternating algorithm is developed for problem optimization and each sub-problem can be efficiently solved. Experiments on two challenging real-world image datasets demonstrate the effectiveness and superiority of the proposed method.

التعلم الالي الرؤية الحاسوبية وتمييز الأنماط التعلم الآلي

ResumeNet: A Learning-based Framework for Automatic Resume Quality Assessment

87 - Yong Luo , Huaizheng Zhang , Yongjie Wang 2018

Recruitment of appropriate people for certain positions is critical for any companies or organizations. Manually screening to select appropriate candidates from large amounts of resumes can be exhausted and time-consuming. However, there is no public tool that can be directly used for automatic resume quality assessment (RQA). This motivates us to develop a method for automatic RQA. Since there is also no public dataset for model training and evaluation, we build a dataset for RQA by collecting around 10K resumes, which are provided by a private resume management company. By investigating the dataset, we identify some factors or features that could be useful to discriminate good resumes from bad ones, e.g., the consistency between different parts of a resume. Then a neural-network model is designed to predict the quality of each resume, where some text processing techniques are incorporated. To deal with the label deficiency issue in the dataset, we propose several variants of the model by either utilizing the pair/triplet-based loss, or introducing some semi-supervised learning technique to make use of the abundant unlabeled data. Both the presented baseline model and its variants are general and easy to implement. Various popular criteria including the receiver operating characteristic (ROC) curve, F-measure and ranking-based average precision (AP) are adopted for model evaluation. We compare the different variants with our baseline model. Since there is no public algorithm for RQA, we further compare our results with those obtained from a website that can score a resume. Experimental results in terms of different criteria demonstrate the effectiveness of the proposed method. We foresee that our approach would transform the way of future human resources management.

استرجاع المعلومات التعلم الآلي

Some remarks on bi-f-harmonic maps and f-biharmonic maps

98 - Yong Luo , Ye-Lin Ou 2018

In this paper, we prove that the class of bi-f-harmonic maps and that of f-biharmonic maps from a conformal manifold of dimension not equal to 2 are the same (Theorem 1.1). We also give several results on nonexistence of proper bi-f-harmonic maps and f-biharmonic maps from complete Riemannian manifolds into nonpositively curved Riemannian manifolds. These include: any bi-f-harmonic map from a compact manifold into a non-positively curved manifold is f-harmonic (Theorem 1.6), and any f-biharmonic (respectively, bi-f-harmonic) map with bounded f and bounded f-bienrgy (respectively, bi-f-energy) from a complete Riemannian manifold into a manifold of strictly negative curvature has rank < 2 everywhere (Theorems 2.2 and 2.3).

الهندسة التفاضلية

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد