For various engineering and industrial applications, it is desirable to realize mechanical systems with broadly adjustable elasticity that respond flexibly to the external environment. Here we discover a topology-correlated transition between affine and non-affine regimes of elasticity in both two- and three-dimensional packing-derived networks. Based on this transition, we numerically design and experimentally realize multifunctional systems with adjustable elasticity. Within one system, we achieve a solid-like affine response, a liquid-like non-affine response, and continuous tunability in between. Moreover, the system exhibits a broadly tunable Poisson's ratio, from positive to negative values, which is of practical interest for energy absorption and for fracture-resistant materials. Our study reveals a fundamental connection between elasticity and network topology, and demonstrates its practical potential for designing mechanical systems and metamaterials.
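
For readers unfamiliar with the sign convention behind "positive to negative values", the standard textbook definition of Poisson's ratio (general background, not specific to this paper) is:

```latex
% Standard definition (textbook material, not specific to this work):
% under a small axial strain, Poisson's ratio is the negative ratio of
% the induced transverse strain to the applied axial strain,
\[
  \nu \;=\; -\,\frac{\varepsilon_{\mathrm{trans}}}{\varepsilon_{\mathrm{axial}}},
\]
% so \nu > 0 means the sample thins when stretched, while \nu < 0
% (auxetic behaviour) means it expands laterally when stretched.
```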
There is a significant expansion in both the volume and range of applications, along with a concomitant increase in the variety of data sources. These ever-expanding trends have highlighted the necessity for more versatile analysis tools that offer greater opportunities for algorithmic development and computationally faster operations than the standard flat-view matrix approach. Tensors, or multi-way arrays, provide such an algebraic framework, naturally suited to data of such large volume, diversity, and veracity. Indeed, the associated tensor decompositions have demonstrated their potential in breaking the Curse of Dimensionality associated with traditional matrix methods, where a necessary exponential increase in data volume leads to adverse or even intractable consequences for computational complexity. A key tool underpinning the multi-linear manipulation of tensors and tensor networks is the standard Tensor Contraction Product (TCP). However, depending on the dimensionality of the underlying tensors, the TCP also comes at the price of high computational complexity. In this work, we resort to diagrammatic tensor network manipulation to calculate such products in an efficient and computationally tractable manner, by making use of the Tensor Train decomposition (TTD). This renders the underlying concepts easy to perceive, thereby enhancing intuition about the associated operations while preserving mathematical rigour. In addition to bypassing cumbersome multi-linear expressions, the proposed Tensor Train Contraction Product model is shown to significantly accelerate the underlying computations, as its cost is independent of tensor order and linear in the tensor dimension, in contrast to the standard approach, whose cost is exponential in tensor order.
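
As a flavour of why TT-format contractions avoid forming the full tensors, here is a minimal NumPy sketch of a TT contraction for the special case of an inner product between two tensors held as TT cores. The core shapes and ranks are illustrative assumptions, not taken from the paper; note that each `einsum` sweep costs linearly in the mode size, and the exponential-cost reconstruction appears only for validation.

```python
import numpy as np

def tt_inner(cores_a, cores_b):
    """Inner product of two tensors given by their TT cores.

    Each core has shape (r_prev, n_k, r_next); boundary ranks are 1.
    The full tensors are never formed: one sweep over the cores suffices.
    """
    v = np.ones((1, 1))
    for G, H in zip(cores_a, cores_b):
        # v[a, b] * G[a, n, c] * H[b, n, d] -> v'[c, d]
        v = np.einsum('ab,anc,bnd->cd', v, G, H)
    return v[0, 0]

def tt_full(cores):
    """Reconstruct the full tensor (exponential cost; validation only)."""
    t = cores[0]
    for c in cores[1:]:
        t = np.einsum('...a,anb->...nb', t, c)
    return t[0, ..., 0]

rng = np.random.default_rng(0)
shapes = [(1, 4, 3), (3, 4, 3), (3, 4, 1)]   # order-3 tensor, TT ranks (1,3,3,1)
A = [rng.standard_normal(s) for s in shapes]
B = [rng.standard_normal(s) for s in shapes]
assert np.isclose(tt_inner(A, B), np.sum(tt_full(A) * tt_full(B)))
```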
Facial expression recognition (FER) has received increasing interest in computer vision. We propose the TransFER model, which can learn rich relation-aware local representations. It mainly consists of three components: Multi-Attention Dropping (MAD), ViT-FER, and Multi-head Self-Attention Dropping (MSAD). First, local patches play an important role in distinguishing various expressions; however, few existing works can locate discriminative and diverse local patches. This can cause serious problems when some patches are invisible due to pose variations or viewpoint changes. To address this issue, MAD is proposed to randomly drop an attention map. Consequently, models are pushed to explore diverse local patches adaptively. Second, to build rich relations between different local patches, Vision Transformers (ViT) are used in FER, called ViT-FER. Since the global scope is used to reinforce each local patch, a better representation is obtained, boosting FER performance. Third, multi-head self-attention allows ViT to jointly attend to features from different information subspaces at different positions. Given no explicit guidance, however, multiple self-attention heads may extract similar relations. To address this, MSAD is proposed to randomly drop one self-attention module. As a result, models are forced to learn rich relations among diverse local patches. Our proposed TransFER model outperforms state-of-the-art methods on several FER benchmarks, showing its effectiveness and usefulness.
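
A minimal PyTorch sketch of the attention-dropping idea as we read it: during training, an entire self-attention branch is occasionally zeroed so the residual path and the remaining branches must carry complementary information. The module granularity, drop rate, and dimensions are our assumptions, not the authors' released code.

```python
import torch
import torch.nn as nn

class DroppableSelfAttention(nn.Module):
    """Self-attention block whose output can be stochastically dropped
    during training (MSAD-style), pushing other blocks to learn
    complementary relations."""
    def __init__(self, dim, num_heads, p_drop=0.1):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.p_drop = p_drop

    def forward(self, x):
        out, _ = self.attn(x, x, x)
        if self.training and torch.rand(()) < self.p_drop:
            out = torch.zeros_like(out)   # drop this attention branch entirely
        return x + out                    # residual keeps gradients flowing

x = torch.randn(2, 49, 64)                # (batch, patches, embedding dim)
block = DroppableSelfAttention(64, num_heads=8)
print(block(x).shape)                     # torch.Size([2, 49, 64])
```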
Acoustic emission (AE) characterization is an effective technique to indirectly capture the progressive failure process of brittle rock. In previous studies, both experiments and numerical simulations were adopted to investigate the AE characteristics of brittle rock. However, the moment tensor model (MTM), the most popular numerical model, does not reproduce the monitoring and analysis manner of AE signals in physical experiments; consequently, its results cannot be constrained by experimental results. It is thus necessary to evaluate the consistency and compatibility between the experiment and the MTM. To this end, we developed a particle-velocity-based model (PVBM) that enables direct monitoring and analysis of the particle velocity in the numerical model and has good robustness. The PVBM imitates the actual experiment and can fill the gaps between the experiment and the MTM. AE experiments on marine shale under uniaxial compression were carried out, and their results were simulated by the MTM. In general, the variation trend of the experimental results could be reproduced by the MTM; nevertheless, the magnitudes of AE parameters from the MTM differed from the experimental ones by several orders of magnitude. We then used the PVBM as a proxy to analyze these discrepancies quantitatively and to make a systematic evaluation of the AE characterization of brittle rocks from experiment to numerical simulation, considering the influence of wave reflection, geometrical diffusion of energy, viscous attenuation, particle size, and the progressive deterioration of the rock material. We suggest that only the combination of the MTM and the PVBM, making full use of their respective advantages, can reasonably and accurately recover the AE characteristics of actual AE experiments on brittle rocks.
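
As a rough guide to the correction terms listed above, one standard far-field amplitude model (our illustration of the listed effects, not necessarily the paper's exact formulation) combines geometrical spreading with viscous damping:

```latex
% One standard far-field amplitude model (an illustration, not
% necessarily the paper's formulation): a wave of frequency f recorded
% at distance r from the source decays as
\[
  A(r) \;=\; A_0 \left(\frac{r_0}{r}\right)^{n}
  \exp\!\left(-\frac{\pi f \,(r - r_0)}{Q\,c}\right),
\]
% where (r_0/r)^n is the geometrical diffusion term (n = 1 for spherical
% body-wave spreading), and the exponential captures viscous attenuation
% through the quality factor Q and the wave speed c.
```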
The DArk Matter Particle Explorer (DAMPE) is a space-borne high-energy cosmic-ray detector covering a wide energy band with high energy resolution. One of the key scientific goals of DAMPE is the indirect detection of dark matter by searching for high-energy gamma-ray line structures. To improve the sensitivity of the gamma-ray line search with DAMPE, it is crucial to improve the acceptance and energy resolution for gamma-ray photons. In this paper, we quantitatively prove that the photon sample with the largest ratio of acceptance to energy resolution is optimal for the line search, and we therefore develop a sample specifically optimized for it. Meanwhile, to increase the statistics, we also select the so-called BGO-only photons, which convert into $e^+e^-$ pairs only in the BGO calorimeter. The standard, line-search, and BGO-only photon samples are then tested for the line search individually and in combination. The results show that a significantly improved limit can be obtained from an appropriate combination of the data sets, with an improvement of about 20% in the best case compared with using the standard sample alone.
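
A back-of-the-envelope reading of why the acceptance-to-resolution ratio is the right figure of merit (our heuristic, not the paper's derivation):

```latex
% In a background-dominated search window whose width scales with the
% energy resolution sigma_E, the expected counts scale as
%   N_s \propto A_eff,    N_b \propto A_eff * sigma_E,
% so the local significance
\[
  \mathcal{S} \;\sim\; \frac{N_s}{\sqrt{N_b}}
  \;\propto\; \sqrt{\frac{A_{\mathrm{eff}}}{\sigma_E}}
\]
% is monotone in A_eff / sigma_E: maximizing the acceptance-to-resolution
% ratio maximizes the line-search sensitivity.
```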
The Transformer architecture is widely used for machine translation tasks. However, its resource-intensive nature makes it challenging to implement on constrained embedded devices, particularly where the available hardware resources can vary at run-time. We propose a dynamic machine translation model that scales the Transformer architecture according to the resources available at any particular time. The proposed approach, Dynamic-HAT, uses a HAT SuperTransformer as the backbone to search for SubTransformers with different accuracy-latency trade-offs at design time. The optimal SubTransformers are sampled from the SuperTransformer at run-time, depending on the latency constraints. Dynamic-HAT is tested on the Jetson Nano, where it uses inherited SubTransformers sampled directly from the SuperTransformer with a switching time of <1 s. Using inherited SubTransformers results in a BLEU score loss of <1.5% because the SubTransformer configurations are not retrained from scratch after sampling. To recover this loss in performance, the dimensions of the design space can be reduced to tailor it to a family of target hardware. The reduced design space yields a BLEU score increase of approximately 1% for sub-optimal models from the original design space, with a wide range of performance scaling, from 0.356 s to 1.526 s on the GPU and from 2.9 s to 7.31 s on the CPU.
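
A minimal sketch of the run-time selection step: given SubTransformers profiled at design time, pick the most accurate one that fits the current latency budget. The profile table below is hypothetical (illustrative latencies and BLEU scores, not measured Jetson Nano numbers).

```python
# Hypothetical design-time profiles: (name, latency in seconds, BLEU).
PROFILES = [
    ("sub_small",  0.36, 25.1),
    ("sub_medium", 0.71, 26.0),
    ("sub_large",  1.53, 26.8),
]

def select_subtransformer(latency_budget, profiles=PROFILES):
    """Pick the most accurate SubTransformer that fits the budget."""
    feasible = [p for p in profiles if p[1] <= latency_budget]
    if not feasible:                               # budget tighter than all:
        return min(profiles, key=lambda p: p[1])   # fall back to the fastest
    return max(feasible, key=lambda p: p[2])       # else maximize BLEU

print(select_subtransformer(0.8))   # -> ('sub_medium', 0.71, 26.0)
```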
A unifying graph-theoretic framework for the modelling of metro transportation networks is proposed. This is achieved by first introducing a basic graph framework for the modelling of the London underground system from a diffusion-law point of view. This forms a basis for the analysis of both station importance and station vulnerability, whereby the concept of graph vertex centrality plays a key role. We next explore k-edge augmentation of a graph topology, and illustrate its usefulness both for improving network robustness and as a planning tool. Upon establishing the graph-theoretic attributes of the underlying topology, we proceed to introduce models for processing data on such a metro graph. Commuter movement is shown to obey Fick's law of diffusion, where the graph Laplacian provides an analytical model for the diffusion of commuter population dynamics. Finally, we explore the application of modern deep learning models, such as graph neural networks and hypergraph neural networks, as general-purpose models for the modelling and forecasting of underground data, especially in the context of the morning and evening rush hours. Comprehensive simulations, including the passenger in- and out-flows during the morning rush hour in London, demonstrate the advantages of the graph models in metro planning and traffic management, a formal mathematical approach with wide economic implications.
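
A minimal NumPy sketch of the graph-Laplacian diffusion model on a toy network: commuters obey the discrete Fick's law dx/dt = -Lx, spreading toward a uniform distribution while the total count is conserved. The 4-station line is a stand-in segment, not real London data.

```python
import numpy as np

# Toy 4-station line (a stand-in metro segment, not real London data).
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
L = np.diag(A.sum(axis=1)) - A        # graph Laplacian L = D - A
x = np.array([100.0, 0.0, 0.0, 0.0])  # commuters initially at station 0
alpha = 0.2                           # diffusion rate (Euler step size)

for _ in range(200):
    x = x - alpha * (L @ x)           # discrete Fick's law: dx/dt = -L x

print(x.round(2), x.sum())            # spreads toward uniform; total conserved
```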
Recurrent Neural Networks (RNNs) represent the de facto standard machine learning tool for sequence modelling, owing to their expressive power and memory. However, when dealing with high-dimensional data, the corresponding exponential increase in the number of parameters imposes a computational bottleneck. The need to equip RNNs with the ability to deal with the curse of dimensionality, such as through the parameter compression inherent to tensors, has led to the development of the Tensor-Train RNN (TT-RNN). Despite achieving promising results in many applications, the full potential of the TT-RNN is yet to be explored in the context of interpretable financial modelling, a notoriously challenging task characterized by multi-modal data with a low signal-to-noise ratio. To address this issue, we investigate the potential of the TT-RNN for financial forecasting of currencies. We show, through the analysis of TT-factors, that the physical meaning underlying the tensor decomposition enables the TT-RNN model to aid the interpretability of results, thus mitigating the notorious black-box issue associated with neural networks. Furthermore, simulation results highlight the regularization power of the TT decomposition, demonstrating the superior performance of the TT-RNN over its uncompressed RNN counterpart and other tensor forecasting methods.
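
A minimal NumPy sketch of the compression mechanism behind TT-RNN: a weight matrix is stored as TT cores and applied to a vector without ever materializing it. The shapes and rank below are illustrative assumptions, not the paper's configuration.

```python
import numpy as np

# A TT-matrix W of shape (m1*m2) x (n1*n2) stored as two cores:
#   W[(i1,i2),(j1,j2)] = sum_r G1[i1,j1,r] * G2[r,i2,j2]
# Storage drops from m1*m2*n1*n2 to m1*n1*r + r*m2*n2 parameters.
m1 = m2 = n1 = n2 = 4
r = 3
rng = np.random.default_rng(1)
G1 = rng.standard_normal((m1, n1, r))
G2 = rng.standard_normal((r, m2, n2))
x = rng.standard_normal(n1 * n2)

X = x.reshape(n1, n2)
T = np.einsum('ajr,jc->arc', G1, X)       # contract over j1
y = np.einsum('arc,rbc->ab', T, G2)       # contract over rank and j2

# Validate against the explicit (uncompressed) matrix:
W = np.einsum('ajr,rbc->abjc', G1, G2).reshape(m1 * m2, n1 * n2)
assert np.allclose(y.reshape(-1), W @ x)
print(f"full params: {W.size}, TT params: {G1.size + G2.size}")
```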
Machine learning inference is increasingly being executed locally on mobile and embedded platforms, owing to clear advantages in latency, privacy, and connectivity. In this paper, we present approaches for online resource management in heterogeneous multi-core systems and show how they can be applied to optimise the performance of machine learning workloads. Performance can be defined using platform-dependent (e.g. speed, energy) and platform-independent (accuracy, confidence) metrics. In particular, we show how a Deep Neural Network (DNN) can be dynamically scaled to trade off these performance metrics. Achieving consistent performance when executing on different platforms is necessary yet challenging, due to differences in the resources provided and their capabilities, and their time-varying availability when executing alongside other workloads. Managing the interface between the available hardware resources (often numerous and heterogeneous in nature), software requirements, and user experience is increasingly complex.
Inference for Deep Neural Networks (DNNs) is increasingly being executed locally on mobile and embedded platforms, due to its advantages in latency, privacy, and connectivity. Since modern Systems on Chip typically execute a combination of different and dynamic workloads concurrently, it is challenging to consistently meet inference time/energy budgets at runtime, because the local computing resources available to the DNN vary considerably. To address this challenge, a variety of dynamic DNNs have been proposed. However, these works have significant memory overhead, limited runtime-recoverable compression rates, and narrow dynamic ranges of performance scaling. In this paper, we present a dynamic DNN using incremental training and group convolution pruning. The channels of the DNN convolution layers are divided into groups, which are then trained incrementally. At runtime, trailing groups can be pruned for inference time/energy reduction, or added back for accuracy recovery, without model retraining. In addition, we combine task mapping and Dynamic Voltage and Frequency Scaling (DVFS) with our dynamic DNN to deliver a finer trade-off between accuracy and time/power/energy over a wider dynamic range. We illustrate the approach by modifying AlexNet for the CIFAR10 image dataset and evaluate our work on two heterogeneous hardware platforms: Odroid XU3 (ARM big.LITTLE CPUs) and Nvidia Jetson Nano (CPU and GPU). Compared to existing works, our approach provides up to a 2.36x (energy) and 2.73x (time) wider dynamic range with a 2.4x smaller memory footprint at the same compression rate, and achieves a 10.6x (energy) and 41.6x (time) wider dynamic range when combined with task mapping and DVFS.
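
A minimal PyTorch sketch of the group-pruning mechanism as we read it: a convolution whose output channels are split into groups, with trailing groups sliced away at run time so the pruned computation is genuinely skipped. The class, dimensions, and group count are our assumptions, not the paper's code, and downstream layers would need to tolerate the varying channel count.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GroupPrunableConv(nn.Module):
    """Convolution whose output-channel groups can be switched off at
    run time: pruning trailing groups skips their computation, and
    re-enabling them restores accuracy without retraining (assuming the
    groups were trained incrementally, as in the described scheme)."""
    def __init__(self, in_ch, out_ch, groups=4, k=3):
        super().__init__()
        assert out_ch % groups == 0
        self.weight = nn.Parameter(torch.randn(out_ch, in_ch, k, k) * 0.01)
        self.bias = nn.Parameter(torch.zeros(out_ch))
        self.group_size = out_ch // groups
        self.active_groups = groups            # all groups enabled by default

    def forward(self, x):
        keep = self.active_groups * self.group_size
        # Only the first `keep` filters are applied, so pruned groups
        # cost no time or energy at all.
        return F.conv2d(x, self.weight[:keep], self.bias[:keep],
                        padding=self.weight.shape[-1] // 2)

layer = GroupPrunableConv(3, 16, groups=4)
x = torch.randn(1, 3, 32, 32)
print(layer(x).shape)          # torch.Size([1, 16, 32, 32])
layer.active_groups = 2        # prune half the groups at run time
print(layer(x).shape)          # torch.Size([1, 8, 32, 32])
```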