أوراق بحثية, رسائل ماجستير ودكتوراه منشورة من قبل Wei Fang

Does Knowledge Help General NLU? An Empirical Study

318 - Ruochen Xu , Yuwei Fang , Chenguang Zhu 2021

It is often observed in knowledge-centric tasks (e.g., common sense question and answering, relation classification) that the integration of external knowledge such as entity representation into language models can help provide useful information to boost the performance. However, it is still unclear whether this benefit can extend to general natural language understanding (NLU) tasks. In this work, we empirically investigated the contribution of external knowledge by measuring the end-to-end performance of language models with various knowledge integration methods. We find that the introduction of knowledge can significantly improve the results on certain tasks while having no adverse effects on other tasks. We then employ mutual information to reflect the difference brought by knowledge and a neural interpretation model to reveal how a language model utilizes external knowledge. Our study provides valuable insights and guidance for practitioners to equip NLP models with knowledge.

الحساب واللغة

Robust fast method for variable-order time-fractional diffusion equations without regularity assumptions

99 - Jia-Li Zhang , Zhi-Wei Fang , Hai-Wei Sun 2021

In this paper, we develop a robust fast method for mobile-immobile variable-order (VO) time-fractional diffusion equations (tFDEs), superiorly handling the cases of small or vanishing lower bound of the VO function. The valid fast approximation of th e VO Caputo fractional derivative is obtained using integration by parts and the exponential-sum-approximation method. Compared with the general direct method, the proposed algorithm ($RF$-$L1$ formula) reduces the acting memory from $mathcal{O}(n)$ to $mathcal{O}(log^2 n)$ and computational cost from $mathcal{O}(n^2)$ to $mathcal{O}(n log^2 n)$, respectively, where $n$ is the number of time levels. Then $RF$-$L1$ formula is applied to construct the fast finite difference scheme for the VO tFDEs, which sharp decreases the memory requirement and computational complexity. The error estimate for the proposed scheme is studied only under some assumptions of the VO function, coefficients, and the source term, but without any regularity assumption of the true solutions. Numerical experiments are presented to verify the effectiveness of the proposed method.

التحليل العددي التحليل العددي

On-chip multi-color microdisk laser on Yb3+-doped thin-film lithium niobate

138 - Yuan Zhou , Zhe Wang , Zhiwei Fang 2021

We demonstrate an on-chip Yb3+-doped lithium niobate (LN) microdisk laser. The intrinsic quality factors of the fabricated Yb3+-doped LN microdisk resonator are measured up to 3.79x10^5 at 976 nm wavelength and 1.1x10^6 at 1514 nm wavelength. The mul ti-mode laser emissions are obtained in a band from 1020 nm to 1070 nm pumped by 984 nm laser and with the low threshold of 103 {mu}W, resulting in a slope efficiency of 0.53% at room temperature. Furthermore, the second-harmonic frequency of pump light and the sum-frequency of the pump light and laser emissions are both generated in the on-chip Yb3+-doped LN microdisk benefited from the strong c{hi}(2) nonlinearity of LN. These microdisk lasers are expected to contribute to the high-density integration of LNOI-based photonic chip.

بصريات

Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning

183 - Junkai Huang , Chaowei Fang , Weikai Chen 2021

Open-set semi-supervised learning (open-set SSL) investigates a challenging but practical scenario where out-of-distribution (OOD) samples are contained in the unlabeled data. While the mainstream technique seeks to completely filter out the OOD samp les for semi-supervised learning (SSL), we propose a novel training mechanism that could effectively exploit the presence of OOD data for enhanced feature learning while avoiding its adverse impact on the SSL. We achieve this goal by first introducing a warm-up training that leverages all the unlabeled data, including both the in-distribution (ID) and OOD samples. Specifically, we perform a pretext task that enforces our feature extractor to obtain a high-level semantic understanding of the training images, leading to more discriminative features that can benefit the downstream tasks. Since the OOD samples are inevitably detrimental to SSL, we propose a novel cross-modal matching strategy to detect OOD samples. Instead of directly applying binary classification, we train the network to predict whether the data sample is matched to an assigned one-hot class label. The appeal of the proposed cross-modal matching over binary classification is the ability to generate a compatible feature space that aligns with the core classification task. Extensive experiments show that our approach substantially lifts the performance on open-set SSL and outperforms the state-of-the-art by a large margin.

الرؤية الحاسوبية وتمييز الأنماط

Coherent mode-combined ultra-narrow-linewidth single-mode micro-disk laser

235 - Jintian Lin , Saeed Farajollahi , Zhiwei Fang 2021

Integrated single-mode microlasers with ultra-narrow linewidths play a game-changing role in a broad spectrum of applications ranging from coherent communication and LIDAR to metrology and sensing. Generation of such light sources in a controllable a nd cost-effective manner remains an outstanding challenge due to the difficulties in the realization of ultra-high Q active micro-resonators with suppressed mode numbers. Here, we report a microlaser generated in an ultra-high Q Erbium doped lithium niobate (LN) micro-disk. Through the formation of coherently combined polygon modes at both pump and laser wavelengths, the microlaser exhibits single mode operation with an ultra-narrow-linewidth of 98 Hz. In combination with the superior electro-optic and nonlinear optical properties of LN crystal, the mass-producible on-chip single-mode microlaser will provide an essential building block for the photonic integrated circuits demanding high precision frequency control and reconfigurability.

بصريات

Pruning of Deep Spiking Neural Networks through Gradient Rewiring

159 - Yanqi Chen , Zhaofei Yu , Wei Fang 2021

Spiking Neural Networks (SNNs) have been attached great importance due to their biological plausibility and high energy-efficiency on neuromorphic chips. As these chips are usually resource-constrained, the compression of SNNs is thus crucial along t he road of practical use of SNNs. Most existing methods directly apply pruning approaches in artificial neural networks (ANNs) to SNNs, which ignore the difference between ANNs and SNNs, thus limiting the performance of the pruned SNNs. Besides, these methods are only suitable for shallow SNNs. In this paper, inspired by synaptogenesis and synapse elimination in the neural system, we propose gradient rewiring (Grad R), a joint learning algorithm of connectivity and weight for SNNs, that enables us to seamlessly optimize network structure without retraining. Our key innovation is to redefine the gradient to a new synaptic parameter, allowing better exploration of network structures by taking full advantage of the competition between pruning and regrowth of connections. The experimental results show that the proposed method achieves minimal loss of SNNs performance on MNIST and CIFAR-10 dataset so far. Moreover, it reaches a $sim$3.5% accuracy loss under unprecedented 0.73% connectivity, which reveals remarkable structure refining capability in SNNs. Our work suggests that there exists extremely high redundancy in deep SNNs. Our codes are available at https://github.com/Yanqi-Chen/Gradient-Rewiring.

الحوسبة العصبية والتطورية الذكاء الاصطناعي الرؤية الحاسوبية وتمييز الأنماط

An Ultrahigh-Q Microresonator on 4H-silicon-carbide-on-insulator Platform for Multiple Harmonics, Cascaded Raman Lasing and Kerr Comb Generations

370 - Chengli Wang , Zhiwei Fang , Ailun Yi 2021

The realization of ultrahigh quality (Q) resonators regardless of the underpinning material platforms has been a ceaseless pursuit, because the high Q resonators provide an extreme environment of storage of light to enable observations of many unconv entional nonlinear optical phenomenon with high efficiencies. Here, we demonstrate an ultra-high Q factor (7.1*10^6) microresonator on the 4H-silicon-carbide-on-insulator (4H-SiCOI) platform in which both c{hi}^(2) and c{hi}^(3) nonlinear processes of high efficiencies have been generated. Broadband frequency

بصريات

Deep Transformers for Fast Small Intestine Grounding in Capsule Endoscope Video

52 - Xinkai Zhao , Chaowei Fang , Feng Gao 2021

Capsule endoscopy is an evolutional technique for examining and diagnosing intractable gastrointestinal diseases. Because of the huge amount of data, analyzing capsule endoscope videos is very time-consuming and labor-intensive for gastrointestinal m edicalists. The development of intelligent long video analysis algorithms for regional positioning and analysis of capsule endoscopic video is therefore essential to reduce the workload of clinicians and assist in improving the accuracy of disease diagnosis. In this paper, we propose a deep model to ground shooting range of small intestine from a capsule endoscope video which has duration of tens of hours. This is the first attempt to attack the small intestine grounding task using deep neural network method. We model the task as a 3-way classification problem, in which every video frame is categorized into esophagus/stomach, small intestine or colorectum. To explore long-range temporal dependency, a transformer module is built to fuse features of multiple neighboring frames. Based on the classification model, we devise an efficient search algorithm to efficiently locate the starting and ending shooting boundaries of the small intestine. Without searching the small intestine exhaustively in the full video, our method is implemented via iteratively separating the video segment along the direction to the target boundary in the middle. We collect 113 videos from a local hospital to validate our method. In the 5-fold cross validation, the average IoU between the small intestine segments located by our method and the ground-truths annotated by broad-certificated gastroenterologists reaches 0.945.

الرؤية الحاسوبية وتمييز الأنماط

Deep Residual Learning in Spiking Neural Networks

92 - Wei Fang , Zhaofei Yu , Yanqi Chen 2021

Deep Spiking Neural Networks (SNNs) present optimization difficulties for gradient-based approaches due to discrete binary activation and complex spatial-temporal dynamics. Considering the huge success of ResNet in deep learning, it would be natural to train deep SNNs with residual learning. Previous Spiking ResNet mimics the standard residual block in ANNs and simply replaces ReLU activation layers with spiking neurons, which suffers the degradation problem and can hardly implement residual learning. In this paper, we propose the spike-element-wise (SEW) ResNet to realize residual learning in deep SNNs. We prove that the SEW ResNet can easily implement identity mapping and overcome the vanishing/exploding gradient problems of Spiking ResNet. We evaluate our SEW ResNet on ImageNet and DVS Gesture datasets, and show that SEW ResNet outperforms the state-of-the-art directly trained SNNs in both accuracy and time-steps. Moreover, SEW ResNet can achieve higher performance by simply adding more layers, providing a simple method to train deep SNNs. To our best knowledge, this is the first time that directly training deep SNNs with more than 100 layers becomes possible.

الحوسبة العصبية والتطورية

Fast Parametric Model Checking through Model Fragmentation

194 - Xinwei Fang , Radu Calinescu , Simos Gerasimou 2021

Parametric model checking (PMC) computes algebraic formulae that express key non-functional properties of a system (reliability, performance, etc.) as rational functions of the system and environment parameters. In software engineering, PMC formulae can be used during design, e.g., to analyse the sensitivity of different system architectures to parametric variability, or to find optimal system configurations. They can also be used at runtime, e.g., to check if non-functional requirements are still satisfied after environmental changes, or to select new configurations after such changes. However, current PMC techniques do not scale well to systems with complex behaviour and more than a few parameters. Our paper introduces a fast PMC (fPMC) approach that overcomes this limitation, extending the applicability of PMC to a broader class of systems than previously possible. To this end, fPMC partitions the Markov models that PMC operates with into emph{fragments} whose reachability properties are analysed independently, and obtains PMC reachability formulae by combining the results of these fragment analyses. To demonstrate the effectiveness of fPMC, we show how our fPMC tool can analyse three systems (taken from the research literature, and belonging to different application domains) with which current PMC techniques and tools struggle.

هندسة البرمجيات اللغات الرسمية ونظرية الأتومات علم الروبوتات

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد