أوراق بحثية, رسائل ماجستير ودكتوراه منشورة من قبل Zheng Zhu

Euphemistic Phrase Detection by Masked Language Model

111 - Wanzheng Zhu , Suma Bhat 2021

It is a well-known approach for fringe groups and organizations to use euphemisms -- ordinary-sounding and innocent-looking words with a secret meaning -- to conceal what they are discussing. For instance, drug dealers often use pot for marijuana and avocado for heroin. From a social media content moderation perspective, though recent advances in NLP have enabled the automatic detection of such single-word euphemisms, no existing work is capable of automatically detecting multi-word euphemisms, such as blue dream (marijuana) and black tar (heroin). Our paper tackles the problem of euphemistic phrase detection without human effort for the first time, as far as we are aware. We first perform phrase mining on a raw text corpus (e.g., social media posts) to extract quality phrases. Then, we utilize word embedding similarities to select a set of euphemistic phrase candidates. Finally, we rank those candidates by a masked language model -- SpanBERT. Compared to strong baselines, we report 20-50% higher detection accuracies using our algorithm for detecting euphemistic phrases.

الحساب واللغة

Masked Face Recognition Challenge: The InsightFace Track Report

72 - Jiankang Deng , Jia Guo , Xiang An 2021

During the COVID-19 coronavirus epidemic, almost everyone wears a facial mask, which poses a huge challenge to deep face recognition. In this workshop, we organize Masked Face Recognition (MFR) challenge and focus on bench-marking deep face recogniti on methods under the existence of facial masks. In the MFR challenge, there are two main tracks: the InsightFace track and the WebFace260M track. For the InsightFace track, we manually collect a large-scale masked face test set with 7K identities. In addition, we also collect a children test set including 14K identities and a multi-racial test set containing 242K identities. By using these three test sets, we build up an online model testing system, which can give a comprehensive evaluation of face recognition models. To avoid data privacy problems, no test image is released to the public. As the challenge is still under-going, we will keep on updating the top-ranked solutions as well as this report on the arxiv.

الرؤية الحاسوبية وتمييز الأنماط

Masked Face Recognition Challenge: The WebFace260M Track Report

92 - Zheng Zhu , Guan Huang , Jiankang Deng 2021

According to WHO statistics, there are more than 204,617,027 confirmed COVID-19 cases including 4,323,247 deaths worldwide till August 12, 2021. During the coronavirus epidemic, almost everyone wears a facial mask. Traditionally, face recognition app roaches process mostly non-occluded faces, which include primary facial features such as the eyes, nose, and mouth. Removing the mask for authentication in airports or laboratories will increase the risk of virus infection, posing a huge challenge to current face recognition systems. Due to the sudden outbreak of the epidemic, there are yet no publicly available real-world masked face recognition (MFR) benchmark. To cope with the above-mentioned issue, we organize the Face Bio-metrics under COVID Workshop and Masked Face Recognition Challenge in ICCV 2021. Enabled by the ultra-large-scale WebFace260M benchmark and the Face Recognition Under Inference Time conStraint (FRUITS) protocol, this challenge (WebFace260M Track) aims to push the frontiers of practical MFR. Since public evaluation sets are mostly saturated or contain noise, a new test set is gathered consisting of elaborated 2,478 celebrities and 60,926 faces. Meanwhile, we collect the world-largest real-world masked test set. In the first phase of WebFace260M Track, 69 teams (total 833 solutions) participate in the challenge and 49 teams exceed the performance of our baseline. There are second phase of the challenge till October 1, 2021 and on-going leaderboard. We will actively update this report in the future.

الرؤية الحاسوبية وتمييز الأنماط

Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System

74 - Jinhan Wang , Yunzheng Zhu , Ruchao Fan 2021

This paper describes the SPAPL system for the INTERSPEECH 2021 Challenge: Shared Task on Automatic Speech Recognition for Non-Native Childrens Speech in German. ~ 5 hours of transcribed data and ~ 60 hours of untranscribed data are provided to develo p a German ASR system for children. For the training of the transcribed data, we propose a non-speech state discriminative loss (NSDL) to mitigate the influence of long-duration non-speech segments within speech utterances. In order to explore the use of the untranscribed data, various approaches are implemented and combined together to incrementally improve the system performance. First, bidirectional autoregressive predictive coding (Bi-APC) is used to learn initial parameters for acoustic modelling using the provided untranscribed data. Second, incremental semi-supervised learning is further used to iteratively generate pseudo-transcribed data. Third, different data augmentation schemes are used at different training stages to increase the variability and size of the training data. Finally, a recurrent neural network language model (RNNLM) is used for rescoring. Our system achieves a word error rate (WER) of 39.68% on the evaluation data, an approximately 12% relative improvement over the official baseline (45.21%).

معالجة الصوت والكلام التعلم الآلي

Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech

86 - Wanzheng Zhu , Suma Bhat 2021

Countermeasures to effectively fight the ever increasing hate speech online without blocking freedom of speech is of great social interest. Natural Language Generation (NLG), is uniquely capable of developing scalable solutions. However, off-the-shel f NLG methods are primarily sequence-to-sequence neural models and they are limited in that they generate commonplace, repetitive and safe responses regardless of the hate speech (e.g., Please refrain from using such language.) or irrelevant responses, making them ineffective for de-escalating hateful conversations. In this paper, we design a three-module pipeline approach to effectively improve the diversity and relevance. Our proposed pipeline first generates various counterspeech candidates by a generative model to promote diversity, then filters the ungrammatical ones using a BERT model, and finally selects the most relevant counterspeech response using a novel retrieval-based method. Extensive Experiments on three representative datasets demonstrate the efficacy of our approach in generating diverse and relevant counterspeech.

الحساب واللغة

Sobolev Extension on $L^p$-Quasidisks

95 - Zheng Zhu 2021

In this paper, we study the Sobolev extension property of Lp-quasidisks which are the generalizations of the classical quasidisks. After that, we also find some applications of their Sobolev extension property.

تحليل وظيفي

WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

105 - Zheng Zhu , Guan Huang , Jiankang Deng 2021

In this paper, we contribute a new million-scale face benchmark containing noisy 4M identities/260M faces (WebFace260M) and cleaned 2M identities/42M faces (WebFace42M) training data, as well as an elaborately designed time-constrained evaluation pro tocol. Firstly, we collect 4M name list and download 260M faces from the Internet. Then, a Cleaning Automatically utilizing Self-Training (CAST) pipeline is devised to purify the tremendous WebFace260M, which is efficient and scalable. To the best of our knowledge, the cleaned WebFace42M is the largest public face recognition training set and we expect to close the data gap between academia and industry. Referring to practical scenarios, Face Recognition Under Inference Time conStraint (FRUITS) protocol and a test set are constructed to comprehensively evaluate face matchers. Equipped with this benchmark, we delve into million-scale face recognition problems. A distributed framework is developed to train face recognition models efficiently without tampering with the performance. Empowered by WebFace42M, we reduce relative 40% failure rate on the challenging IJB-C set, and ranks the 3rd among 430 entries on NIST-FRVT. Even 10% data (WebFace4M) shows superior performance compared with public training set. Furthermore, comprehensive baselines are established on our rich-attribute test set under FRUITS-100ms/500ms/1000ms protocol, including MobileNet, EfficientNet, AttentionNet, ResNet, SENet, ResNeXt and RegNet families. Benchmark website is https://www.face-benchmark.org.

الرؤية الحاسوبية وتمييز الأنماط

A Spherical Hidden Markov Model for Semantics-Rich Human Mobility Modeling

96 - Wanzheng Zhu , Chao Zhang , Shuochao Yao 2020

We study the problem of modeling human mobility from semantic trace data, wherein each GPS record in a trace is associated with a text message that describes the users activity. Existing methods fall short in unveiling human movement regularities, be cause they either do not model the text data at all or suffer from text sparsity severely. We propose SHMM, a multi-modal spherical hidden Markov model for semantics-rich human mobility modeling. Under the hidden Markov assumption, SHMM models the generation process of a given trace by jointly considering the observed location, time, and text at each step of the trace. The distinguishing characteristic of SHMM is the text modeling part. We use fixed-size vector representations to encode the semantics of the text messages, and model the generation of the l2-normalized text embeddings on a unit sphere with the von Mises-Fisher (vMF) distribution. Compared with other alternatives like multi-variate Gaussian, our choice of the vMF distribution not only incurs much fewer parameters, but also better leverages the discriminative power of text embeddings in a directional metric space. The parameter inference for the vMF distribution is non-trivial since it involves functional inversion of ratios of Bessel functions. We theoretically prove that: 1) the classical Expectation-Maximization algorithm can work with vMF distributions; and 2) while closed-form solutions are hard to be obtained for the M-step, Newtons method is guaranteed to converge to the optimal solution with quadratic convergence rate. We have performed extensive experiments on both synthetic and real-life data. The results on synthetic data verify our theoretical analysis; while the results on real-life data demonstrate that SHMM learns meaningful semantics-rich mobility models, outperforms state-of-the-art mobility models for next location prediction, and incurs lower training cost.

التعلم الآلي الذكاء الاصطناعي الحساب واللغة

Doped Mott Insulators in the Triangular Lattice Hubbard Model

91 - Zheng Zhu , D. N. Sheng , Ashvin Vishwanath 2020

We investigate the evolution of the Mott insulators in the triangular lattice Hubbard Model, as a function of hole doping $delta$ in both the strong and intermediate coupling limit. Using the density matrix renormalization group (DMRG) method, at lig ht hole doping $deltalesssim 10%$, we find a significant difference between strong and intermediate couplings. Notably, at intermediate coupling an unusual metallic state emerges, with short ranged spin correlations but long ranged spin-chirality order. Moreover, no clear Fermi surface or wave-vector is observed. These features disappear on increasing interaction strength or on further doping. At strong coupling, the 120 degree magnetic order of the insulating magnet persists for light doping, and produces hole pockets with a well defined Fermi surface. On further doping, $delta approx 10%sim 20%$ SDW order and coherent hole Fermi pockets are found at both strong and intermediate coupling. At even higher doping $delta gtrsim 20%$, the SDW order is suppressed and the spin-singlet Cooper pair correlations are simultaneously enhanced. We interpret this as the onset of superconductivity on suppressing magnetic order. We also briefly comment on the strong particle hole asymmetry of the model, and contrast electron versus hole doping.

الإلكترونات المرتبطة بشدة

Pointwise inequalities for Sobolev functions on outward cuspidal domains

204 - Sylvester Eriksson-Bique , Pekka Koskela , Jan Maly 2019

We show that the first order Sobolev spaces on cuspidal symmetric domains can be characterized via pointwise inequalities. In particular, they coincide with the Hajlasz-Sobolev spaces.

تحليل وظيفي

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد