أوراق بحثية, رسائل ماجستير ودكتوراه منشورة من قبل Weiwei Guo

Deep Natural Language Processing for LinkedIn Search

104 - Weiwei Guo , Xiaowei Liu , Sida Wang 2021

Many search systems work with large amounts of natural language data, e.g., search queries, user profiles, and documents. Building a successful search system requires a thorough understanding of textual data semantics, where deep learning based natur al language processing techniques (deep NLP) can be of great help. In this paper, we introduce a comprehensive study for applying deep NLP techniques to five representative tasks in search systems: query intent prediction (classification), query tagging (sequential tagging), document ranking (ranking), query auto completion (language modeling), and query suggestion (sequence to sequence). We also introduce BERT pre-training as a sixth task that can be applied to many of the other tasks. Through the model design and experiments of the six tasks, readers can find answers to four important questions: (1). When is deep NLP helpful/not helpful in search systems? (2). How to address latency challenges? (3). How to ensure model robustness? This work builds on existing efforts of LinkedIn search, and is tested at scale on LinkedIns commercial search engines. We believe our experiences can provide useful insights for the industry and research communities.

استرجاع المعلومات الحساب واللغة

Deep Natural Language Processing for LinkedIn Search Systems

129 - Weiwei Guo , Xiaowei Liu , Sida Wang 2021

Many search systems work with large amounts of natural language data, e.g., search queries, user profiles and documents, where deep learning based natural language processing techniques (deep NLP) can be of great help. In this paper, we introduce a c omprehensive study of applying deep NLP techniques to five representative tasks in search engines. Through the model design and experiments of the five tasks, readers can find answers to three important questions: (1) When is deep NLP helpful/not helpful in search systems? (2) How to address latency challenges? (3) How to ensure model robustness? This work builds on existing efforts of LinkedIn search, and is tested at scale on a commercial search engine. We believe our experiences can provide useful insights for the industry and research communities.

الحساب واللغة الذكاء الاصطناعي

Deep Search Query Intent Understanding

70 - Xiaowei Liu , Weiwei Guo , Huiji Gao 2020

Understanding a users query intent behind a search is critical for modern search engine success. Accurate query intent prediction allows the search engine to better serve the users need by rendering results from more relevant categories. This paper a ims to provide a comprehensive learning framework for modeling query intent under different stages of a search. We focus on the design for 1) predicting users intents as they type in queries on-the-fly in typeahead search using character-level models; and 2) accurate word-level intent prediction models for complete queries. Various deep learning components for query text understanding are experimented. Offline evaluation and online A/B test experiments show that the proposed methods are effective in understanding query intent and efficient to scale for online search systems.

الحساب واللغة الذكاء الاصطناعي استرجاع المعلومات

Efficient Neural Query Auto Completion

101 - Sida Wang , Weiwei Guo , Huiji Gao 2020

Query Auto Completion (QAC), as the starting point of information retrieval tasks, is critical to user experience. Generally it has two steps: generating completed query candidates according to query prefixes, and ranking them based on extracted feat ures. Three major challenges are observed for a query auto completion system: (1) QAC has a strict online latency requirement. For each keystroke, results must be returned within tens of milliseconds, which poses a significant challenge in designing sophisticated language models for it. (2) For unseen queries, generated candidates are of poor quality as contextual information is not fully utilized. (3) Traditional QAC systems heavily rely on handcrafted features such as the query candidate frequency in search logs, lacking sufficient semantic understanding of the candidate. In this paper, we propose an efficient neural QAC system with effective context modeling to overcome these challenges. On the candidate generation side, this system uses as much information as possible in unseen prefixes to generate relevant candidates, increasing the recall by a large margin. On the candidate ranking side, an unnormalized language model is proposed, which effectively captures deep semantics of queries. This approach presents better ranking performance over state-of-the-art neural ranking methods and reduces $sim$95% latency compared to neural language modeling methods. The empirical results on public datasets show that our model achieves a good balance between accuracy and efficiency. This system is served in LinkedIn job search with significant product impact observed.

الحساب واللغة

DeText: A Deep Text Ranking Framework with BERT

98 - Weiwei Guo , Xiaowei Liu , Sida Wang 2020

Ranking is the most important component in a search system. Mostsearch systems deal with large amounts of natural language data,hence an effective ranking system requires a deep understandingof text semantics. Recently, deep learning based natural la nguageprocessing (deep NLP) models have generated promising results onranking systems. BERT is one of the most successful models thatlearn contextual embedding, which has been applied to capturecomplex query-document relations for search ranking. However,this is generally done by exhaustively interacting each query wordwith each document word, which is inefficient for online servingin search product systems. In this paper, we investigate how tobuild an efficient BERT-based ranking model for industry use cases.The solution is further extended to a general ranking framework,DeText, that is open sourced and can be applied to various rankingproductions. Offline and online experiments of DeText on threereal-world search systems present significant improvement overstate-of-the-art approaches.

استرجاع المعلومات الحساب واللغة

n-Lie bialgebras

227 - Ruipu Bai , Weiwei Guo , Lixin Lin 2016

The $n$-Lie bialgebras are studied. In Section 2, the $n$-Lie coalgebra with rank $r$ is defined, and the structure of it is discussed. In Section 3, the $n$-Lie bialgebra is introduced. A triple $(L, mu, Delta)$ is an $n$-Lie bialgebra if and only i f $Delta$ is a conformal $1$-cocycle on the $n$-Lie algebra $L$ associated to $L$-modules $(L^{otimes n}, rho_s^{mu})$, $1leq sleq n$, and the structure of $n$-Lie bialgebras is investigated by the structural constants. In Section 4, two-dimensional extension of finite dimensional $n$-Lie bialgebras are studied. For an $m$ dimensional $n$-Lie bialgebra $(L, mu, Delta)$, and an $ad_{mu}$-invariant symmetric bilinear form on $L$, the $m+2$ dimensional $(n+1)$-Lie bialgebra is constructed. In the last section, the bialgebra structure on the finite dimensional simple $n$-Lie algebra $A_n$ is discussed. It is proved that only bialgebra structures on the simple $n$-Lie algebra $A_n$ are rank zero, and rank two.

حلقات وجبر الفيزياء الرياضية الفيزياء الرياضية

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد