Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Learning Embeddings for Rare Words Leveraging Internet Search Engine and Spatial Location Relationships

تضمينات التعلم من أجل كلمات نادرة الاستفادة من محرك البحث عن الإنترنت وعلاقات الموقع المكاني

1125 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Word embedding techniques depend heavily on the frequencies of words in the corpus, and are negatively impacted by failures in providing reliable representations for low-frequency words or unseen words during training. To address this problem, we propose an algorithm to learn embeddings for rare words based on an Internet search engine and the spatial location relationships. Our algorithm proceeds in two steps. We firstly retrieve webpages corresponding to the rare word through the search engine and parse the returned results to extract a set of most related words. We average the vectors of the related words as the initial vector of the rare word. Then, the location of the rare word in the vector space is iteratively fine-tuned according to the order of its relevances to the related words. Compared to other approaches, our algorithm can learn more accurate representations for a wider range of vocabulary. We evaluate our learned rare-word embeddings on the word relatedness task, and the experimental results show that our algorithm achieves state-of-the-art performance.

References used

https://aclanthology.org/

rate research

Search Engine optimization ( SEO ) - The Beginner's Guide

1968 - Google 2015 كتاب

The basics of sEO, create unique page titles, improve the website structure, improve the content, dealing with crawlers, improve SEO for mobile devices, using analytics and promotional operating

SEO Google Shamra

An Empirical Study of Generating Texts for Search Engine Advertising

925 - Association for Computation Linguistics 2021 مقالة

Although there are many studies on neural language generation (NLG), few trials are put into the real world, especially in the advertising domain. Generating ads with NLG models can help copywriters in their creation. However, few studies have adequa tely evaluated the effect of generated ads with actual serving included because it requires a large amount of training data and a particular environment. In this paper, we demonstrate a practical use case of generating ad-text with an NLG model. Specially, we show how to improve the ads' impact, deploy models to a product, and evaluate the generated ads.

search engine advertising texts for search بحث محرك الإعلان نصوص للبحث صناعة حمض الفوسفور

Leveraging Bidding Graphs for Advertiser-Aware Relevance Modeling in Sponsored Search

958 - Association for Computation Linguistics 2021 مقالة

Recently, sponsored search has become one of the most lucrative channels for marketing. As the fundamental basis of sponsored search, relevance modeling has attracted increasing attention due to the tremendous practical value. Most existing methods s olely rely on the query-keyword pairs. However, keywords are usually short texts with scarce semantic information, which may not precisely reflect the underlying advertising intents. In this paper, we investigate the novel problem of advertiser-aware relevance modeling, which leverages the advertisers' information to bridge the gap between the search intents and advertising purposes. Our motivation lies in incorporating the unsupervised bidding behaviors as the complementary graphs to learn desirable advertiser representations. We further propose a Bidding-Graph augmented Triple-based Relevance model BGTR with three towers to deeply fuse the bidding graphs and semantic textual data. Empirically, we evaluate the BGTR model over a large industry dataset, and the experimental results consistently demonstrate its superiority.

sponsored search advertiser-aware relevance modeling relevance modeling برعاية البحث المعلن- علم النمذجة صناعة حمض الفوسفور

TR-SEQ: Named Entity Recognition Dataset for Turkish Search Engine Queries

834 - Association for Computation Linguistics 2021 مقالة

Recognizing named entities in short search engine queries is a difficult task due to their weaker contextual information compared to long sentences. Standard named entity recognition (NER) systems that are trained on grammatically correct and long se ntences fail to perform well on such queries. In this study, we share our efforts towards creating a cleaned and labeled dataset of real Turkish search engine queries (TR-SEQ) and introduce an extended label set to satisfy the search engine needs. A NER system is trained by applying the state-of-the-art deep learning method BERT to the collected data and its high performance on search engine queries is reported. Moreover, we compare our results with the state-of-the-art Turkish NER systems.

search engine queries استفسارات محرك البحث صناعة حمض الفوسفور

Beyond Black \& White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning

606 - Association for Computation Linguistics 2021 مقالة

Supervised learning assumes that a ground truth label exists. However, the reliability of this ground truth depends on human annotators, who often disagree. Prior work has shown that this disagreement can be helpful in training models. We propose a n ovel method to incorporate this disagreement as information: in addition to the standard error computation, we use soft-labels (i.e., probability distributions over the annotator labels) as an auxiliary task in a multi-task neural network. We measure the divergence between the predictions and the target soft-labels with several loss-functions and evaluate the models on various NLP tasks. We find that the soft-label prediction auxiliary task reduces the penalty for errors on ambiguous entities, and thereby mitigates overfitting. It significantly improves performance across tasks, beyond the standard approach and prior work.

leveraging annotator disagreement leveraging annotator supervised learning assumes الاستفادة من الخلاص العنصري الاستفادة من Annotator التعلم الخاضع للإشراف يفترض صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Learning Embeddings for Rare Words Leveraging Internet Search Engine and Spatial Location Relationships

تضمينات التعلم من أجل كلمات نادرة الاستفادة من محرك البحث عن الإنترنت وعلاقات الموقع المكاني

Ask ChatGPT about the research

Read More

suggested questions