ترغب بنشر مسار تعليمي؟ اضغط هنا

CNN Retrieval based Unsupervised Metric Learning for Near-Duplicated Video Retrieval

84   0   0.0 ( 0 )
 نشر من قبل Hao Cheng
 تاريخ النشر 2021
  مجال البحث الهندسة المعلوماتية
والبحث باللغة English




اسأل ChatGPT حول البحث

As important data carriers, the drastically increasing number of multimedia videos often brings many duplicate and near-duplicate videos in the top results of search. Near-duplicate video retrieval (NDVR) can cluster and filter out the redundant contents. In this paper, the proposed NDVR approach extracts the frame-level video representation based on convolutional neural network (CNN) features from fully-connected layer and aggregated intermediate convolutional layers. Unsupervised metric learning is used for similarity measurement and feature matching. An efficient re-ranking algorithm combined with k-nearest neighborhood fuses the retrieval results from two levels of features and further improves the retrieval performance. Extensive experiments on the widely used CC_WEB_VIDEO dataset shows that the proposed approach exhibits superior performance over the state-of-the-art.

قيم البحث

اقرأ أيضاً

Existing video indexing and retrieval methods on popular web-based multimedia sharing websites are based on user-provided sparse tagging. This paper proposes a very specific way of searching for video clips, based on the content of the video. We pres ent our work on Content-based Video Indexing and Retrieval using the Correspondence-Latent Dirichlet Allocation (corr-LDA) probabilistic framework. This is a model that provides for auto-annotation of videos in a database with textual descriptors, and brings the added benefit of utilizing the semantic relations between the content of the video and text. We use the concept-level matching provided by corr-LDA to build correspondences between text and multimedia, with the objective of retrieving content with increased accuracy. In our experiments, we employ only the audio components of the individual recordings and compare our results with an SVM-based approach.
In this paper, we explore the problem of developing personalized chatbots. A personalized chatbot is designed as a digital chatting assistant for a user. The key characteristic of a personalized chatbot is that it should have a consistent personality with the corresponding user. It can talk the same way as the user when it is delegated to respond to others messages. We present a retrieval-based personalized chatbot model, namely IMPChat, to learn an implicit user profile from the users dialogue history. We argue that the implicit user profile is superior to the explicit user profile regarding accessibility and flexibility. IMPChat aims to learn an implicit user profile through modeling users personalized language style and personalized preferences separately. To learn a users personalized language style, we elaborately build language models from shallow to deep using the users historical responses; To model a users personalized preferences, we explore the conditional relations underneath each post-response pair of the user. The personalized preferences are dynamic and context-aware: we assign higher weights to those historical pairs that are topically related to the current query when aggregating the personalized preferences. We match each response candidate with the personalized language style and personalized preference, respectively, and fuse the two matching signals to determine the final ranking score. Comprehensive experiments on two large datasets show that our method outperforms all baseline models.
This report describes metrics for the evaluation of the effectiveness of segment-based retrieval based on existing binary information retrieval metrics. This metrics are described in the context of a task for the hyperlinking of video segments. This evaluation approach re-uses existing evaluation measures from the standard Cranfield evaluation paradigm. Our adaptation approach can in principle be used with any kind of effectiveness measure that uses binary relevance, and for other segment-baed retrieval tasks. In our video hyperlinking setting, we use precision at a cut-off rank n and mean average precision.
Content based video retrieval is an approach for facilitating the searching and browsing of large image collections over World Wide Web. In this approach, video analysis is conducted on low level visual properties extracted from video frame. We belie ved that in order to create an effective video retrieval system, visual perception must be taken into account. We conjectured that a technique which employs multiple features for indexing and retrieval would be more effective in the discrimination and search tasks of videos. In order to validate this claim, content based indexing and retrieval systems were implemented using color histogram, various texture features and other approaches. Videos were stored in Oracle 9i Database and a user study measured correctness of response.
The proliferation of smartphones has led to the increased popularity of location-based search and recommendation systems. Online platforms like Google and Yelp allow location-based search in the form of nearby feature to query for hotels or restauran ts in the vicinity. Moreover, hotel booking platforms like Booking[dot]com, Expedia, or Trivago allow travelers searching for accommodations using either their desired location as a search query or near a particular landmark. Since the popularity of different locations in a city varies, certain locations may get more queries than other locations. Thus, the exposure received by different establishments at these locations may be very different from their intrinsic quality as captured in their ratings. Today, many small businesses (shops, hotels, or restaurants) rely on such online platforms for attracting customers. Thus, receiving less exposure than that is expected can be unfavorable for businesses. It could have a negative impact on their revenue and potentially lead to economic starvation or even shutdown. By gathering and analyzing data from three popular platforms, we observe that many top-rated hotels and restaurants get less exposure vis-a-vis their quality, which could be detrimental for them. Following a meritocratic notion, we define and quantify such exposure disparity due to location-based searches on these platforms. We attribute this exposure disparity mainly to two kinds of biases -- Popularity Bias and Position Bias. Our experimental evaluation on multiple datasets reveals that although the platforms are doing well in delivering distance-based results, exposure disparity exists for individual businesses and needs to be reduced for business sustainability.
التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا