ترغب بنشر مسار تعليمي؟ اضغط هنا

Where Do All These Search Terms Come From? - Two Experiments in Domain-Specific Search

69   0   0.0 ( 0 )
 نشر من قبل Daniel Hienert
 تاريخ النشر 2018
  مجال البحث الهندسة المعلوماتية
والبحث باللغة English




اسأل ChatGPT حول البحث

Within a search session users often apply different search terms, as well as different variations and combinations of them. This way, they want to make sure that they find relevant information for different stages and aspects of their information task. Research questions which arise from this search ap- proach are: Where do users get all the ideas, hints and suggestions for new search terms or their variations from? How many ideas come from the user? How many from outside the IR system? What is the role of the used search sys- tem? To investigate these questions we used data from two experiments: first, from a user study with eye tracking data; second, from a large-scale log analy- sis. We found that in both experiments a large part of the search terms has been explicitly seen or shown before on the interface of the search system.



قيم البحث

اقرأ أيضاً

Academic search engines allow scientists to explore related work relevant to a given query. Often, the user is also aware of the aspect to retrieve a relevant document. In such cases, existing search engines can be used by expanding the query with te rms describing that aspect. However, this approach does not guarantee good results since plain keyword matches do not always imply relevance. To address this issue, we define and solve a novel academic search task, called aspect-based retrieval, which allows the user to specify the aspect along with the query to retrieve a ranked list of relevant documents. The primary idea is to estimate a language model for the aspect as well as the query using a domain-specific knowledge base and use a mixture of the two to determine the relevance of the article. Our evaluation of the results over the Open Research Corpus dataset shows that our method outperforms keyword-based expansion of query with aspect with and without relevance feedback.
Since its emergence in the 1990s the World Wide Web (WWW) has rapidly evolved into a huge mine of global information and it is growing in size everyday. The presence of huge amount of resources on the Web thus poses a serious problem of accurate sear ch. This is mainly because todays Web is a human-readable Web where information cannot be easily processed by machine. Highly sophisticated, efficient keyword based search engines that have evolved today have not been able to bridge this gap. So comes up the concept of the Semantic Web which is envisioned by Tim Berners-Lee as the Web of machine interpretable information to make a machine processable form for expressing information. Based on the semantic Web technologies we present in this paper the design methodology and development of a semantic Web search engine which provides exact search results for a domain specific search. This search engine is developed for an agricultural Website which hosts agricultural information about the state of West Bengal.
We discuss the origin of the anti-helium-3 and -4 events possibly detected by AMS-02. Using up-to-date semi-analytical tools, we show that spallation from primary hydrogen and helium nuclei onto the ISM predicts a $overline{{}^3{rm He}}$ flux typical ly one to two orders of magnitude below the sensitivity of AMS-02 after 5 years, and a $overline{{}^4{rm He}}$ flux roughly 5 orders of magnitude below the AMS-02 sensitivity. We argue that dark matter annihilations face similar difficulties in explaining this event. We then entertain the possibility that these events originate from anti-matter-dominated regions in the form of anti-clouds or anti-stars. In the case of anti-clouds, we show how the isotopic ratio of anti-helium nuclei might suggest that BBN has happened in an inhomogeneous manner, resulting in anti-regions with a anti-baryon-to-photon ratio $bar{eta}simeq10^{-3}eta$. We discuss properties of these regions, as well as relevant constraints on the presence of anti-clouds in our Galaxy. We present constraints from the survival of anti-clouds in the Milky-Way and in the early Universe, as well as from CMB, gamma-ray and cosmic-ray observations. In particular, these require the anti-clouds to be almost free of normal matter. We also discuss an alternative where anti-domains are dominated by surviving anti-stars. We suggest that part of the unindentified sources in the 3FGL catalog can originate from anti-clouds or anti-stars. AMS-02 and GAPS data could further probe this scenario.
Information overload is a prevalent challenge in many high-value domains. A prominent case in point is the explosion of the biomedical literature on COVID-19, which swelled to hundreds of thousands of papers in a matter of months. In general, biomedi cal literature expands by two papers every minute, totalling over a million new papers every year. Search in the biomedical realm, and many other vertical domains is challenging due to the scarcity of direct supervision from click logs. Self-supervised learning has emerged as a promising direction to overcome the annotation bottleneck. We propose a general approach for vertical search based on domain-specific pretraining and present a case study for the biomedical domain. Despite being substantially simpler and not using any relevance labels for training or development, our method performs comparably or better than the best systems in the official TREC-COVID evaluation, a COVID-related biomedical search competition. Using distributed computing in modern cloud infrastructure, our system can scale to tens of millions of articles on PubMed and has been deployed as Microsoft Biomedical Search, a new search experience for biomedical literature: https://aka.ms/biomedsearch.
Grain growth during star formation affects the physical and chemical processes in the evolution of star-forming clouds. We investigate the origin of the millimeter (mm)-sized grains recently observed in Class I protostellar envelopes. We use the coag ulation model developed in our previous paper and find that a hydrogen number density of as high as $10^{10}~{rm cm^{-3}}$, instead of the typical density $10^5~{rm cm^{-3}}$, is necessary for the formation of mm-sized grains. Thus, we test a hypothesis that such large grains are transported to the envelope from the inner, denser parts, finding that gas drag by outflow efficiently launches the large grains as long as the central object has not grown to $gtrsim 0.1$ M$_{odot}$. By investigating the shattering effect on the mm-sized grains, we ensure that the large grains are not significantly fragmented after being injected in the envelope. We conclude that the mm-sized grains observed in the protostellar envelopes are not formed in the envelopes but formed in the inner parts of the star-forming regions and transported to the envelopes before a significant mass growth of the central object, and that they survive in the envelopes.
التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا