ترغب بنشر مسار تعليمي؟ اضغط هنا

Visual Sensation and Perception Computational Models for Deep Learning: State of the art, Challenges and Prospects

364   0   0.0 ( 0 )
 نشر من قبل Bing Wei
 تاريخ النشر 2021
  مجال البحث الهندسة المعلوماتية
والبحث باللغة English




اسأل ChatGPT حول البحث

Visual sensation and perception refers to the process of sensing, organizing, identifying, and interpreting visual information in environmental awareness and understanding. Computational models inspired by visual perception have the characteristics of complexity and diversity, as they come from many subjects such as cognition science, information science, and artificial intelligence. In this paper, visual perception computational models oriented deep learning are investigated from the biological visual mechanism and computational vision theory systematically. Then, some points of view about the prospects of the visual perception computational models are presented. Finally, this paper also summarizes the current challenges of visual perception and predicts its future development trends. Through this survey, it will provide a comprehensive reference for research in this direction.



قيم البحث

اقرأ أيضاً

57 - Aref Hakimzadeh , Yanbo Xue , 2021
In Maurice Merleau-Pontys phenomenology of perception, analysis of perception accounts for an element of intentionality, and in effect therefore, perception and action cannot be viewed as distinct procedures. In the same line of thinking, Alva No{e} considers perception as a thoughtful activity that relies on capacities for action and thought. Here, by looking into psychology as a source of inspiration, we propose a computational model for the action involved in visual perception based on the notion of equilibrium as defined by Jean Piaget. In such a model, Piagets equilibrium reflects the minds status, which is used to control the observation process. The proposed model is built around a modified version of convolutional neural networks (CNNs) with enhanced filter performance, where characteristics of filters are adaptively adjusted via a high-level control signal that accounts for the thoughtful activity in perception. While the CNN plays the role of the visual system, the control signal is assumed to be a product of mind.
186 - M. I. Dyakonov 2012
This is a brief review of the experimental and theoretical quantum computing. The hopes for eventually building a useful quantum computer rely entirely on the so-called threshold theorem. In turn, this theorem is based on a number of assumptions, tre ated as axioms, i.e. as being satisfied exactly. Since in reality this is not possible, the prospects of scalable quantum computing will remain uncertain until the required precision, with which these assumptions should be approached, is established. Some related sociological aspects are also discussed. .
Motion perception is a critical capability determining a variety of aspects of insects life, including avoiding predators, foraging and so forth. A good number of motion detectors have been identified in the insects visual pathways. Computational mod elling of these motion detectors has not only been providing effective solutions to artificial intelligence, but also benefiting the understanding of complicated biological visual systems. These biological mechanisms through millions of years of evolutionary development will have formed solid modules for constructing dynamic vision systems for future intelligent machines. This article reviews the computational motion perception models originating from biological research of insects visual systems in the literature. These motion perception models or neural networks comprise the looming sensitive neuronal models of lobula giant movement detectors (LGMDs) in locusts, the translation sensitive neural systems of direction selective neurons (DSNs) in fruit flies, bees and locusts, as well as the small target motion detectors (STMDs) in dragonflies and hover flies. We also review the applications of these models to robots and vehicles. Through these modelling studies, we summarise the methodologies that generate different direction and size selectivity in motion perception. At last, we discuss about multiple systems integration and hardware realisation of these bio-inspired motion perception models.
Given its unchallenged capabilities in terms of sensitivity and spatial resolution, the combination of imaging spectropolarimetry and numeric Stokes inversion represents the dominant technique currently used to remotely sense the physical properties of the solar atmosphere and, in particular, its important driving magnetic field. Solar magnetism manifests itself in a wide range of spatial, temporal, and energetic scales. The ubiquitous but relatively small and weak fields of the so-called quiet Sun are believed today to be crucial for answering many open questions in solar physics, some of which have substantial practical relevance due to the strong Sun-Earth connection. However, such fields are very challenging to detect because they require spectropolarimetric measurements with high spatial (sub-arcsec), spectral (<100 mA), and temporal (<10 s) resolution along with high polarimetric sensitivity (<0.001 of the intensity). We collect and discuss both well-established and upcoming instrumental solutions developed during the last decades to push solar observations toward the above-mentioned parameter regime. This typically involves design trade-offs due to the high dimensionality of the data and signal-to-noise-ratio considerations, among others. We focus on the main three components that form a spectro-polarimeter, namely, wavelength discriminators, the devices employed to encode the incoming polarization state into intensity images (polarization modulators), and the sensor technologies used to register them. We consider the instrumental solutions introduced to perform this kind of measurements at different optical wavelengths and from various observing locations, i.e., ground-based, from the stratosphere or near space.
We introduce a new recurrent agent architecture and associated auxiliary losses which improve reinforcement learning in partially observable tasks requiring long-term memory. We employ a temporal hierarchy, using a slow-ticking recurrent core to allo w information to flow more easily over long time spans, and three fast-ticking recurrent cores with connections designed to create an information asymmetry. The emph{reaction} core incorporates new observations with input from the slow core to produce the agents policy; the emph{perception} core accesses only short-term observations and informs the slow core; lastly, the emph{prediction} core accesses only long-term memory. An auxiliary loss regularizes policies drawn from all three cores against each other, enacting the prior that the policy should be expressible from either recent or long-term memory. We present the resulting emph{Perception-Prediction-Reaction} (PPR) agent and demonstrate its improved performance over a strong LSTM-agent baseline in DMLab-30, particularly in tasks requiring long-term memory. We further show significant improvements in Capture the Flag, an environment requiring agents to acquire a complicated mixture of skills over long time scales. In a series of ablation experiments, we probe the importance of each component of the PPR agent, establishing that the entire, novel combination is necessary for this intriguing result.

الأسئلة المقترحة

التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا