
Dynamics of feed forward induced interference training

Added by Shirui Tang
Publication date: 2020
Language: English
Authors: Shirui Tang





Perceptron model updating with back-propagation has become the routine of deep learning: a complete feed-forward pass is required for the backward pass to function properly. Doubting the physical interpretation that this routine explanation imposes on transformer-based models such as GPT, a new training method is proposed in order to keep the physics self-consistent. The GPT model is treated as a space-time diagram, the worldlines of signals are traced, and the possible signal paths required for a self-attention event to occur are identified. With a slight modification, self-attention can be viewed as an Ising-model interaction, which allows the training objective to be expressed as the energy of the system. The target is treated as an external magnetic field that induces the signals, modelled as magnetic dipoles. A probability network is designed to pilot input signals travelling for different durations through different routes, and a rule for updating these probabilities is designed so that constructive interference forms at the target locations, maximising the instantaneous energy. An experiment was conducted on a 4-class classification problem extracted from MNIST. The results exhibit interesting but expected behaviours that do not appear in a back-propagation-updated network and resemble learning in a real human, especially in the few-shot scenario.
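
The abstract gives no implementation details, so the following Python sketch is only a hedged toy of the stated idea, with made-up quantities: signals on a handful of hypothetical routes carry Ising spins, the target acts as an external field, and the routing probabilities are multiplicatively reinforced so that routes aligned with the field (constructive interference) dominate, raising the instantaneous energy. The number of routes, the exponential update, and the learning rate are assumptions, not the paper's rule.

import numpy as np

# Hedged toy of the abstract's idea (not the paper's algorithm): signals are
# modelled as spins s_i in {-1, +1}, the target as an external field h, and a
# probability vector p over hypothetical routes/durations is multiplicatively
# reinforced so that routes aligned with the field (constructive interference)
# dominate, raising the instantaneous Ising-like energy
#   E(p) = sum_i p_i * h * s_i   (pairwise interaction terms omitted).

rng = np.random.default_rng(0)

n_routes = 8                            # assumed number of signal routes
s = rng.choice([-1, 1], size=n_routes)  # spin carried by the signal on each route
h = 1.0                                 # external field induced by the target
p = np.full(n_routes, 1.0 / n_routes)   # uniform routing probabilities

lr = 0.5                                # assumed reinforcement rate
for step in range(100):
    alignment = h * s                   # per-route contribution to the energy
    p = p * np.exp(lr * alignment)      # reinforce constructive routes
    p /= p.sum()                        # keep p a probability distribution

print("routing probabilities:", np.round(p, 3))
print("instantaneous energy :", float(p @ (h * s)))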



Related Research

We design and model a single-layer, passive, all-optical silicon photonics neural network to mitigate optical link nonlinearities. The network nodes are formed by silicon microring resonators whose transfer function has been experimentally measured. Both the transmitted amplitude and phase maps of the nonlinear response of the microrings are parametrized as a function of the wavelength and of the signal power to form tunable activation functions of the single nodes in the complex-valued network. Training of the network is achieved by a particle swarm optimizer, which selects the complex weights and the activation functions. We demonstrate that a single feed-forward layer with a single-node perceptron is effective in compensating linear and nonlinear distortions over a broad range of signal-to-noise ratios and propagation lengths. We propose to implement this simple neural network as a transparent layer in the optical link to correct signal distortions.
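
The network itself relies on experimentally measured microring transfer functions, which cannot be reproduced here; the Python sketch below only illustrates the training idea under simplifying assumptions: a single complex-weight node with a stand-in saturating activation, fitted by a plain particle swarm optimizer to undo a synthetic cubic distortion of QPSK symbols. The channel model, activation, and parameter values are illustrative assumptions, not the authors' setup.

import numpy as np

# Illustrative stand-in only: a single complex-weight node with an assumed
# amplitude-saturating activation, trained by a plain particle swarm optimizer
# (PSO) to equalize a synthetic cubic distortion of QPSK symbols.

rng = np.random.default_rng(1)
x = (rng.choice([-1, 1], 400) + 1j * rng.choice([-1, 1], 400)) / np.sqrt(2)  # QPSK symbols
y = x + 0.3j * np.abs(x) ** 2 * x      # toy nonlinear link distortion

def node(params, y):
    w = params[0] + 1j * params[1]     # complex weight
    bias = params[2]                   # activation tuning parameter (assumed)
    z = w * y
    return np.tanh(np.abs(z) + bias) * np.exp(1j * np.angle(z))

def loss(params):
    return np.mean(np.abs(node(params, y) - x) ** 2)

n_particles, dim = 30, 3
pos = rng.uniform(-1, 1, (n_particles, dim))
vel = np.zeros_like(pos)
pbest, pbest_val = pos.copy(), np.array([loss(p) for p in pos])
gbest = pbest[pbest_val.argmin()].copy()

for it in range(200):
    r1, r2 = rng.random((2, n_particles, dim))
    vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
    pos = pos + vel
    vals = np.array([loss(p) for p in pos])
    better = vals < pbest_val
    pbest[better], pbest_val[better] = pos[better], vals[better]
    gbest = pbest[pbest_val.argmin()].copy()

print("mean squared error after equalization:", loss(gbest))
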
Feed-forward layers constitute two-thirds of a transformer model's parameters, yet their role in the network remains under-explored. We show that feed-forward layers in transformer-based language models operate as key-value memories, where each key correlates with textual patterns in the training examples and each value induces a distribution over the output vocabulary. Our experiments show that the learned patterns are human-interpretable, and that lower layers tend to capture shallow patterns while upper layers learn more semantic ones. The values complement the keys' input patterns by inducing output distributions that concentrate probability mass on tokens likely to appear immediately after each pattern, particularly in the upper layers. Finally, we demonstrate that the output of a feed-forward layer is a composition of its memories, which is subsequently refined throughout the model's layers via residual connections to produce the final output distribution.
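
A minimal numerical sketch of this key-value memory reading of a feed-forward layer follows (the dimensions, the ReLU activation, and the tied output embedding are assumptions for illustration, not taken from any particular model): the first projection holds keys that fire on input patterns, the second holds values, and the layer output is the activation-weighted composition of value vectors, added back through the residual connection before the vocabulary projection.

import numpy as np

# Schematic of the key-value memory view; shapes and weights are random
# placeholders used only to show the data flow.

rng = np.random.default_rng(0)
d_model, d_ff, vocab = 16, 64, 100

K = rng.normal(size=(d_ff, d_model))   # keys: one pattern detector per memory cell
V = rng.normal(size=(d_ff, d_model))   # values: one output direction per memory cell
E = rng.normal(size=(vocab, d_model))  # output embedding used to read off token logits

def ff_layer(x):
    m = np.maximum(0.0, K @ x)         # memory coefficients: how strongly each key fires
    return V.T @ m                     # composition of the active values

x = rng.normal(size=d_model)           # residual-stream vector at one position
logits = E @ (x + ff_layer(x))         # residual connection, then vocabulary projection
probs = np.exp(logits - logits.max())
probs /= probs.sum()

print("most active memory cell:", int(np.argmax(np.maximum(0.0, K @ x))))
print("token favoured by the composed memories:", int(probs.argmax()))
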
Both photonic quantum computation and the establishment of a quantum internet require fiber-based measurement and feed-forward in order to be compatible with existing infrastructure. Here we present a fiber-compatible scheme for measurement and feed-forward, whose performance is benchmarked by carrying out remote preparation of single-photon polarization states at telecom wavelengths. The result of a projective measurement on one photon deterministically controls the path a second photon takes with ultrafast optical switches. By placing well-calibrated bulk passive polarization optics in the paths, we achieve a measurement and feed-forward fidelity of $(99.0 \pm 1)\%$, after correcting for other experimental errors. Our methods are useful for photonic quantum experiments including computing, communication, and teleportation.
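
As a rough illustration of the measurement-and-feed-forward logic only (no optics, loss, or switch timing is modelled, and the states and corrections below are assumptions for the sketch), the following Python snippet simulates remote state preparation on a Bell pair: the sender's measurement outcome selects one of two pre-calibrated unitaries, standing in for the passive polarization optics placed in the two paths.

import numpy as np

# Qubit-level sketch of measurement and feed-forward for remote state
# preparation; states, basis, and corrections are illustrative assumptions.

def ket_perp(k):
    """State orthogonal to a single-qubit ket."""
    return np.array([-np.conj(k[1]), np.conj(k[0])])

# target polarization state to prepare remotely: cos(t)|H> + e^{ip} sin(t)|V>
t, p = 0.7, 1.1
target = np.array([np.cos(t), np.exp(1j * p) * np.sin(t)])

bell = np.array([1, 0, 0, 1]) / np.sqrt(2)                 # |Phi+> in the basis |HH>,|HV>,|VH>,|VV>
meas_basis = [np.conj(target), ket_perp(np.conj(target))]  # sender's measurement basis

for outcome, m in enumerate(meas_basis):
    # project the sender's qubit onto |m>; read off the receiver's conditional state
    b = np.array([np.vdot(m, bell[[0, 2]]), np.vdot(m, bell[[1, 3]])])
    b = b / np.linalg.norm(b)
    # pre-calibrated correction for this outcome (the "passive optics" in each path)
    U = np.outer(target, np.conj(b)) + np.outer(ket_perp(target), np.conj(ket_perp(b)))
    fidelity = abs(np.vdot(target, U @ b)) ** 2
    print(f"outcome {outcome}: fidelity after feed-forward = {fidelity:.6f}")
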
Kentaro Imafuku, 2018
Toward an alternative approach to the quantum-mechanical ground-state search, we theoretically introduce a protocol in which the energy of two identical systems is deterministically exchanged. The protocol utilizes quantum interference between forward and backward time-evolved states with respect to a given Hamiltonian. In addition, to make use of the protocol for the ground-state search, we construct a network with which we may be able to efficiently apply the protocol successively among multiple systems so that the energy of one of them gradually approaches the lowest one. Although a rigorous analysis of the validity of the network is left as a future challenge, some properties of the network are also investigated.
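
The exchange protocol and the network of successive applications are not spelled out in the abstract, so the Python sketch below illustrates only one ingredient under assumed parameters: the interference overlap between forward- and backward-evolved copies of a state, g(t) = <psi|e^{-2iHt}|psi>, whose frequency content reveals the energies populated by the state, including the ground-state energy when it is occupied.

import numpy as np

# Toy numerical illustration with a random 6-level "Hamiltonian" (an assumed
# stand-in, not the paper's construction): the overlap between backward- and
# forward-evolved copies of a state,
#   g(t) = <psi| e^{-2iHt} |psi> = sum_n |c_n|^2 exp(-2i E_n t),
# oscillates at twice the energies populated by |psi>, so a Fourier analysis
# of this interference signal exposes those energies.

rng = np.random.default_rng(4)

dim = 6
A = rng.normal(size=(dim, dim))
H = (A + A.T) / 2                              # random Hermitian Hamiltonian
evals, evecs = np.linalg.eigh(H)

psi = rng.normal(size=dim) + 1j * rng.normal(size=dim)
psi /= np.linalg.norm(psi)
c = evecs.conj().T @ psi                       # expansion coefficients in the eigenbasis

def evolve(sign, t):
    """Return e^{sign * iHt} |psi> (sign = -1 forward, +1 backward)."""
    return evecs @ (np.exp(sign * 1j * evals * t) * c)

times = np.linspace(0, 40, 4000)
g = np.array([np.vdot(evolve(+1, t), evolve(-1, t)) for t in times])

freqs = np.fft.fftfreq(len(times), d=times[1] - times[0]) * 2 * np.pi
peak_energy = -freqs[np.argmax(np.abs(np.fft.fft(g)))] / 2

print("eigenvalue with the largest population :", evals[np.argmax(np.abs(c) ** 2)])
print("energy recovered from the interference :", peak_energy)
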
Any theory amenable to scientific inquiry must have testable consequences. This minimal criterion is uniquely challenging for the study of consciousness, as we do not know if it is possible to confirm via observation from the outside whether or not a physical system knows what it feels like to have an inside - a challenge referred to as the hard problem of consciousness. To arrive at a theory of consciousness, the hard problem has motivated the development of phenomenological approaches that adopt assumptions of what properties consciousness has based on first-hand experience and, from these, derive the physical processes that give rise to these properties. A leading theory adopting this approach is Integrated Information Theory (IIT), which assumes our subjective experience is a unified whole, subsequently yielding a requirement for physical feedback as a necessary condition for consciousness. Here, we develop a mathematical framework to assess the validity of this assumption by testing it in the context of isomorphic physical systems with and without feedback. The isomorphism allows us to isolate changes in $\Phi$ without affecting the size or functionality of the original system. Indeed, we show that the only mathematical difference between a conscious system with $\Phi>0$ and an isomorphic philosophical zombie with $\Phi=0$ is a permutation of the binary labels used to internally represent functional states. This implies $\Phi$ is sensitive to functionally arbitrary aspects of a particular labeling scheme, with no clear justification in terms of phenomenological differences. In light of this, we argue any quantitative theory of consciousness, including IIT, should be invariant under isomorphisms if it is to avoid the existence of isomorphic philosophical zombies and the epistemological problems they pose.
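
The isomorphism argument can be illustrated without computing $\Phi$ itself; the short Python sketch below (a hypothetical 3-bit system with a random transition table and a random label permutation, chosen purely for illustration) checks that the relabeled system T' = pi o T o pi^{-1} reproduces the original dynamics exactly once the permutation is undone, i.e. the two systems are functionally identical and differ only in their internal binary labels.

import numpy as np

# A hypothetical 3-bit deterministic system and a random relabeling, used only
# to illustrate what "isomorphic" means here: the relabeled transition table
# T' = pi o T o pi^{-1} generates exactly the same dynamics as T once the
# permutation of binary labels is undone, so the two systems are functionally
# identical even though label-dependent quantities may differ between them.

rng = np.random.default_rng(5)

n_states = 8                                   # 2^3 states of three binary units
T = rng.integers(0, n_states, size=n_states)   # original transition table
pi = rng.permutation(n_states)                 # permutation of the binary labels
pi_inv = np.argsort(pi)
T_prime = pi[T[pi_inv]]                        # T' = pi o T o pi^{-1}

state, relabeled = 3, int(pi[3])
for _ in range(10):
    state, relabeled = int(T[state]), int(T_prime[relabeled])
    assert int(pi[state]) == relabeled         # trajectories agree under pi

print("the two systems follow identical trajectories up to the relabeling pi")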
