ترغب بنشر مسار تعليمي؟ اضغط هنا

An improved variant of simulated annealing that converges under fast cooling

128   0   0.0 ( 0 )
 نشر من قبل Michael Choi
 تاريخ النشر 2019
  مجال البحث
والبحث باللغة English
 تأليف Michael C.H. Choi




اسأل ChatGPT حول البحث

Given a target function $U$ to minimize on a finite state space $mathcal{X}$, a proposal chain with generator $Q$ and a cooling schedule $T(t)$ that depends on time $t$, in this paper we study two types of simulated annealing (SA) algorithms with generators $M_{1,t}(Q,U,T(t))$ and $M_{2,t}(Q,U,T(t))$ respectively. While $M_{1,t}$ is the classical SA algorithm, we introduce a simple and improved variant that we call $M_{2,t}$ which provably converges faster. When $T(t) > c_{M_2}/log(t+1)$ follows the logarithmic cooling schedule, our proposed algorithm is strongly ergodic both in total variation and in relative entropy, and converges to the set of global minima, where $c_{M_2}$ is a constant that we explicitly identify. If $c_{M_1}$ is the optimal hill-climbing constant that appears in logarithmic cooling of $M_{1,t}$, we show that $c_{M_1} geq c_{M_2}$ and give simple conditions under which $c_{M_1} > c_{M_2}$. Our proposed $M_{2,t}$ thus converges under a faster logarithmic cooling in this regime. The other situation that we investigate corresponds to $c_{M_1} > c_{M_2} = 0$, where we give a class of fast and non-logarithmic cooling schedule that works for $M_{2,t}$ (but not for $M_{1,t}$). In addition to these asymptotic convergence results, we compare and analyze finite-time behaviour between these two annealing algorithms as well. Finally, we present two algorithms to simulate $M_{2,t}$.

قيم البحث

اقرأ أيضاً

Using classical simulated annealing to maximise a function $psi$ defined on a subset of $R^d$, the probability $p(psi(theta_n)leq psi_{max}-epsilon)$ tends to zero at a logarithmic rate as $n$ increases; here $theta_n$ is the state in the $n$-th stag e of the simulated annealing algorithm and $psi_{max}$ is the maximal value of $psi$. We propose a modified scheme for which this probability is of order $n^{-1/3}log n$, and hence vanishes at an algebraic rate. To obtain this faster rate, the exponentially decaying acceptance probability of classical simulated annealing is replaced by a more heavy-tailed function, and the system is cooled faster. We also show how the algorithm may be applied to functions that cannot be computed exactly but only approximated, and give an example of maximising the log-likelihood function for a state-space model.
169 - I. Pavlyukevich 2007
We solve a problem of non-convex stochastic optimisation with help of simulated annealing of Levy flights of a variable stability index. The search of the ground state of an unknown potential is non-local due to big jumps of the Levy flights process. The convergence to the ground state is fast due to a polynomial decrease rate of the temperature.
Finding a ground state of a given Hamiltonian on a graph $G=(V,E)$ is an important but hard problem. One of the potential methods is to use a Markov chain Monte Carlo to sample the Gibbs distribution whose highest peaks correspond to the ground state s. In this short paper, we investigate the stochastic cellular automata, in which all spins are updated independently and simultaneously. We prove that (i) if the temperature is sufficiently high and fixed, then the mixing time is at most of order $log|V|$, and that (ii) if the temperature drops in time $n$ as $1/log n$, then the limiting measure is uniformly distributed over the ground states.
We propose a new stochastic algorithm (generalized simulated annealing) for computationally finding the global minimum of a given (not necessarily convex) energy/cost function defined in a continuous D-dimensional space. This algorithm recovers, as p articular cases, the so called classical (Boltzmann machine) and fast (Cauchy machine) simulated annealings, and can be quicker than both. Key-words: simulated annealing; nonconvex optimization; gradient descent; generalized statistical mechanics.
We consider the minimization problem of $phi$-divergences between a given probability measure $P$ and subsets $Omega$ of the vector space $mathcal{M}_mathcal{F}$ of all signed finite measures which integrate a given class $mathcal{F}$ of bounded or u nbounded measurable functions. The vector space $mathcal{M}_mathcal{F}$ is endowed with the weak topology induced by the class $mathcal{F}cup mathcal{B}_b$ where $mathcal{B}_b$ is the class of all bounded measurable functions. We treat the problems of existence and characterization of the $phi$-projections of $P$ on $Omega$. We consider also the dual equality and the dual attainment problems when $Omega$ is defined by linear constraints.
التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا