No Arabic abstract
The turnpike phenomenon stipulates that the solution of an optimal control problem in large time, remains essentially close to a steady-state of the dynamics, itself being the optimal solution of an associated static optimal control problem. Under general assumptions, it is known that not only the optimal state and the optimal control, but also the adjoint state coming from the application of the Pontryagin maximum principle, are exponentially close to a steady-state, except at the beginning and at the end of the time frame. In such results, the turnpike set is a singleton, which is a steady-state. In this paper, we establish a turnpike result for finite-dimensional optimal control problems in which some of the coordinates evolve in a monotone way, and some others are partial steady-states of the dynamics. We prove that the discrepancy between the optimal trajectory and the turnpike set is then linear, but not exponential: we thus speak of a linear turnpike theorem.
Optimal control problems with a very large time horizon can be tackled with the Receding Horizon Control (RHC) method, which consists in solving a sequence of optimal control problems with small prediction horizon. The main result of this article is the proof of the exponential convergence (with respect to the prediction horizon) of the control generated by the RHC method towards the exact solution of the problem. The result is established for a class of infinite-dimensional linear-quadratic optimal control problems with time-independent dynamics and integral cost. Such problems satisfy the turnpike property: the optimal trajectory remains most of the time very close to the solution to the associated static optimization problem. Specific terminal cost functions, derived from the Lagrange multiplier associated with the static optimization problem, are employed in the implementation of the RHC method.
This paper is devoted to analysing the explicit slow decay rate and turnpike in the infinite-horizon linear quadratic optimal control problems for hyperbolic systems. Assume that some weak observability or controllability are satisfied, by which, the lower and upper bounds of the corresponding algebraic Riccati operator are estimated, respectively. Then based on these two bounds, the explicit slow decay rate of the closed-loop system with Riccati-based optimal feedback control is obtained. The averaged turnpike property for this problem is also further discussed. We then apply these results to the LQ optimal control problems constraint to networks of one-dimensional wave equations and also some multi-dimensional ones with local controls which lack of GCC(Geometric Control Condition).
The well-known Deficiency One Theorem gives structural conditions on a chemical reaction network under which, for any set of parameter values, the steady states of the corresponding mass action system may be easily characterized. It is also known, however, that mass action systems are not uniquely associated with reaction networks and that some representations may satisfy the Deficiency One Theorem while others may not. In this paper we present a mixed-integer linear programming framework capable of determining whether a given mass action system has a dynamically equivalent or linearly conjugate representation which has an underlying network satisfying the Deficiency One Theorem. This extends recent computational work determining linearly conjugate systems which are weakly reversible and have a deficiency of zero.
We study dynamic allocation problems for discrete time multi-armed bandits under uncertainty, based on the the theory of nonlinear expectations. We show that, under strong independence of the bandits and with some relaxation in the definition of optimality, a Gittins allocation index gives optimal choices. This involves studying the interaction of our uncertainty with controls which determine the filtration. We also run a simple numerical example which illustrates the interaction between the willingness to explore and uncertainty aversion of the agent when making decisions.
We investigate the kinetic Schrodinger problem, obtained considering Langevin dynamics instead of Brownian motion in Schrodingers thought experiment. Under a quasilinearity assumption we establish exponential entropic turnpike estimates for the corresponding Schrodinger bridges and exponentially fast convergence of the entropic cost to the sum of the marginal entropies in the long-time regime, which provides as a corollary an entropic Talagrand inequality. In order to do so, we profit from recent advances in the understanding of classical Schrodinger bridges and adaptations of Bakry-Emery formalism to the kinetic setting. Our quantitative results are complemented by basic structural results such as dual representation of the entropic cost and the existence of Schrodinger potentials.