
Gittins theorem under uncertainty

Published by Tanut Treetanthiploet
Publication date: 2019
Research language: English





We study dynamic allocation problems for discrete-time multi-armed bandits under uncertainty, based on the theory of nonlinear expectations. We show that, under strong independence of the bandits and with some relaxation in the definition of optimality, a Gittins allocation index gives optimal choices. This involves studying the interaction of our uncertainty with controls which determine the filtration. We also run a simple numerical example which illustrates the interaction between the willingness to explore and the uncertainty aversion of the agent when making decisions.
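For orientation, the sketch below computes the classical Gittins index of a single finite-state Markov arm, that is, under an ordinary (linear) expectation. It is not the robust index constructed in the paper, which relies on nonlinear expectations. The function name `gittins_indices` and the inputs `P` (transition matrix), `r` (per-state rewards) and `beta` (discount factor) are assumptions made for illustration, and the method used is the standard largest-index-first recursion, swapped in here as a familiar baseline.

```python
import numpy as np

def gittins_indices(P, r, beta):
    """Classical Gittins indices (per-step reward units) for one Markov arm.

    Illustrative only: assumes a known transition matrix P, rewards r and a
    discount factor 0 < beta < 1. No model uncertainty is handled here.
    """
    P = np.asarray(P, dtype=float)
    r = np.asarray(r, dtype=float)
    n = len(r)
    nu = np.full(n, np.nan)

    # The state with the largest reward has index equal to that reward.
    first = int(np.argmax(r))
    nu[first] = r[first]
    cont = [first]  # states already ranked; play continues while inside them

    while len(cont) < n:
        # Keep only transitions into the continuation set; others mean "stop".
        Q = np.zeros_like(P)
        Q[:, cont] = P[:, cont]
        A = np.eye(n) - beta * Q
        d = np.linalg.solve(A, r)           # expected discounted reward until stopping
        b = np.linalg.solve(A, np.ones(n))  # expected discounted time until stopping
        ratio = d / b
        rest = [i for i in range(n) if i not in cont]
        best = max(rest, key=lambda i: ratio[i])
        nu[best] = ratio[best]
        cont.append(best)
    return nu

# Hypothetical two-state arm: state 0 pays nothing and moves to the
# absorbing state 1, which pays 1 per step.
indices = gittins_indices([[0.0, 1.0], [0.0, 1.0]], [0.0, 1.0], beta=0.9)
```

With several independent arms, the classical Gittins policy plays at each step the arm whose current state has the largest index. The abstract's result concerns when an analogous index rule remains optimal, in a relaxed sense, once the expectation is replaced by a nonlinear one.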




Read also

Silvana Pesenti, Qiuqi Wang, 2020
Optimization of distortion riskmetrics with distributional uncertainty has wide applications in finance and operations research. Distortion riskmetrics include many commonly applied risk measures and deviation measures, which are not necessarily monotone or convex. One of our central findings is a unifying result that allows us to convert an optimization of a non-convex distortion riskmetric with distributional uncertainty to a convex one, leading to great tractability. The key to the unifying equivalence result is the novel notion of closedness under concentration of sets of distributions. Our results include many special cases that are well studied in the optimization literature, including but not limited to optimizing probabilities, Value-at-Risk, Expected Shortfall, and Yaari's dual utility under various forms of distributional uncertainty. We illustrate our theoretical results via applications to portfolio optimization, optimization under moment constraints, and preference robust optimization.
We provide a characterization in terms of Fatou closedness for weakly closed monotone convex sets in the space of $\mathcal{P}$-quasisure bounded random variables, where $\mathcal{P}$ is a (possibly non-dominated) class of probability measures. Applications of our results lie within robu
We establish a generalization of Noether's theorem for stochastic optimal control problems. Exploiting the tools of jet bundles and contact geometry, we prove that from any (contact) symmetry of the Hamilton-Jacobi-Bellman equation associated to an optimal control problem it is possible to build a related local martingale. Moreover, we provide an application of the theoretical results to Merton's optimal portfolio problem, showing that this model admits infinitely many conserved quantities in the form of local martingales.
In a model-independent discrete-time financial market, we discuss the richness of the family of martingale measures in relation to different notions of Arbitrage, generated by a class $\mathcal{S}$ of significant sets, which we call Arbitrage de la classe $\mathcal{S}$. The choice of $\mathcal{S}$ reflects into the intrinsic properties of the class of polar sets of martingale measures. In particular: for $\mathcal{S}=\{\Omega\}$, absence of Model Independent Arbitrage is equivalent to the existence of a martingale measure; for $\mathcal{S}$ being the open sets, absence of Open Arbitrage is equivalent to the existence of full support martingale measures. These results are obtained by adopting a technical filtration enlargement and by constructing a universal aggregator of all arbitrage opportunities. We further introduce the notion of market feasibility and provide its characterization via arbitrage conditions. We conclude by providing a dual representation of Open Arbitrage in terms of weakly open sets of probability measures, which highlights the robust nature of this concept.
We study an intertemporal consumption and portfolio choice problem under Knightian uncertainty in which agents' preferences exhibit local intertemporal substitution. We also allow for market frictions in the sense that the pricing functional is nonlinear. We prove existence and uniqueness of the optimal consumption plan, and we derive a set of sufficient first-order conditions for optimality. With the help of a backward equation, we are able to determine the structure of optimal consumption plans. We obtain explicit solutions in a stationary setting in which the financial market has different risk premia for short and long positions.