ترغب بنشر مسار تعليمي؟ اضغط هنا

Adaptive Execution: Exploration and Learning of Price Impact

96   0   0.0 ( 0 )
 نشر من قبل Beomsoo Park
 تاريخ النشر 2012
  مجال البحث مالية
والبحث باللغة English




اسأل ChatGPT حول البحث

We consider a model in which a trader aims to maximize expected risk-adjusted profit while trading a single security. In our model, each price change is a linear combination of observed factors, impact resulting from the traders current and prior activity, and unpredictable random effects. The trader must learn coefficients of a price impact model while trading. We propose a new method for simultaneous execution and learning - the confidence-triggered regularized adaptive certainty equivalent (CTRACE) policy - and establish a poly-logarithmic finite-time expected regret bound. This bound implies that CTRACE is efficient in the sense that the ({epsilon},{delta})-convergence time is bounded by a polynomial function of 1/{epsilon} and log(1/{delta}) with high probability. In addition, we demonstrate via Monte Carlo simulation that CTRACE outperforms the certainty equivalent policy and a recently proposed reinforcement learning algorithm that is designed to explore efficiently in linear-quadratic control problems.



قيم البحث

اقرأ أيضاً

Executing a basket of co-integrated assets is an important task facing investors. Here, we show how to do this accounting for the informational advantage gained from assets within and outside the basket, as well as for the permanent price impact of m arket orders (MOs) from all market participants, and the temporary impact that the agents MOs have on prices. The execution problem is posed as an optimal stochastic control problem and we demonstrate that, under some mild conditions, the value function admits a closed-form solution, and prove a verification theorem. Furthermore, we use data of five stocks traded in the Nasdaq exchange to estimate the model parameters and use simulations to illustrate the performance of the strategy. As an example, the agent liquidates a portfolio consisting of shares in Intel Corporation (INTC) and Market Vectors Semiconductor ETF (SMH). We show that including the information provided by three additional assets, FARO Technologies (FARO), NetApp (NTAP) and Oracle Corporation (ORCL), considerably improves the strategys performance; for the portfolio we execute, it outperforms the multi-asset version of Almgren-Chriss by approximately 4 to 4.5 basis points.
In light of micro-scale inefficiencies induced by the high degree of fragmentation of the Bitcoin trading landscape, we utilize a granular data set comprised of orderbook and trades data from the most liquid Bitcoin markets, in order to understand th e price formation process at sub-1 second time scales. To achieve this goal, we construct a set of features that encapsulate relevant microstructural information over short lookback windows. These features are subsequently leveraged first to generate a leader-lagger network that quantifies how markets impact one another, and then to train linear models capable of explaining between 10% and 37% of total variation in $500$ms future returns (depending on which market is the prediction target). The results are then compared with those of various PnL calculations that take trading realities, such as transaction costs, into account. The PnL calculations are based on natural $textit{taker}$ strategies (meaning they employ market orders) that we associate to each model. Our findings emphasize the role of a markets fee regime in determining its propensity to being a leader or a lagger, as well as the profitability of our taker strategy. Taking our analysis further, we also derive a natural $textit{maker}$ strategy (i.e., one that uses only passive limit orders), which, due to the difficulties associated with backtesting maker strategies, we test in a real-world live trading experiment, in which we turned over 1.5 million USD in notional volume. Lending additional confidence to our models, and by extension to the features they are based on, the results indicate a significant improvement over a naive benchmark strategy, which we also deploy in a live trading environment with real capital, for the sake of comparison.
We analyze a proprietary dataset of trades by a single asset manager, comparing their price impact with that of the trades of the rest of the market. In the context of a linear propagator model we find no significant difference between the two, sugge sting that both the magnitude and time dependence of impact are universal in anonymous, electronic markets. This result is important as optimal execution policies often rely on propagators calibrated on anonymous data. We also find evidence that in the wake of a trade the order flow of other market participants first adds further copy-cat trades enhancing price impact on very short time scales. The induced order flow then quickly inverts, thereby contributing to impact decay.
We revisit the epsilon-intelligence model of Toth et al.(2011), that was proposed as a minimal framework to understand the square-root dependence of the impact of meta-orders on volume in financial markets. The basic idea is that most of the daily li quidity is latent and furthermore vanishes linearly around the current price, as a consequence of the diffusion of the price itself. However, the numerical implementation of Toth et al. was criticised as being unrealistic, in particular because all the intelligence was conferred to market orders, while limit orders were passive and random. In this work, we study various alternative specifications of the model, for example allowing limit orders to react to the order flow, or changing the execution protocols. By and large, our study lends strong support to the idea that the square-root impact law is a very generic and robust property that requires very few ingredients to be valid. We also show that the transition from super-diffusion to sub-diffusion reported in Toth et al. is in fact a cross-over, but that the original model can be slightly altered in order to give rise to a genuine phase transition, which is of interest on its own. We finally propose a general theoretical framework to understand how a non-linear impact may appear even in the limit where the bias in the order flow is vanishingly small.
We propose a dynamical theory of market liquidity that predicts that the average supply/demand profile is V-shaped and {it vanishes} around the current price. This result is generic, and only relies on mild assumptions about the order flow and on the fact that prices are (to a first approximation) diffusive. This naturally accounts for two striking stylized facts: first, large metaorders have to be fragmented in order to be digested by the liquidity funnel, leading to long-memory in the sign of the order flow. Second, the anomalously small local liquidity induces a breakdown of linear response and a diverging impact of small orders, explaining the square-root impact law, for which we provide additional empirical support. Finally, we test our arguments quantitatively using a numerical model of order flow based on the same minimal ingredients.
التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا