ترغب بنشر مسار تعليمي؟ اضغط هنا

A Reinforcement Learning Approach for an IRS-assisted NOMA Network

98   0   0.0 ( 0 )
 نشر من قبل Ximing Xie
 تاريخ النشر 2021
  مجال البحث هندسة إلكترونية
والبحث باللغة English




اسأل ChatGPT حول البحث

This letter investigates a sum rate maximizationproblem in an intelligent reflective surface (IRS) assisted non-orthogonal multiple access (NOMA) downlink network. Specif-ically, the sum rate of all the users is maximized by jointlyoptimizing the beams at the base station and the phase shiftat the IRS. The deep reinforcement learning (DRL), which hasachieved massive successes, is applied to solve this sum ratemaximization problem. In particular, an algorithm based on thedeep deterministic policy gradient (DDPG) is proposed. Both therandom channel case and the fixed channel case are studied inthis letter. The simulation result illustrates that the DDPG basedalgorithm has the competitive performance on both case.



قيم البحث

اقرأ أيضاً

This paper investigates the application of deep deterministic policy gradient (DDPG) to intelligent reflecting surface (IRS) based unmanned aerial vehicles (UAV) assisted non-orthogonal multiple access (NOMA) downlink networks. The deployment of the UAV equipped with an IRS is important, as the UAV increases the flexibility of the IRS significantly, especially for the case of users who have no line of sight (LoS) path to the base station (BS). Therefore, the aim of this letter is to maximize the sum rate by jointly optimizing the power allocation of the BS, the phase shifting of the IRS and the horizontal position of the UAV. Because the formulated problem is not convex, the DDPG algorithm is utilized to solve it. The computer simulation results are provided to show the superior performance of the proposed DDPG based algorithm.
A novel non-orthogonal multiple access (NOMA) based cache-aided mobile edge computing (MEC) framework is proposed. For the purpose of efficiently allocating communication and computation resources to users computation tasks requests, we propose a lon g-short-term memory (LSTM) network to predict the task popularity. Based on the predicted task popularity, a long-term reward maximization problem is formulated that involves a joint optimization of the task offloading decisions, computation resource allocation, and caching decisions. To tackle this challenging problem, a single-agent Q-learning (SAQ-learning) algorithm is invoked to learn a long-term resource allocation strategy. Furthermore, a Bayesian learning automata (BLA) based multi-agent Q-learning (MAQ-learning) algorithm is proposed for task offloading decisions. More specifically, a BLA based action select scheme is proposed for the agents in MAQ-learning to select the optimal action in every state. We prove that the BLA based action selection scheme is instantaneously self-correcting and the selected action is an optimal solution for each state. Extensive simulation results demonstrate that: 1) The prediction error of the proposed LSTMs based task popularity prediction decreases with increasing learning rate. 2) The proposed framework significantly outperforms the benchmarks like all local computing, all offloading computing, and non-cache computing. 3) The proposed BLA based MAQ-learning achieves an improved performance compared to conventional reinforcement learning algorithms.
A dual-polarized intelligent reflecting surface (IRS) can contribute to a better multiplexing of interfering wireless users. In this paper, we use this feature to improve the performance of dual-polarized massive multiple-input multiple-output (MIMO) with non-orthogonal multiple access (NOMA) under imperfect successive interference cancellation (SIC). By considering the downlink of a multi-cluster scenario, the IRSs assist the base station (BS) to multiplex subsets of users in the polarization domain. Our novel strategy alleviates the impact of imperfect SIC and enables users to exploit polarization diversity with near-zero inter-subset interference. To this end, the IRSs are optimized to mitigate transmissions originated at the BS from the interfering polarization. The formulated optimization is transformed into quadratic constrained quadratic sub-problems, which makes it possible to obtain the optimal solution via interior-points methods. We also derive analytically a closed-form expression for the users ergodic rates by considering large numbers of reflecting elements. This is followed by representative simulation examples and comprehensive discussions. The results show that when the IRSs are large enough, the proposed scheme always outperforms conventional massive MIMO-NOMA and MIMO-OMA systems even if SIC error propagation is present. It is also confirmed that dual-polarized IRSs can make cross-polar transmissions beneficial to the users, allowing them to improve their performance through diversity.
In this paper, the appealing features of a dual-polarized intelligent reflecting surface (IRS) are exploited to improve the performance of dual-polarized massive multiple-input multiple-output (MIMO) with non-orthogonal multiple access (NOMA) under i mperfect successive interference cancellation (SIC). By considering the downlink of a multi-cluster scenario, the IRSs assist the base station (BS) to multiplex subsets of users in the polarization domain. Our novel strategy alleviates the impact of imperfect SIC and enables users to exploit polarization diversity with near-zero inter-subset interference. Our results show that when the IRSs are large enough, the proposed scheme always outperforms conventional massive MIMO-NOMA and MIMO-OMA systems even if SIC error propagation is present. It is also confirmed that dual-polarized IRSs can make cross-polar transmissions beneficial to the users, allowing them to improve their performance through polarization diversity.
134 - X. Gao , Y. Liu , X. Liu 2021
A novel framework of intelligent reflecting surface (IRS)-aided multiple-input single-output (MISO) non-orthogonal multiple access (NOMA) network is proposed, where a base station (BS) serves multiple clusters with unfixed number of users in each clu ster. The goal is to maximize the sum rate of all users by jointly optimizing the passive beamforming vector at the IRS, decoding order, power allocation coefficient vector and number of clusters, subject to the rate requirements of users. In order to tackle the formulated problem, a three-step approach is proposed. More particularly, a long short-term memory (LSTM) based algorithm is first adopted for predicting the mobility of users. Secondly, a K-means based Gaussian mixture model (K-GMM) algorithm is proposed for user clustering. Thirdly, a deep Q-network (DQN) based algorithm is invoked for jointly determining the phase shift matrix and power allocation policy. Simulation results are provided for demonstrating that the proposed algorithm outperforms the benchmarks, while the throughput gain of 35% can be achieved by invoking NOMA technique instead of orthogonal multiple access (OMA).
التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا