Abstract— The rapid growth of data traffic has pushed the mobile telecommunication industry towards the adoption of fifth generation (5G) communications. Cloud radio access network (CRAN), one of the key 5G enablers, facilitates fine-grained management of network resources by separating the remote radio head (RRH) from the baseband unit (BBU) via a high-speed front-haul link. Classical resource allocation (RA) schemes rely on numerical techniques to optimize various performance metrics. Most of these works can be regarded as instantaneous, since the optimization decisions are derived from the current network state without considering past network states. While utility theory can incorporate long-term optimization effects into these decisions, the growing heterogeneity and complexity of network environments have rendered the RA problem intractable. One prospective candidate is reinforcement learning (RL), a dynamic programming framework which solves RA problems optimally over varying network states. Still, such methods cannot handle the high-dimensional state-action spaces that arise in CRAN problems. Driven by the success of machine learning, researchers have begun to explore the potential of deep reinforcement learning (DRL) for addressing RA problems. In this work, an overview of the major existing DRL approaches in CRAN is presented. We conclude this article by identifying current technical hurdles and potential future research directions.

Keywords—Deep Reinforcement Learning, 5G, Resource Allocation, Cloud RAN

I. INTRODUCTION

Recent years have witnessed a great evolution in mobile communications, which began in the 1980s with the first generation (1G), followed by 2G (1990), 3G (2002), 4G (2010) and the upcoming 5G [1]. The International Telecommunication Union Radiocommunication Sector (ITU-R) has standardized the ambitious 5G requirements, referred to as International Mobile Telecommunications 2020 (IMT-2020) [2], which encompass a 100 Mb/s user-experienced data rate, 1 ms latency, mobility up to 500 km/h, and backward compatibility with long term evolution (LTE)/LTE-A. Such design goals stem from the fact that total mobile data traffic is expected to increase significantly, to 69 exabytes per month in 2022 [3], due to the unprecedented growth of Internet of Things (IoT) devices. Under this premise, telecommunication operators clearly need to take into account both the costs of commoditization and the quality of service (QoS) for mobile users during the initial phase of 5G deployment.

In a traditional radio access network (RAN) deployment, each base station (BS) is physically attached to a fixed number of antennas, which handle baseband processing and radio functions within a small coverage area. Accommodating higher transmission rates means that a massive number of physical BSs must be installed. This, however, incurs high initial investment, site support, system management and setup costs, as well as wireless channel interference among users [4]. Cloud RAN (CRAN), a new paradigm for 5G communications, distributes a set of low-power antennas known as remote radio heads (RRHs) geographically at distinct locations within the coverage area [5], [6]. All RRHs are then connected to a centralized control and processing station known as the baseband unit (BBU) via a high-speed front-haul link. Consequently, RRHs are able to coordinate with each other and expand the cellular network coverage. This translates into favorable channel conditions and ultimately excellent QoS for all user equipment.

Inspired by these CRAN advantages, resource allocation (RA) for CRAN has been extensively investigated. Data rate-oriented optimization problems for multi-user CRAN were treated in [7] and [8]. The work in [9] studied the energy efficiency (EE) maximization problem of CRAN subject to individual antenna power constraints. It does not, however, consider any data service rate requirement, which is important for provisioning heterogeneous multimedia services. In [10], an EE maximization problem was formulated under constraints on per-antenna transmission power and proportional data rates among user equipment. The aforementioned RA schemes rely on numerical methods to optimize various performance metrics. Specifically, techniques such as the Charnes-Cooper transformation (CCT), Lagrange dual decomposition, parameterized convex programming, and bi-level optimization are utilized to reach optimality in every single time slot.

Most of the abovementioned RA works can be regarded as instantaneous, since the optimization decisions are derived from the current network state without considering past network states. This may lead to suboptimal results from the perspective of long-term network performance. For instance, pursuing instantaneous energy efficiency may lead to unnecessary switching of RRHs ON and OFF, which is associated with enormous power and timing overheads [11]. Such an issue exhibits the same flavor as the well-known ping-pong effect in
This work is supported by the Universiti Tunku Abdul Rahman under
UTARRF (IPSR/RMC/UTARRF/2017-C2/T08).
and ℵ ⊆ ℛ stand for the sets of active and inactive RRHs, respectively. Besides that, we take into account the transition power (from active/sleep to sleep/active status). Υ denotes the set of mode-transition RRHs in the current time slot, which is controlled by the BBU. Armed with the above framework, we can define the state and action spaces in the subsequent section.

III. DRL-BASED RA IN CRAN

Generally speaking, DRL consists of two phases, namely an offline DNN construction phase and an online deep Q-learning phase [16]. The DNN is adopted to estimate the correlation between each state-action pair (s, a) and its value function Q(s, a), which is the expected cumulative reward when the environment commences at state s and pursues action a. Q(s, a) can be formulated as:

Q(s, a) = E[ Σ_{t=0}^{∞} γ^t r_t | s_0 = s, a_0 = a ]        (4)

where r_t represents the reward achieved in time slot t, and γ ∈ (0, 1] is the discount factor which balances the immediate and future rewards.
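For concreteness, the following is a minimal illustrative sketch of this two-phase idea: a small DNN approximates Q(s, a) in (4) and is refined online with experience replay and a bootstrapped one-step target. The PyTorch realization, network sizes and hyper-parameters are our own assumptions and do not correspond to any specific surveyed work.

```python
# Minimal DQN sketch of (4): a DNN maps a state s to Q(s, a) for every discrete
# action a, trained on the bootstrapped target r + gamma * max_a' Q(s', a').
# Dimensions and hyper-parameters are placeholders, not taken from [19]-[24].
import random
from collections import deque

import torch
import torch.nn as nn

STATE_DIM, NUM_ACTIONS, GAMMA = 8, 4, 0.9        # assumed sizes for illustration

class QNetwork(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(STATE_DIM, 64), nn.ReLU(),
            nn.Linear(64, 64), nn.ReLU(),
            nn.Linear(64, NUM_ACTIONS))          # one Q-value per discrete action

    def forward(self, s):
        return self.net(s)

q_net, target_net = QNetwork(), QNetwork()
target_net.load_state_dict(q_net.state_dict())   # periodically refreshed copy
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)
# replay entries are (state, action, reward, next_state) tensors
replay = deque(maxlen=10_000)

def train_step(batch_size=32):
    """One online deep Q-learning update on a random minibatch of past transitions."""
    if len(replay) < batch_size:
        return
    s, a, r, s_next = map(torch.stack, zip(*random.sample(replay, batch_size)))
    q_sa = q_net(s).gather(1, a.long().view(-1, 1)).squeeze(1)     # Q(s, a) taken
    with torch.no_grad():
        target = r + GAMMA * target_net(s_next).max(dim=1).values  # one-step target
    loss = nn.functional.mse_loss(q_sa, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

In this reading, the offline phase corresponds to constructing and pre-training such a network, while the online phase repeatedly selects actions (e.g., ε-greedily), stores the observed transitions in the replay buffer and invokes the update step above.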
Equation (4) lies at the heart of most DRL-based RA schemes, where different system assumptions, objective functions and optimization variables dictate the specific definitions of s, a and r. Table I summarizes the existing related works.

In [19], an RL-based offloading strategy has been proposed to choose the RRH and the offloading rate based on the current battery level, the past data rate to each RRH and the estimated amount of harvested energy. The authors further accelerate the learning speed by using a convolutional neural network (CNN) to compress the state space. In [20], a double-DQN-based strategic computation offloading scheme has been designed for an ultra-dense sliced RAN. Furthermore, the double DQN is coupled with a Q-function decomposition approach. In [21], a DQN method which uses a DNN to predict the action-value function of Q-learning has been devised to manage the computational resource allocation and offloading decision.

Similar work can be found in [22], where the authors considered both the decoding error probability and the delay violation probability in order to support low-latency communications. In [23], a stepwise RA algorithm that minimizes the total power consumption of CRAN has been proposed. It relies on the combination of DQN and convex optimization to select which RRHs to turn ON and to allocate transmission power among these active RRHs. Such a low-complexity algorithm, however, may yield an infeasible solution if the number of active RRHs is too low. Furthermore, similar to [19]-[22], the training process does not operate in a self-supervised learning mode. The authors in [24] have addressed this issue by proposing a Monte Carlo Tree Search (MCTS) algorithm. In MCTS, beginning from a root state, the agent simulates trajectories into the future and evaluates the resulting rewards in order to select a favorable action. Besides that, the work in [24] has improved the traditional DNN by separating the last DNN layers into sub neural networks so as to accommodate a higher action dimension.

IV. RESEARCH DIRECTIONS AND OPEN ISSUES

We pinpoint issues that remain worthy of further investigation as well as promising future research directions.

• The prime challenge across all R&D efforts is the difficulty of reaching optimality for the DRL-based problem. A significant portion of this difficulty stems from the training process over a large state-action space. Therefore, an effective DRL-based RA scheme should be able to shrink the state-action space by using transfer learning. In this way, it can constantly absorb the features of newcomers and lessen random exploration at the early stage [17]. Stepwise design could be another efficient way to scale down complexity while approaching the optimal system performance. As demonstrated in [23], the continuous action space of dynamic power allocation has been effectively shifted from the MDP to convex optimization; a sketch of this decomposition is given below.
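As a concrete illustration of such a stepwise split (in the spirit of [23], but not its exact formulation), the discrete RRH on/off decision can come from a DQN while the remaining continuous power allocation is delegated to a convex solver. The helper name, rate model, channel gains, rate target and power caps below are illustrative assumptions only.

```python
# Stepwise decomposition sketch: the DQN outputs a binary on/off vector over RRHs,
# and a convex program then allocates transmit power among the active RRHs.
# The rate model and all numbers are placeholders, not the formulation of [23].
import numpy as np
import cvxpy as cp

def allocate_power(active_mask, gains, rate_target=4.0, p_max=1.0):
    """Minimize total transmit power of active RRHs subject to a sum-rate floor."""
    idx = np.flatnonzero(active_mask)            # RRHs switched ON by the DQN action
    if idx.size == 0:
        return None                              # no active RRH: clearly infeasible
    p = cp.Variable(idx.size, nonneg=True)
    rate = cp.sum(cp.log1p(cp.multiply(gains[idx], p)))   # concave sum-rate proxy
    problem = cp.Problem(cp.Minimize(cp.sum(p)),
                         [rate >= rate_target, p <= p_max])
    problem.solve()
    return p.value if problem.status == cp.OPTIMAL else None

# Example: a hypothetical DQN action for four RRHs, followed by convex power allocation.
action = np.array([1, 0, 1, 1])
powers = allocate_power(action, gains=np.array([0.9, 0.2, 0.6, 0.4]))
```

When the solver reports infeasibility, the returned None mirrors the infeasible-solution risk noted above for cases where too few RRHs are active.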
Table I. Comparison of Existing DRL-Based RA Algorithms

| Work | Learning Algorithm | Objective Function | Action Space | State Space |
|------|--------------------|--------------------|--------------|-------------|
| [19] | CNN + Q-learning | Latency | Offloading Rate & Computation Resource | Battery Level, Renewable Energy Generated in a Time Slot & Number of Potential Transmission Rates corresponding to Each Edge Device |
| [20] | Q-function decomposition + double DQN | Energy | Offloading Rate & Computation Resource | Task Queue State, Energy Queue State & Channel Qualities between UEs and RRHs |
| [21] | DNN + Q-learning | Sum Cost of Delay and Energy | Binary Offloading & Computation Resource | Computing Capability |
| [22] | DNN + Q-learning | Task Success Rate | Computation Resource | Waiting Time of the Tasks to be Processed at the Head of Buffers, Queue Length of the Buffers & CSI |
| [23] | DNN + Q-learning | Power | Binary On/Off & Power Adaptation | User Demand Rate & On/OFF of RRHs |
| [24] | MCTS + MLT | Latency & Energy | Communication Resource, Offloading Rate & Computation Resource | Computing Capability, Radio Bandwidth Resource State & Task Request State |
• Another issue seldom discussed in most existing DRL-based RA works is signaling overhead. From the implementation viewpoint, incorporating the signaling overhead into the RA problem would be beneficial. The signaling overhead is tightly connected with the accuracy of channel estimation. That is, when fast fading happens, more signaling must be exchanged so that the DRL agent can keep up with the CSI. It is still unclear how much performance degradation must be tolerated when imperfect CSI occurs. Therefore, an effective DRL-based RA scheme should be able to record and preserve historical observations, enabling the DRL agent to execute accurate CSI prediction given only partial observations. A recurrent neural network (RNN) such as the long short-term memory (LSTM) network could be one of the promising solutions, as sketched below.
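One possible realization of this idea (our own sketch, with illustrative feature sizes) is an LSTM that digests a window of past partial CSI observations and predicts the channel state of the next slot, which can then be appended to the DRL agent's state.

```python
# LSTM-based CSI predictor sketch: a history of noisy/partial CSI observations is
# mapped to an estimate of the next slot's CSI.  num_links, hidden size and the
# sequence length are illustrative assumptions only.
import torch
import torch.nn as nn

class CSIPredictor(nn.Module):
    def __init__(self, num_links=8, hidden=32):
        super().__init__()
        self.lstm = nn.LSTM(input_size=num_links, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, num_links)

    def forward(self, csi_history):                # shape: (batch, time, num_links)
        out, _ = self.lstm(csi_history)
        return self.head(out[:, -1, :])            # predicted CSI for the next slot

# Usage: train with an MSE loss against the CSI actually observed in the next slot.
model = CSIPredictor()
history = torch.randn(16, 10, 8)                   # 16 samples, 10 past slots, 8 links
predicted_csi = model(history)                     # shape: (16, 8)
```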
REFERENCES

[1] S. Lien, S. Shieh, Y. Huang, B. Su, Y. Hsu and H. Wei, "5G New Radio: Waveform, Frame Structure, Multiple Access, and Initial Access," IEEE Commun. Mag., vol. 55, no. 6, pp. 64-71, June 2017.
[2] J.-C. Guey et al., "On 5G Radio Access Architecture and Technology," IEEE Wireless Commun., vol. 22, no. 5, pp. 2-5, Oct. 2015.
[3] J. Wu, "Green wireless communications: from concept to reality [industry perspectives]," IEEE Wireless Commun., vol. 19, pp. 4-5, Aug. 2012.
[4] X. Wang, "C-RAN: The Road Towards Green RAN," China Commun. J., Jun. 2010.
[5] NGMN Alliance, 5G White Paper, Mar. 2015, [online] Available: https://www.ngmn.org/5g-white-paper/5g-white-paper.html.
[6] P. Rost et al., "Cloud technologies for flexible 5G radio access networks," IEEE Commun. Mag., vol. 52, no. 5, pp. 68-76, May 2014.
[7] V. D. Papoutsis and S. A. Kotsopoulos, "Chunk-based resource allocation in distributed MISO-OFDMA systems with fairness guarantee," IEEE Commun. Lett., vol. 15, no. 4, pp. 377-379, Apr. 2011.
[8] C. He, B. Sheng, P. Zhu, and X. You, "Energy efficiency and spectral efficiency tradeoff in downlink distributed antenna systems," IEEE Wireless Commun. Lett., vol. 1, no. 3, pp. 153-156, Jun. 2012.
[9] C. He, B. Sheng, P. Zhu, X. You, and G. Y. Li, "Energy- and spectral-efficiency tradeoff for distributed antenna systems with proportional fairness," IEEE J. Sel. Areas Commun., vol. 31, no. 5, pp. 894-902, May 2013.
[10] M.-L. Tham, S. F. Chien, D. W. Holtby, and S. Alimov, "Energy-efficient power allocation for distributed antenna systems with proportional fairness," IEEE Trans. Green Commun. Netw., vol. 1, no. 2, pp. 145-157, Jun. 2017.
[11] Z. Xu, Y. Wang, J. Tang, J. Wang, and M. C. Gursoy, "A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs," in Proc. IEEE ICC, pp. 1-6, May 2017.
[12] M. Tayyab, X. Gelabert and R. Jäntti, "A Survey on Handover Management: From LTE to NR," IEEE Access, vol. 7, pp. 118907-118930, 2019.
[13] C. M. Yen, C. J. Chang, and L. C. Wang, "A Utility-Based TMCR Scheduling Scheme for Downlink Multiuser MIMO-OFDMA Systems," IEEE Trans. Veh. Technol., vol. 59, no. 8, pp. 4105-4115, 2010.
[14] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, Cambridge, MA: MIT Press, 1998.
[15] M. Miozzo, L. Giupponi, M. Rossi, and P. Dini, "Distributed Q-Learning for Energy Harvesting Heterogeneous Networks," in IEEE ICC 2015 Workshop on Green Communications and Networks with Energy Harvesting, Smart Grids and Renewable Energies, 2015.
[16] V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, et al., "Human-level control through deep reinforcement learning," Nature, vol. 518, no. 7540, pp. 529-533, 2015.
[17] C. Zhang, P. Patras, and H. Haddadi, "Deep learning in mobile and wireless networking: A survey," 2018, [online] Available: https://arxiv.org/abs/1803.04311.
[18] B. Dai and W. Yu, "Energy efficiency of downlink transmission strategies for cloud radio access networks," IEEE J. Sel. Areas Commun., vol. 34, no. 4, pp. 1037-1050, 2016.
[19] M. Min, L. Xiao, Y. Chen et al., "Learning-based computation offloading for IoT devices with energy harvesting," IEEE Trans. Veh. Technol., vol. 68, no. 2, pp. 1930-1941, Feb. 2019.
[20] X. Chen, H. Zhang, C. Wu et al., "Optimized computation offloading performance in virtual edge computing systems via deep reinforcement learning," IEEE Internet Things J., vol. 6, no. 3, pp. 4005-4018, June 2019.
[21] J. Li, H. Gao, T. Lv et al., "Deep reinforcement learning based computation offloading and resource allocation for MEC," in Proc. IEEE WCNC, pp. 1-6, April 2018.
[22] T. Yang, Y. Hu, M. C. Gursoy et al., "Deep reinforcement learning based resource allocation in low latency edge computing networks," in Proc. ISWCS, pp. 1-5, Aug. 2018.
[23] Z. Xu, Y. Wang, J. Tang, J. Wang, and M. C. Gursoy, "A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs," in Proc. IEEE ICC, pp. 1-6, May 2017.
[24] J. Chen, S. Chen, Q. Wang, B. Cao, G. Feng and J. Hu, "iRAF: A Deep Reinforcement Learning Approach for Collaborative Mobile Edge Computing IoT Networks," IEEE Internet Things J., vol. 6, no. 4, pp. 7011-7024, Aug. 2019.