Redirigiendo al acceso original de articulo en 17 segundos...
Inicio  /  Algorithms  /  Vol: 17 Par: 1 (2024)  /  Artículo
ARTÍCULO
TITULO

Distributed Data-Driven Learning-Based Optimal Dynamic Resource Allocation for Multi-RIS-Assisted Multi-User Ad-Hoc Network

Yuzhu Zhang and Hao Xu    

Resumen

This study investigates the problem of decentralized dynamic resource allocation optimization for ad-hoc network communication with the support of reconfigurable intelligent surfaces (RIS), leveraging a reinforcement learning framework. In the present context of cellular networks, device-to-device (D2D) communication stands out as a promising technique to enhance the spectrum efficiency. Simultaneously, RIS have gained considerable attention due to their ability to enhance the quality of dynamic wireless networks by maximizing the spectrum efficiency without increasing the power consumption. However, prevalent centralized D2D transmission schemes require global information, leading to a significant signaling overhead. Conversely, existing distributed schemes, while avoiding the need for global information, often demand frequent information exchange among D2D users, falling short of achieving global optimization. This paper introduces a framework comprising an outer loop and inner loop. In the outer loop, decentralized dynamic resource allocation optimization has been developed for self-organizing network communication aided by RIS. This is accomplished through the application of a multi-player multi-armed bandit approach, completing strategies for RIS and resource block selection. Notably, these strategies operate without requiring signal interaction during execution. Meanwhile, in the inner loop, the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm has been adopted for cooperative learning with neural networks (NNs) to obtain optimal transmit power control and RIS phase shift control for multiple users, with a specified RIS and resource block selection policy from the outer loop. Through the utilization of optimization theory, distributed optimal resource allocation can be attained as the outer and inner reinforcement learning algorithms converge over time. Finally, a series of numerical simulations are presented to validate and illustrate the effectiveness of the proposed scheme.

 Artículos similares

       
 
Andrea Menapace, Ariele Zanfei, Manuel Felicetti, Diego Avesani, Maurizio Righetti and Rudy Gargano    
Developing data-driven models for bursts detection is currently a demanding challenge for efficient and sustainable management of water supply systems. The main limit in the progress of these models lies in the large amount of accurate data required. The... ver más
Revista: Applied Sciences

 
Rodrigo L. Rose, Tejas G. Puranik and Dimitri N. Mavris    
The complexity of commercial aviation operations has grown substantially in recent years, together with a diversification of techniques for collecting and analyzing flight data. As a result, data-driven frameworks for enhancing flight safety have grown i... ver más
Revista: Aerospace

 
Taoying Li, Miao Hua and Qian Yin    
The big data from various sensors installed on-board for monitoring the status of ship devices is very critical for improving the efficiency and safety of ship operations and reducing the cost of operation and maintenance. However, how to utilize these d... ver más
Revista: Information

 
Jianjin Wang, Peng Shi, Peng Jiang, Jianwei Hu, Simin Qu, Xingyu Chen, Yingbing Chen, Yunqiu Dai and Ziwei Xiao    
Flooding contributes to tremendous hazards every year; more accurate forecasting may significantly mitigate the damages and loss caused by flood disasters. Current hydrological models are either purely knowledge-based or data-driven. A combination of dat... ver más
Revista: Water

 
Christoph Reichert, Stefan Dürschmid, Rudolf Kruse and Hermann Hinrichs    
Brain?computer interfacing (BCI) is a promising technique for regaining communication and control in severely paralyzed people. Many BCI implementations are based on the recognition of task-specific event-related potentials (ERP) such as P300 responses. ... ver más
Revista: Computers