|
|
|
Juyao Wei, Zhenggang Lu, Zheng Yin and Zhipeng Jing
This paper presents a novel data-driven multiagent reinforcement learning (MARL) controller for enhancing the running stability of independently rotating wheels (IRW) and reducing wheel?rail wear. We base our active guidance controller on the multiagent ...
ver más
|
|
|
|
|
|
|
Shao Xuan Seah and Sutthiphong Srigrarom
This paper explores the use of deep reinforcement learning in solving the multi-agent aircraft traffic planning (individual paths) and collision avoidance problem for a multiple UAS, such as that for a cargo drone network. Specifically, the Deep Q-Networ...
ver más
|
|
|
|
|
|
|
Wenhao Ma and Hongzhen Xu
Cloud computing has experienced rapid growth in recent years and has become a critical computing paradigm. Combining multiple cloud services to satisfy complex user requirements has become a research hotspot in cloud computing. Service composition in mul...
ver más
|
|
|
|
|
|
|
Xi Lyu, Yushan Sun, Lifeng Wang, Jiehui Tan and Liwen Zhang
This study aims to solve the problems of sparse reward, single policy, and poor environmental adaptability in the local motion planning task of autonomous underwater vehicles (AUVs). We propose a two-layer deep deterministic policy gradient algorithm-bas...
ver más
|
|
|
|
|
|
|
Yihan Niu, Feixiang Zhu, Moxuan Wei, Yifan Du and Pengyu Zhai
Maritime Autonomous Surface Ships (MASS) are becoming of interest to the maritime sector and are also on the agenda of the International Maritime Organization (IMO). With the boom in global maritime traffic, the number of ships is increasing rapidly. The...
ver más
|
|
|
|
|
|
|
Jue Ma, Dejun Ning, Chengyi Zhang and Shipeng Liu
Prioritized experience replay (PER) is an important technique in deep reinforcement learning (DRL). It improves the sampling efficiency of data in various DRL algorithms and achieves great performance. PER uses temporal difference error (TD-error) to mea...
ver más
|
|
|
|
|
|
|
Pengyu Zhai, Yingjun Zhang and Wang Shaobo
Ship collisions often result in huge losses of life, cargo and ships, as well as serious pollution of the water environment. Meanwhile, it is estimated that between 75% and 86% of maritime accidents are related to human factors. Thus, it is necessary to ...
ver más
|
|
|
|
|
|
|
Jiabao Yu, Jiawei Chen, Ying Chen, Zhiguo Zhou and Junwei Duan
Although broad reinforcement learning (BRL) provides a more intelligent autonomous decision-making method for the collision avoidance problem of unmanned surface vehicles (USVs), the algorithm still has the problem of over-estimation and has difficulty c...
ver más
|
|
|
|
|
|
|
Ajmery Sultana and Xavier Fernando
Recently, the growing demand of various emerging applications in the realms of sixth-generation (6G) wireless networks has made the term internet of Things (IoT) very popular. Device-to-device (D2D) communication has emerged as one of the significant ena...
ver más
|
|
|
|
|
|
|
Wenzel Pilar von Pilchau, Anthony Stein and Jörg Hähner
State-of-the-art Deep Reinforcement Learning Algorithms such as DQN and DDPG use the concept of a replay buffer called Experience Replay. The default usage contains only the experiences that have been gathered over the runtime. We propose a method called...
ver más
|
|
|
|