ARTÍCULO
TITULO

Pareto Optimal Solutions for Network Defense Strategy Selection Simulator in Multi-Objective Reinforcement Learning

Yang Sun    
Yun Li    
Wei Xiong    
Zhonghua Yao    
Krishna Moniz and Ahmed Zahir    

Resumen

Using Pareto optimization in Multi-Objective Reinforcement Learning (MORL) leads to better learning results for network defense games. This is particularly useful for network security agents, who must often balance several goals when choosing what action to take in defense of a network. If the defender knows his preferred reward distribution, the advantages of Pareto optimization can be retained by using a scalarization algorithm prior to the implementation of the MORL. In this paper, we simulate a network defense scenario by creating a multi-objective zero-sum game and using Pareto optimization and MORL to determine optimal solutions and compare those solutions to different scalarization approaches. We build a Pareto Defense Strategy Selection Simulator (PDSSS) system for assisting network administrators on decision-making, specifically, on defense strategy selection, and the experiment results show that the Satisficing Trade-Off Method (STOM) scalarization approach performs better than linear scalarization or GUESS method. The results of this paper can aid network security agents attempting to find an optimal defense policy for network security games.

 Artículos similares

       
 
Jafar Jafari-Asl, Seyed Arman Hashemi Monfared and Soroush Abolfathi    
This study investigates the optimal and safe operation of pumping stations in water distribution systems (WDSs) with the aim of reducing the environmental footprint of water conveyance processes. We introduced the nonlinear chaotic honey badger algorithm... ver más
Revista: Water

 
Na Wei, Yuxin Peng, Kunming Lu, Guixing Zhou, Xingtao Guo and Minghui Niu    
The parallel reservoirs in the upper reach of the Hanjiang River are key projects for watershed management, development, and protection. The optimal operation of parallel reservoirs is a multiple-stage, multiple-objective, and multiple-decision attribute... ver más
Revista: Applied Sciences

 
Lucilene Silva, Tomas Grönstedt, Carlos Xisto, Luiz Whitacker, Cleverson Bringhenti and Marcus Lejon    
The ratio between blade height and chord, named the aspect ratio (AR), plays an important role in compressor aerodynamic design. Once selected, it influences stage performance, blade losses and the stage stability margin. The choice of the design AR invo... ver más
Revista: Aerospace

 
Meng Ma, Zhirong Zhong, Zhi Zhai and Ruobin Sun    
There are hundreds of various sensors used for online Prognosis and Health Management (PHM) of LREs. Inspired by the fact that a limited number of key sensors are selected for inflight control purposes in LRE, it is practical to optimal placement of redu... ver más
Revista: Aerospace

 
Lu Sun, Bao Zhang, Ping Wang, Zhihong Gan, Pengpeng Han and Yijian Wang    
The process of intelligent multi-objective parametric optimization design for mirrors is discussed in detail in this paper, with the error of the mirror surface shape and the total mass being examined as the optimization objectives. The establishment of ... ver más
Revista: Applied Sciences