REVISTA
AI

   
Redirigiendo al acceso original de articulo en 19 segundos...
Inicio  /  AI  /  Vol: 3 Par: 2 (2022)  /  Artículo
ARTÍCULO
TITULO

Reinforcement Learning Your Way: Agent Characterization through Policy Regularization

Charl Maree and Christian Omlin    

Resumen

The increased complexity of state-of-the-art reinforcement learning (RL) algorithms has resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post hoc explainability methods that aim to extract information from learned policies, thus aiding explainability. These methods rely on empirical observations of the policy, and thus aim to generalize a characterization of agents? behaviour. In this study, we have instead developed a method to imbue agents? policies with a characteristic behaviour through regularization of their objective functions. Our method guides the agents? behaviour during learning, which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers? investment portfolios based on their spending personalities.

 Artículos similares

       
 
Ziyi Wang, Xinran Li, Luoyang Sun, Haifeng Zhang, Hualin Liu and Jun Wang    
Efficient yet sufficient exploration remains a critical challenge in reinforcement learning (RL), especially for Markov Decision Processes (MDPs) with vast action spaces. Previous approaches have commonly involved projecting the original action space int... ver más
Revista: Algorithms

 
Depeng Gao, Shuai Wang, Yuwei Yang, Haifei Zhang, Hao Chen, Xiangxiang Mei, Shuxi Chen and Jianlin Qiu    
Servo motors play an important role in automation equipment and have been used in several manufacturing fields. However, the commonly used control methods need their parameters to be set manually, which is rather difficult, and this means that these meth... ver más
Revista: Algorithms

 
Sungwon Moon, Seolwon Koo, Yujin Lim and Hyunjin Joo    
With recent technological advancements, the commercialization of autonomous vehicles (AVs) is expected to be realized soon. However, it is anticipated that a mixed traffic of AVs and human-driven vehicles (HVs) will persist for a considerable period unti... ver más
Revista: Applied Sciences

 
Siyao Lu, Rui Xu, Zhaoyu Li, Bang Wang and Zhijun Zhao    
The International Lunar Research Station, to be established around 2030, will equip lunar rovers with robotic arms as constructors. Construction requires lunar soil and lunar rovers, for which rovers must go toward different waypoints without encounterin... ver más
Revista: Aerospace

 
Yu-Hung Chang, Chien-Hung Liu and Shingchern D. You    
The dynamic flexible job-shop problem (DFJSP) is a realistic and challenging problem that many production plants face. As the product line becomes more complex, the machines may suddenly break down or resume service, so we need a dynamic scheduling frame... ver más
Revista: Information